When using Windows, and where the path to the target directory has been defined e.g. base_dir = Path(r”D:\Python\Real Python...\Lesson 6”), then s this is a WindowsPath I found two options.

Using glob.glob(os.path.join(base_dir, “backup”)) gives the full path for each file e.g. “D:\Python\Real Python...\Lesson 6\data_01_backup.txt”
To avoid that and just generate the file names matching the pattern, consider the use of glob.glob(“backup”, root_dir=base_dir) which then produces the desired list of just the file names e.g. [‘data_01_backup.txt’, ‘data_02_backup.txt’, ‘data_03_backup.txt’].

Is that an acceptable approach within the context of the specification for glob.glob()? Or is there a better way to get just the file names matching the pattern in the target directory?

tonypy on March 12, 2023

One other observation. Using

glob.glob("**/*.py", root_dir=base_dir, recursive=True)

in the example given produces [‘admin.py’, ‘tests.py’, ‘sub_dir\file1.py’, ‘sub_dir\file2.py’]. Is there an easy way to tidy this list up so that the directory separators are either ‘' or ‘/’?

tonypy on March 13, 2023

Regarding the comment above. This should have read that the example given produces

[‘admin.py’, ‘tests.py’, ‘sub_dir\\file1.py’, ‘sub_dir\\file2.py’]

Is there an easy way to tidy this list up so that the directory separators are either ‘’ or ‘/’?

tonypy on March 13, 2023

One final question regarding pathlib. Using the example I can get file names using

[file.name for file in base_dir.glob("**/*.py")]

The result is

['admin.py', 'tests.py', 'file1.py', 'file2.py']

What I can’t see is a structure to get the equivalent of glob.glob() which gives the result relative to the defined reference path which in this case is the directory ‘Lesson 6’. That would give the result

[‘admin.py’, ‘tests.py’, ‘sub_dir\file1.py’, ‘sub_dir\file2.py’]

Any suggestions?

tonypy on March 13, 2023

Following on from above, I did determine that the following works

for pyfile in base_dir.glob("**/*.py"):
    pyfile_rel = os.path.relpath(pyfile, base_dir)
    print(pyfile_rel)

Where base_dir = Path(r“D:\Python\Real Python…\Lesson 6”) The output is then as expected, although not in a list

admin.py
tests.py
sub_dir\file1.py
sub_dir\file2.py

However, not very elegant. Any ideas on improving?

Martin Breuss RP Team on March 14, 2023

@tonypy hi, nice research! :D

I’m not on a Windows machine to check for path representation of glob.glob(), but you see the double-backslash characters because Python needs to escape backslash characters. So, in a normal string that’s the way they’ll show up.

pathlib solves this issue by a layer of abstraction around paths. When you work with pathlib, then a path isn’t a Python string, but a Path object instead. That gives you a lot of additional possibilities.

Two things that I wanted to pick up from your previous comments:

Recursive Search with .rglob()

You can make recursive search even more clear when you work with Path objects by using .rglob("*"):

>>> [file.name for file in base_dir.rglob("*.py")]
['admin.py', 'tests.py', 'file1.py', 'file2.py']

If you use .rglob() instead of .glob(), then you can omit the **/ part of the pattern. The method specifically does a recursive search.

Relative Paths with pathlib

You can achieve the same behavior that you’re looking for from glob.glob() also with pathlib, using .relative_to():

>>> [pyfile.relative_to(base_dir) for pyfile in base_dir.rglob("*.py")]
[PosixPath('admin.py'),
 PosixPath('tests.py'),
 PosixPath('sub_dir/file2.py'),
 PosixPath('sub_dir/file1.py')]

And if you wanted to show only the string representation of these Path objects, then you could wrap them into str():

>>> [str(pyfile.relative_to(base_dir)) for pyfile in base_dir.rglob("*.py")]
['admin.py', 'tests.py', 'sub_dir/file2.py', 'sub_dir/file1.py']

Hope that helps! If you enjoy pathlib (I do!), then you can check out the following resources we have on the site:

tonypy on March 14, 2023

Martin,

Many thanks for your feedback and suggestions, they were very useful. I did have realpython.com/courses/pathlib-python/ bookmarked so will be taking that soon.

anaghost on May 9, 2023

it would be nice to add how to deal with shutil being unable to delete dirs when there are permissions issues.

Become a Member to join the conversation.