implement wildcard pattern for input #59

yucongalicechen · 2024-05-15T01:32:06Z

Initial commit
Related to Issue Implement input function to handle skipped files and check ouputs and header #52, examples are here user input files UCs #48
I skipped file_list files as usual. Shall we allow the user to specify something like "file*"?

sbillinge

very nicely done. Just one small typo and one suggestion in the input.

I also see some copy-pasted code which makes me wonder if it might be better to simply look for the wildcard and expand it before passing into this code block. Kind of like a prefilter that does something like

for input_name in args.input:
     if '*" in name 
             do the globbing and find the list of files
             add the glob list to args.inputs and remove the "*" name

then again,

for input_name in args.inputs:
    code unchanged from before

which would be easier to maintain.

sbillinge · 2024-05-15T10:57:02Z

src/diffpy/labpdfproc/labpdfprocapp.py

-        "file_list.txt that can be found in the folder ./data).",
+        "file_list.txt that can be found in the folder ./data). "
+        "Wildcard character (*) is accepted. Examples include './*chi'"
+        " (load all files with .chi extension) and 'data/test*' (load "


maybe change to, ./*.chi to be closer to that description.

What happens if the wild-card expands to files and directories? We should tell the user what happens then.

…les for help message

yucongalicechen · 2024-05-15T17:02:13Z

I added a function expand_wildcard_file to take care of wildcard patterns and return the expanded list of inputs. Shall we put the two expand functions into set_input_lists for convenience? So that in the labpdfproc.py and tests, we only need to call for set_input_lists.
I wrote an error message for wildcard patterns that cannot match any files. But if the user specifies an invalid wildcard pattern like "**", this error message might not be that helpful as the original error message from Python (as it would tell user that it is an invalid wildcard pattern). I currently cannot think of a way to distinguish between these two situations, unless checking that if wildcard pattern is one of the invalid cases, but there are many invalid cases to check..

sbillinge

please see inline comments.

src/diffpy/labpdfproc/labpdfprocapp.py

src/diffpy/labpdfproc/tools.py

sbillinge · 2024-05-15T19:15:31Z

re your comment, I agree that it might be good to put the two pre-filters into the expand_user_input function. Perhaps we shoud also make them private functions (add an underscore at the beginning) which makes the code more readable. It is nice that we have independent tests for them, no need to undo that.

…cessary test cases

sbillinge

we may need a test for a user putting a wildcard in the file-list file. I think your code may currently fail this test, but let's make the test and see.

Almost there, see a few last cleaning edits, and also I think my request to not support wildcards on directories may have gotten lost in those long comments on the last review, bu tplease remove support for wildcards on directories.

src/diffpy/labpdfproc/labpdfprocapp.py

src/diffpy/labpdfproc/tests/test_tools.py

src/diffpy/labpdfproc/tools.py

…rently failing

sbillinge

nice work. I could merge it like this, but there are a couple of final comments.

sbillinge · 2024-05-16T02:57:44Z

src/diffpy/labpdfproc/tools.py

@@ -48,6 +48,11 @@ def expand_list_file(args):
            file_inputs = [input_name.strip() for input_name in f.readlines()]
        args.input.extend(file_inputs)
        args.input.remove(file_list_input)
+    wildcard_inputs = [input_name for input_name in args.input if "*" in input_name]
+    for wildcard_input in wildcard_inputs:
+        input_files = [str(file) for file in Path(".").glob(wildcard_input) if "file_list" not in file.name]


I think we may be able to remove if file_list not in file.name because file_list files have been removed already.

I think this is for files in the glob directory (not in args.input), so if we have a file list in the same directory as the wildcard, then it'll be loaded if we don't skip it.

I see, yes better to have it to be on the safe side. It may be better to make it stricter so if file.name == file_list.txt but it is probably ok as it is.

src/diffpy/labpdfproc/tests/conftest.py

yucongalicechen added 2 commits May 14, 2024 21:17

initial commit on implementing a wildcard pattern for input

fc3a570

relaxed wildcard check condition

a02a085

sbillinge reviewed May 15, 2024

View reviewed changes

added pre-filter for wildcard patterns and tests, included more examp…

f0ffd0f

…les for help message

sbillinge reviewed May 15, 2024

View reviewed changes

src/diffpy/labpdfproc/labpdfprocapp.py Outdated Show resolved Hide resolved

src/diffpy/labpdfproc/tools.py Outdated Show resolved Hide resolved

added pre-filters to expand_user_input private function, removed unne…

0c19e3d

…cessary test cases

sbillinge reviewed May 15, 2024

View reviewed changes

yucongalicechen added 2 commits May 15, 2024 19:12

removed more unnecessary tests and added test for file-list file, cur…

0124e43

…rently failing

added test case of a wildcard in a file-list file

2fe6286

sbillinge reviewed May 16, 2024

View reviewed changes

edited wildcard in file-list file

bfa3d8e

sbillinge merged commit 5c3eed7 into diffpy:main May 16, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

implement wildcard pattern for input #59

implement wildcard pattern for input #59

Uh oh!

yucongalicechen commented May 15, 2024

Uh oh!

sbillinge left a comment

Uh oh!

sbillinge May 15, 2024

Uh oh!

yucongalicechen commented May 15, 2024

Uh oh!

sbillinge left a comment

Uh oh!

Uh oh!

Uh oh!

sbillinge commented May 15, 2024

Uh oh!

sbillinge left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sbillinge left a comment

Uh oh!

sbillinge May 16, 2024

Uh oh!

yucongalicechen May 16, 2024

Uh oh!

sbillinge May 16, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

implement wildcard pattern for input #59

implement wildcard pattern for input #59

Uh oh!

Conversation

yucongalicechen commented May 15, 2024

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

sbillinge May 15, 2024

Choose a reason for hiding this comment

Uh oh!

yucongalicechen commented May 15, 2024

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sbillinge commented May 15, 2024

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sbillinge left a comment

Choose a reason for hiding this comment

Uh oh!

sbillinge May 16, 2024

Choose a reason for hiding this comment

Uh oh!

yucongalicechen May 16, 2024

Choose a reason for hiding this comment

Uh oh!

sbillinge May 16, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!