-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
creating function main #4
base: main
Are you sure you want to change the base?
Changes from all commits
36fce4d
2ffed0c
42dac41
dfa4910
b982382
6d8be90
6fb2879
07cd3ec
1be46c2
32bf269
e527a19
865251e
8f6b94e
a174475
8b26f4f
a024bb5
1331805
7d4c97f
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,64 @@ | ||
import numpy as np | ||
import argparse | ||
from pathlib import Path | ||
|
||
from diffpy.snmf.io import load_input_signals, initialize_variables | ||
from diffpy.snmf.subroutines import update_weights_matrix, get_residual_matrix, objective_function | ||
|
||
|
||
def create_parser(): | ||
parser = argparse.ArgumentParser( | ||
prog="stretched_nmf", | ||
description="Stretched Nonnegative Matrix Factorization" | ||
) | ||
|
||
parser.add_argument('-i', '--input-directory', type=str, default=None, | ||
help="Directory containing experimental data. Defaults to current working directory.") | ||
parser.add_argument('-o', '--output-directory', type=str, | ||
help="The directory where the results will be written. Defaults to '<input_directory>/snmf_results'.") | ||
parser.add_argument('-t', '--data-type', type=str, choices=['powder_diffraction', 'pdf'], | ||
help="The type of the experimental data.") | ||
parser.add_argument('-l', '--lift', type=float, default=1, | ||
help="The factor that determines how much the data is lifted. By default, the data will be vertically translated to make the minimum value 0.") | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. how about "The lifting factor. Data will be lifted by lifted data = data - min(data)*lift. Default is 1" btw, it occurs to me that if the min(data) is positive, the "lift" will "lower" the data to zero if lift = 1 and will lower it to below zero if lift > 1. This is probably undesirable behavior. Make sure you have tests for this eventuality that give the behavior you want. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I will make this changes. The same thought occurred to me. If the min(data) is positive, it's not apparent to me that the data would need to be lifted. My idea is that if the minimum is positive, then do nothing to the data. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That is reasonable, though why not allow the user to lift the data if she wants to? Maybe make sure the lift is a lift, so it adds the absolute value of min(data)*lift to the data, something like that. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Ok. |
||
parser.add_argument('components', type=int, | ||
sbillinge marked this conversation as resolved.
Show resolved
Hide resolved
|
||
help="The number of component signals for the NMF decomposition. Must be an integer greater than 0.") | ||
parser.add_argument('-v', '--version', action='version', help='Print the software version number') | ||
args = parser.parse_args() | ||
return args | ||
|
||
|
||
def main(): | ||
args = create_parser() | ||
if args.input_directory is None: | ||
args.input_directory = Path.cwd() | ||
|
||
grid, data_input, data_type = load_input_signals(args.input_directory) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think it is maybe clearer to put the default logic here that turns input_directory from None to cwd. Or else, make it an optional argument in load_input_data. Again, it is about code readability. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I will do this. |
||
if args.data_type: | ||
data_type = args.data_type | ||
variables = initialize_variables(data_input, args.components, data_type) | ||
|
||
if variables["data_type"] == 'pdf': | ||
sbillinge marked this conversation as resolved.
Show resolved
Hide resolved
|
||
lifted_data = data_input - np.ndarray.min(data_input[:]) # Will later use the lift_data function in subroutines | ||
|
||
maxiter = 300 | ||
|
||
for out_iter in range(maxiter): | ||
variables["weights_matrix"] = update_weights_matrix(variables["number_of_components"], | ||
variables["signal_length"], | ||
variables["stretching_factor_matrix"], | ||
variables["component_matrix"], lifted_data, | ||
variables["number_of_moments"], variables["weights_matrix"], | ||
None) | ||
|
||
residual_matrix = get_residual_matrix(variables["component_matrix"], variables["weights_matrix"], | ||
variables["stretching_factor_matrix"], lifted_data, | ||
variables["number_of_moments"], variables["number_of_components"], | ||
variables["signal_length"]) | ||
fun = objective_function(residual_matrix, variables["stretching_factor_matrix"], variables["smoothness"], | ||
variables["smoothness_term"], variables["component_matrix"], variables["sparsity"]) | ||
|
||
return 1 | ||
|
||
|
||
if __name__ == "__main__": | ||
main() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think a comment line would help here with the intent. I.e., why do we ignore and not handle if the data are on a different grid. If I remember correctly t here was a physics reason (mathematically it should be straightforward). Just recording for future people.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I will add a comment line to explain this