[Enhancement] Implement Concurrency to Optimize Processing Time for Large File Operations #33

pradhanhitesh · 2025-01-15T05:58:39Z

Hello, devs! I frequently use LST-AI to extract WMH for large-scale cohort studies, processing 500+ files every 3 months. Currently, LST-AI lacks built-in support for batch processing. I've developed a custom script using ProcessPoolExecutor to enable parallel CPU processing. It would be fantastic to integrate this feature into LST-AI. Thoughts?

Also, compile-stats feature can be added to LST-AI to create a single .csv file compiling all the segmented volumes, subject-wise.

The text was updated successfully, but these errors were encountered:

twiltgen · 2025-01-15T08:08:16Z

Hi @pradhanhitesh,

It’s great to hear that you’re using LST-AI frequently! 😊 You’re absolutely right—LST-AI was designed to work with individual images and doesn’t currently support batch processing. However, you can specify the number of threads to use for registration. If you’d like, feel free to create a new branch with the integrated batch processing update and the stats file.

Since we also handle large datasets, I created a repository for processing BIDS-structured databases (https://github.com/twiltgen/LST-AI_BIDS). It assumes a BIDS-compliant structure, so it won’t work if the database isn’t set up that way. If you’re interested, feel free to check it out—it might overlap with what you’ve done.

As for the segmented volumes, the latest LST-AI updates introduced some new features, including generation of image-wise CSV files with lesion statistics. It’s possible this overlaps with what you’ve compiled.

We really appreciate your contributions and would love to see your proposed updates for LST-AI. 🙌

pradhanhitesh · 2025-01-15T08:33:00Z

Thanks for the reply, @twiltgen! I noticed similar functionality in the LST-AI_BIDS repo. I'll create a branch and add my changes—let me know if this feature fits the current LST-AI repo. Also, I use a Python script to compile stats from subject folders into a single annotated WMH volumes dataframe.

twiltgen · 2025-01-15T08:40:15Z

Sounds great, we'll have a look at the changes when you've created the branch. In my LST-AI_BIDS repo there is also a script called "collect_volumes.py" which gathers the lesion data and generates a single csv file. If you want (and if it is easier for you), you can simply integrate your updates regarding the stats in there.

pradhanhitesh linked a pull request Jan 16, 2025 that will close this issue

Added concurrency using lst-parallel.py #34

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Enhancement] Implement Concurrency to Optimize Processing Time for Large File Operations #33

[Enhancement] Implement Concurrency to Optimize Processing Time for Large File Operations #33

pradhanhitesh commented Jan 15, 2025 •

edited

Loading

twiltgen commented Jan 15, 2025

Uh oh!

pradhanhitesh commented Jan 15, 2025

Uh oh!

twiltgen commented Jan 15, 2025

Uh oh!

[Enhancement] Implement Concurrency to Optimize Processing Time for Large File Operations #33

[Enhancement] Implement Concurrency to Optimize Processing Time for Large File Operations #33

Comments

pradhanhitesh commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

twiltgen commented Jan 15, 2025

Uh oh!

pradhanhitesh commented Jan 15, 2025

Uh oh!

twiltgen commented Jan 15, 2025

Uh oh!

pradhanhitesh commented Jan 15, 2025 •

edited

Loading