[sample] filtering to consensus sample #159

Open
tijyojwad opened this issue Dec 14, 2020 · 0 comments

Right now the BAM-based interface of the consensus pileup generator encodes all reads in the input files. It would be useful to be able to limit encoding to certain regions.

For example, there could be regions to avoid (such as regions with known variation or a high error rate).
There could also be specific regions to use for training, validation, and testing.

Regions can be passed using a BED file (the parser already exists in VW).
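
To make the idea concrete, here is a minimal sketch of region-based filtering, assuming plain 3-column BED input with 0-based, half-open intervals. The helpers `load_bed_regions` and `overlaps_any` are hypothetical names used for illustration only; they are not the existing VW parser, which should be preferred in the real implementation.

```python
from collections import defaultdict


def load_bed_regions(lines):
    """Parse BED lines into {chrom: [(start, end), ...]}, sorted by start.

    Hypothetical helper for illustration, not the existing VW parser.
    BED intervals are 0-based and half-open: [start, end).
    """
    regions = defaultdict(list)
    for line in lines:
        fields = line.split()
        # Skip blank lines, comments, and BED header lines.
        if not fields or fields[0].startswith("#") or fields[0] in ("track", "browser"):
            continue
        chrom, start, end = fields[:3]
        regions[chrom].append((int(start), int(end)))
    return {chrom: sorted(intervals) for chrom, intervals in regions.items()}


def overlaps_any(regions, chrom, start, end):
    """Return True if [start, end) overlaps any region on chrom."""
    return any(s < end and start < e for s, e in regions.get(chrom, []))


# Example: a read spanning [150, 250) overlaps the first region,
# while one starting exactly at the region end (200) does not.
example = load_bed_regions(["chr1\t100\t200", "chr1\t500\t600"])
hit = overlaps_any(example, "chr1", 150, 250)
miss = overlaps_any(example, "chr1", 200, 300)
```

A linear scan per read is fine for a handful of regions; for large BED files an interval tree or a sorted-sweep approach would likely be a better fit.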

So I think the tool's interface could have the following options, each taking a BED file as input. We only need to support this for the case where a BAM file is the input (not where directories are the input).

--train-regions
--val-regions
--test-regions
--ignore-regions
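
As a rough sketch of how these four options might be declared, assuming the tool uses Python's standard `argparse` module (the issue does not say which parser the tool uses, and `build_parser` is a hypothetical name):

```python
import argparse


def build_parser():
    """Build a parser with the four proposed BED-file options.

    Hypothetical sketch: the real tool's parser and its other
    options are not shown in this issue.
    """
    parser = argparse.ArgumentParser(
        description="Sketch of region options for BAM-based encoding."
    )
    options = [
        ("train", "regions to encode for training"),
        ("val", "regions to encode for validation"),
        ("test", "regions to encode for testing"),
        ("ignore", "regions to exclude from encoding"),
    ]
    for name, purpose in options:
        parser.add_argument(
            f"--{name}-regions",
            metavar="BED",
            default=None,  # None means no region restriction of this kind
            help=f"BED file listing {purpose}",
        )
    return parser


# argparse maps "--train-regions" to the attribute "train_regions".
args = build_parser().parse_args(
    ["--train-regions", "train.bed", "--ignore-regions", "bad.bed"]
)
```

Leaving each option optional with a default of `None` would keep the current behaviour (encode all reads) when no BED file is given.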
