Create a psake task to generate the dictionaries #2

abhi18av · 2022-07-18T11:42:16Z

I noticed that the https://github.com/Mxrcon/BioNameGenerator/blob/main/BioNameGenerator/Databases/Generate-Databases.ps1 is basically a small task to generate the sqlite database from the TSV files.

I was wondering whether it makes sense to

Add a task to download these TSV files into a folder: This way the sources would be self-documenting
Add another task to generate the database

Nothing urgent, but something nice to have as it'll guide the collaborators.

Mxrcon · 2022-07-18T12:54:16Z

I completely agree Abhinav, I was worried about the TSV's outside the repository, I don't want to distribute the tsv's file within the package, but for development propose I can add the on the root folder of the repository and them create a psake task as you mentioned.

Mxrcon · 2022-07-19T15:05:48Z

Hey H@abhi18av, The tsv sources are actually modified from different web sources, and some of them had inconsistencies that I had to solve manually, like names with u instead of ü. and cleanups on some unnecessary compound names like Iron-II vs Iron-III.

What do you think about adding TSV files on the root folder of this repository on this structure:

Dictionaries/
├── Generate-Databases.ps1
└── TsvDictionaries
    ├── 28kAdjectives.tsv
    ├── 5kColors.tsv
    ├── Aminoacids.tsv
    ├── Animals.tsv
    ├── BacterialGeneras.tsv
    ├── BacterialSpecies.tsv
    ├── BiologicalBooks.tsv
    ├── BrazilianScientists.tsv
    ├── ChemicalCompounds.tsv
    ├── Colors.tsv
    ├── ComputationKeywords.tsv
    ├── Dictionaries.db
    ├── FieldsWinners.tsv
    ├── LaboratoryKeywords.tsv
    ├── MetalsAndAlloys.tsv
    ├── NF-Adjectives.tsv
    ├── NF-Names.tsv
    ├── NobelLaureates.tsv
    ├── NucleicAcids.tsv
    ├── PeriodicTableElements.tsv
    └── RPGKeywords.tsv

And this Script would be called by the psake task in order to generate a database, and we would have 2 tasks:

Generate database (to use all TSVs and generate a sqlite database)
Update database (Generate a new database and update the Bionamegenerator/Databases/Dictionaries.db

Unfortunately I'm not sure that I'll be able to write a pwsh script able to completely reproduce the process on getting wikipedia pages and them formating them to the TSV's.

Kindly, Davi

Mxrcon self-assigned this Jul 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create a psake task to generate the dictionaries #2

Create a psake task to generate the dictionaries #2

abhi18av commented Jul 18, 2022

Mxrcon commented Jul 18, 2022

Mxrcon commented Jul 19, 2022

Create a psake task to generate the dictionaries #2

Create a psake task to generate the dictionaries #2

Comments

abhi18av commented Jul 18, 2022

Mxrcon commented Jul 18, 2022

Mxrcon commented Jul 19, 2022