|
2 | 2 |
|
3 | 3 | This repository holds a collection of Jython ETL (extract-transform-load) scripts that are used at QBiC that define the behaviour of openBIS dropboxes. |
4 | 4 | The ETL processes combine some quality control measures for incoming data and data transformation to facilitate the registration in openBIS. |
| 5 | + |
| 6 | +## Data format guidelines |
| 7 | + |
| 8 | +These guidelines describe the necessary file structure for different |
| 9 | +data types to be met in order to ingest and register them correctly in |
| 10 | +openBIS. |
| 11 | + |
| 12 | +Formats: |
| 13 | + |
| 14 | +- [NGS single-end / paired-end data](#ngs-single-end--paired-end-data) |
| 15 | + |
| 16 | +### NGS single-end / paired-end data |
| 17 | + |
| 18 | +**Responsible dropbox:** |
| 19 | +[QBiC-register-fastq-dropbox](drop-boxes/register-fastq-dropbox) |
| 20 | + |
| 21 | +**Resulting data model in openBIS** |
| 22 | +Q_TEST_SAMPLE -> Q_NGS_RAW_DATA (with sample code) -> DataSet (directory |
| 23 | +with files contained) |
| 24 | + |
| 25 | +**Description** |
| 26 | +For paired-end sequencing reads in FASTQ format, the file structure |
| 27 | +needs to look like this |
| 28 | + |
| 29 | +``` |
| 30 | +<QBIC sample code>.fastq // Directory |
| 31 | + |-- <QBIC sample code>_R1.fastq |
| 32 | + |-- <QBIC sample code>_R1.fastq.sha256sum |
| 33 | + |-- <QBIC sample code>_R2.fastq |
| 34 | + |-- <QBIC sample code>_R2.fastq.sha256sum |
| 35 | +``` |
| 36 | + |
| 37 | +or in the case of gzipped FASTQ files: |
| 38 | + |
| 39 | +``` |
| 40 | +<QBIC sample code>.fastq.gz // Directory |
| 41 | + |-- <QBIC sample code>_R1.fastq.gz |
| 42 | + |-- <QBIC sample code>_R1.fastq.gz.sha256sum |
| 43 | + |-- <QBIC sample code>_R2.fastq.gz |
| 44 | + |-- <QBIC sample code>_R2.fastq.gz.sha256sum |
| 45 | +``` |
| 46 | + |
| 47 | +In the case of single-end sequencing data, the file structure needs to |
| 48 | +look like this: |
| 49 | + |
| 50 | +``` |
| 51 | +<QBIC sample code>.fastq.gz // Directory |
| 52 | + |-- <QBIC sample code>.fastq.gz |
| 53 | + |-- <QBIC sample code>.fastq.gz.sha256sum |
| 54 | +``` |
| 55 | + |
| 56 | + |
0 commit comments