A third, coordinated release of Bio2RDF datasets and endpoints will focus on
- large datasets that were not included in release 2 (genbank, refseq, pdb, pubmed)
- incorporation of the Linking Open Drug Data (LODD) datasets - see http://www.w3.org/wiki/HCLSIG/LODD/Data
- finalization and use of the Life Science Registry - https://docs.google.com/spreadsheet/ccc?key=0AmzqhEUDpIPvdFR0UFhDUTZJdnNYdnJwdHdvNVlJR1E#gid=0
- explicit provenance and support for other SPARQL endpoints (e.g. uniprot SPARQL endpoint)
- add dataset versions where possible
- see status of individual scripts here: https://docs.google.com/spreadsheet/ccc?key=0AmzqhEUDpIPvdEpiZEJ2cHpGT1YyaERNOG81ZXJRRmc&usp=sharing

The release will be available as: gzipped n-triple files and virtuoso databases - http://download.bio2rdf.org
No due date•75/75 issues closed

A second, coordinated release of Bio2RDF datasets and endpoints will focus on data that can be regenerated. Datasets that cannot be regenerated, either because the data is no longer available or because the scripts have not been updated, will not be included in the official release but will remain available as independent endpoints. In other cases, datasets are available from a centralized source, and hence individual endpoints will be deprecated. A list of datasets for the release is available here: https://docs.google.com/spreadsheet/ccc?key=0AnGgKfZdJasrdElfQzRWWWhKUFR0UnRpeG14NGZRS2c#gid=4

The release will include:
1. downloads: gzipped n-triple files and virtuoso databases - http://download.bio2rdf.org
2. access: faceted and iri ranked virtuoso endpoints - listed from http://bio2rdf.org
3. provenance and licensing: each data item will be linked to a bio2rdf dataset, which will be related to the source data. Datasets will be linked to their license terms where available.
Due by December 1, 2012•1/1 issues closed
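
Both releases above are distributed as gzipped n-triple files. As a rough illustration of how such a file could be consumed, here is a minimal Python sketch that streams triples from a gzipped N-Triples file using only the standard library. The function names, the simplified regular expression, and any file path passed in are assumptions made for this example; they are not part of the Bio2RDF scripts, and a full RDF parser would be a better choice for production use.

```python
import gzip
import re
from typing import Iterable, Iterator, Tuple

# Simplified N-Triples pattern (an assumption for this sketch, not a full
# implementation of the grammar): an IRI or blank-node subject, an IRI
# predicate, and an object kept as raw text (IRI, blank node, or literal).
TRIPLE_RE = re.compile(r'^(<[^>]*>|_:\S+)\s+(<[^>]*>)\s+(.+?)\s*\.$')


def iter_triples(lines: Iterable[str]) -> Iterator[Tuple[str, str, str]]:
    """Yield (subject, predicate, object) from an iterable of N-Triples lines."""
    for line in lines:
        line = line.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith('#'):
            continue
        match = TRIPLE_RE.match(line)
        if match:
            yield match.group(1), match.group(2), match.group(3)


def read_gzipped_ntriples(path: str) -> Iterator[Tuple[str, str, str]]:
    """Stream triples from a gzipped N-Triples file without loading it whole."""
    with gzip.open(path, 'rt', encoding='utf-8') as handle:
        yield from iter_triples(handle)
```

Streaming line by line keeps memory use flat, which matters for the larger datasets mentioned above. Lines that do not match the simplified pattern are silently skipped in this sketch; a stricter reader might log or raise on them instead.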