Word picture: Download results in a tabular format #200

@janiemi

It would be nice to also be able to download the word picture results in a tabular format (CSV, TSV and/or XLSX), which might be more accessible to non-technical users than JSON. It would also be consistent with the download options for the KWIC, statistics and trend diagram results.

What would be a good format for the downloadable table? What about having the following columns:

  1. dependency relation
  2. head lemgram or word form
  3. head part of speech
  4. dependent lemgram or word form
  5. dependent part of speech
  6. LMI value
  7. absolute frequency
  8. source corpora
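
To make the proposed columns concrete, here is a minimal Python sketch of one possible row layout and a TSV serializer. It is only an illustration, not existing code: the column names, the `WordPictureRow` class and the `rows_to_tsv` function are hypothetical, the two-decimal LMI formatting is an arbitrary choice, and the source corpora are written as a space-separated list of corpus ids, which is just one of the options under discussion.

```python
import csv
import io
from dataclasses import dataclass

# Hypothetical column names mirroring the eight proposed columns.
COLUMNS = [
    "relation", "head", "head_pos", "dependent", "dependent_pos",
    "lmi", "frequency", "corpora",
]


@dataclass
class WordPictureRow:
    relation: str        # dependency relation
    head: str            # head lemgram or word form
    head_pos: str        # head part of speech
    dependent: str       # dependent lemgram or word form
    dependent_pos: str   # dependent part of speech
    lmi: float           # LMI value
    frequency: int       # absolute frequency
    corpora: list        # source corpus ids (assumed to be strings)


def rows_to_tsv(rows):
    """Serialize rows to a TSV string with a single header line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(COLUMNS)
    for row in rows:
        writer.writerow([
            row.relation, row.head, row.head_pos,
            row.dependent, row.dependent_pos,
            f"{row.lmi:.2f}",        # assumption: two decimals are enough
            row.frequency,
            " ".join(row.corpora),   # assumption: space-separated corpus ids
        ])
    return buffer.getvalue()


# Invented example values, for illustration only.
example = WordPictureRow(
    "SS", "jaga..vb.1", "VB", "katt..nn.1", "NN", 12.5, 7,
    ["corpus_a", "corpus_b"],
)
print(rows_to_tsv([example]))
```

Every row has the same eight fields, so the same rows could be passed to a CSV writer by changing only the delimiter.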

Another option might be to have searched and related lemgrams or word forms instead of head and dependent ones. In that case, the searched lemgram or word form would be the same for all rows in a single table, but I think it would be better to have a uniform format for all rows than a separate first row containing only the searched lemgram or word form. However, having head and dependent would correspond more closely to the JSON format.

Each subtable in the word picture would correspond to a group of rows with the same dependency relation. The downloadable table could be sorted by the dependency relation and LMI value, so the order would be the same as in the word picture.
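
The sort order described above could be sketched as follows. This is a hedged illustration rather than an existing implementation: the function name `sort_like_word_picture`, the dict keys `relation` and `lmi`, and the `relation_order` parameter are all hypothetical. It also assumes that rows within a subtable are shown by descending LMI value and that the subtable order is supplied explicitly, since an alphabetical sort of relation names would not necessarily match the displayed order.

```python
def sort_like_word_picture(rows, relation_order):
    """Group rows by dependency relation, then order by descending LMI.

    rows: list of dicts with at least the keys "relation" and "lmi".
    relation_order: relation names in the order the subtables are shown
    (an assumed input; relations not listed are placed last).
    """
    position = {rel: i for i, rel in enumerate(relation_order)}
    return sorted(
        rows,
        key=lambda r: (position.get(r["relation"], len(position)), -r["lmi"]),
    )


# Invented example values, for illustration only.
rows = [
    {"relation": "OBJ", "dependent": "a", "lmi": 1.0},
    {"relation": "SS", "dependent": "b", "lmi": 2.0},
    {"relation": "SS", "dependent": "c", "lmi": 5.0},
]
for row in sort_like_word_picture(rows, ["SS", "OBJ"]):
    print(row["relation"], row["dependent"], row["lmi"])
```

Because Python's `sorted` is stable, rows with the same relation and LMI value keep their input order.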

Source corpora could be a space- or comma-separated list of either corpus ids or corpus titles; which would be more useful? The source sentence id might not be very useful for a typical user.
