Skip to content
This repository has been archived by the owner on Nov 22, 2024. It is now read-only.

EMA NLP #26

Open
wants to merge 21 commits into
base: main
Choose a base branch
from
Open

EMA NLP #26

wants to merge 21 commits into from

Conversation

Firverior02
Copy link
Collaborator

This branch includes the full process of natural language processing (NLP).

  • A selection of leaflets have been collected
  • The leaflets have been retrieved in PDF format
  • The Leaflets have been pared and converted to TXT
  • All documents provided in both English and Swedish have been used
  • The documents have been segmented and validated to make sure SV-EN pairs are formed (not perfect)
  • The sentences have been used to further train the Helsinki translation model

@Firverior02 Firverior02 added the enhancement New feature or request label Apr 3, 2024
@Firverior02
Copy link
Collaborator Author

There are still many bugs, but for the time being it should be good enough

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
Status: Review
Development

Successfully merging this pull request may close these issues.

2 participants