diff --git a/README.md b/README.md
index ed6eb3e6..b3669179 100644
--- a/README.md
+++ b/README.md
@@ -1,6 +1,9 @@
 # Charabia
 Library used by Meilisearch to tokenize queries and documents
 
+## Why Did We Choose This Name?
+We choose the name of this repository because it starts with char which was done in the same mood as discord or meili, giving the name of the problem we want to solve.
+
 ## Role
 
 The tokenizer’s role is to take a sentence or phrase and split it into smaller units of language, called tokens. It finds and retrieves all the words in a string based on the language’s particularities.
@@ -72,4 +75,4 @@ let mut segments = orig.segment_str();
 assert_eq!(segments.next(), Some("The"));
 assert_eq!(segments.next(), Some(" "));
 assert_eq!(segments.next(), Some("quick"));
-```
\ No newline at end of file
+```