Rust port of the famous Python package: a rule-based sentence segmenter (splitter) and word tokenizer that uses orthographic features.

xamgore/segtok

segtok

A rule-based sentence segmenter (splitter) and word tokenizer that uses orthographic features. Ported from the Python package of the same purpose, which is no longer maintained; this port also fixes a bug in the handling of contractions.
