Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

VikParuchuri / surya Public

Notifications You must be signed in to change notification settings
Fork 1.1k
Star 16.8k

Code
Issues 114
Pull requests 9
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: VikParuchuri/surya

Releases Tags

Releases · VikParuchuri/surya

Minor bugfixes

08 Oct 16:34

VikParuchuri

v0.6.1

986677b

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Minor bugfixes

Small bugfix after the table recognition release

Assets 2

All reactions

Table recognition model release!

08 Oct 16:11

VikParuchuri

v0.6.0

a87dede

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Table recognition model release!

Add a new table recognition model that detects rows/columns and cells
Add benchmarks for accuracy and speed (seems to be very accurate wrt to current state of the art open model)
Improve memory efficiency of layout and text detection (hopefully no more memory leaks)
Improve resolution handling for layout/text detection/ocr, which should improve accuracy quite a bit

Assets 2

cthulhu-tww, daboe01, akshayrakate085, and pramjana reacted with thumbs up emoji

All reactions

👍 4 reactions

4 people reacted

OCR v2

16 Aug 17:40

VikParuchuri

v0.5.0

8d5affa

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

OCR v2

A new version of the OCR model with a custom architecture.

20% faster
Automatic language detection, with support for optional language hints
Better accuracy on old/noisy documents
Basic english handwriting support (to be improved soon)

Assets 2

israelsaba, m7mdhka, 596050, lithium0003, driscoll42, moritzwilksch, marquaye, jamesfeigenbaum, kelechi-c, socratic-irony, and 11 more reacted with rocket emoji

All reactions

🚀 21 reactions

21 people reacted

Faster text detection + layout

12 Jul 16:06

VikParuchuri

v0.4.15

03b859e

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Faster text detection + layout

Switched model architecture for the text detection and layout models:

30% faster on GPU
4x faster on CPU
12x faster on MPS (M series macs)

Accuracy should be about the same, or slightly better, from my benchmarks.

Assets 2

socratic-irony, quythanh, styrowolf, xiaominghero, ZhengRui, VipinVIP, shividhar, tomcotter7, ksxkq, abclution, and harsha20032020 reacted with hooray emoji

ZhengRui, ashwanthkumar, shividhar, mateusnobre, shwu-nyunai, harsha20032020, and azizahtas reacted with heart emoji

All reactions

🎉 11 reactions
❤️ 7 reactions

15 people reacted

v0.4.14: Merge pull request #141 from VikParuchuri/dev

30 Jun 14:39

VikParuchuri

v0.4.14

f7c6c04

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

v0.4.14: Merge pull request #141 from VikParuchuri/dev

New transformers version added a new kwarg to donut embeddings. This now handles and ignores that kwarg, and also slightly future-proofs in case this happens again.

Assets 2

kelechi-c and MassChargeRatio reacted with rocket emoji

All reactions

🚀 2 reactions

2 people reacted

Minor bugfixes

28 May 21:44

VikParuchuri

v0.4.12

c5f5e77

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Minor bugfixes

Fix rotation and copy bugs

Assets 2

All reactions

Fix image bugs

28 May 21:16

VikParuchuri

v0.4.11

53135d0

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Fix image bugs

Fix bugs with RGBA images
Fix assert bug
Add back in thumbnail method for resizing
Slightly optimize segformer code

Assets 2

All reactions

Change image resize

28 May 02:55

VikParuchuri

v0.4.10

d167369

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Change image resize

Image resize from cv2 to PIL - cv2 caused benchmark regressions

Assets 2

kelechi-c reacted with rocket emoji

All reactions

🚀 1 reaction

1 person reacted

OCR speedups

27 May 21:56

VikParuchuri

v0.4.9

31e36e7

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

OCR speedups

Speed up base OCR model ~15-20%, and reduce memory usage by ~25% (can do higher batch sizes)
Add static cache for compilation - torch.compile will result in another 15% speedup
Other optimizations, like faster image resizing
Bugfixes, like enabling different length language inputs for OCR (batching different docs with different languages together)

Assets 2

651961, kelechi-c, Josephrp, david-nikolai-mueller, and Gbillington1 reacted with heart emoji

All reactions

❤️ 5 reactions

5 people reacted

Processor improvements

23 May 23:12

VikParuchuri

v0.4.8

80889bd

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Processor improvements

Remove unneeded format conversions
Fix bug in OCR, where only one color channel was used for OCR - results should be better now
Speed up layout/text detection a bit

Assets 2

mesutde, kelechi-c, hopez13, Jamalianpour, and hyotaime reacted with thumbs up emoji

All reactions

👍 5 reactions

5 people reacted

Previous 1 2 3 4 5 6 Next

Previous Next

Footer

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.