Skip to content

feat(search): harden ingestion cleanup and chunking#20

Merged
lurkshark merged 1 commit intomainfrom
lowertown
Mar 6, 2026
Merged

feat(search): harden ingestion cleanup and chunking#20
lurkshark merged 1 commit intomainfrom
lowertown

Conversation

@lurkshark
Copy link
Owner

Add a shared cleanup pipeline for URL and feed ingestion, preserve outbound links from cleaned source content, and enforce hard-bounded chunk emission with fallback splitting.\n\nAlso adds coverage for noisy content handling and archives the completed OpenSpec change.

Add a shared cleanup pipeline for URL and feed ingestion, preserve outbound links from cleaned source content, and enforce hard-bounded chunk emission with fallback splitting.\n\nAlso adds coverage for noisy content handling and archives the completed OpenSpec change.
@lurkshark lurkshark merged commit 4b727a9 into main Mar 6, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant