Explain data flow of self-hosted Sentry's architecture #3585

aldy505 · 2025-02-24T04:30:04Z

Problem Statement

We need a step by step description of what data goes where and last time I checked we kinda lacked that. It's hard to know which container you have to debug if something isn't ingesting, since we have dozens of Kafka queues and processing services, and I know some of the services fees data back from Kafka into Kafka.

And yes, there is a rough outline with a chart somewhere, but it's not really specific enough at this point

Solution Brainstorm

No response

aldy505 · 2025-03-31T08:30:36Z

Question: What do you think would best to explain this, since a diagram would probably result in duplicated info with https://develop.sentry.dev/application-architecture/overview/

Probably it's better to explain what each container does, instead of the event flow?

....or probably... just update this frequently? https://github.com/getsentry/event-ingestion-graph -- and make it more detailed, since I don't think Relay only publish to "ingest-events" topic.

BYK · 2025-03-31T10:34:44Z

@hubertdeng123, thoughts? (or just tag other people? Maybe @untitaker or @markstory)

markstory · 2025-03-31T15:48:03Z

I agree we could do a better job on documenting how the various consumers and tasks interact for ingestion. The event-ingestion-graph diagram/document might be a good place to add the additional context of which topics and consumer pods are involved.

hubertdeng123 · 2025-03-31T18:35:27Z

IMO, if we want to explain the data flow here of Sentry's architecture in a diagram we should do so in terms of groups of containers, otherwise the diagram could get pretty overwhelming. Ingest consumers, snuba consumers, post-process-forwarders, etc. Like Mark mentioned above, mapping of topic to consumers would be really useful context to add

aldy505 · 2025-04-01T01:00:55Z

The event-ingestion-graph diagram/document might be a good place to add the additional context of which topics and consumer pods are involved.

@markstory Would the better idea is to move the event-ingestion-graph into the sentry-docs (dev section) instead? That way, it'll get noticed relatively quick by the employee if they know some parts of the ingestion pipeline is changing.

IMO, if we want to explain the data flow here of Sentry's architecture in a diagram we should do so in terms of groups of containers, otherwise the diagram could get pretty overwhelming. Ingest consumers, snuba consumers, post-process-forwarders, etc.

@hubertdeng123 I agree with this one.

aldy505 · 2025-04-07T11:17:37Z

Saw @markstory's thumb of approval. I'll work on this sometime next week.

github-project-automation bot added this to Self-hosted Sentry Feb 24, 2025

aldy505 added the Category: Docs label Feb 24, 2025

aldy505 mentioned this issue Mar 3, 2025

docs(self-hosted): move configuration guide outside of index getsentry/sentry-docs#12891

Merged

3 tasks

aldy505 self-assigned this Mar 31, 2025

getsantry bot added the Waiting for: Product Owner label Mar 31, 2025

getsantry bot added this to GitHub Issues with 👀 3 Mar 31, 2025

getsantry bot moved this to Waiting for: Product Owner in GitHub Issues with 👀 3 Mar 31, 2025

getsantry bot removed the Waiting for: Product Owner label Mar 31, 2025

getsantry bot removed the status in GitHub Issues with 👀 3 Mar 31, 2025

aldy505 linked a pull request May 17, 2025 that will close this issue

docs(self-hosted): explain self-hosted data flow getsentry/sentry-docs#13745

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Explain data flow of self-hosted Sentry's architecture #3585

Explain data flow of self-hosted Sentry's architecture #3585

aldy505 commented Feb 24, 2025

aldy505 commented Mar 31, 2025 •

edited

Loading

BYK commented Mar 31, 2025

markstory commented Mar 31, 2025

hubertdeng123 commented Mar 31, 2025

aldy505 commented Apr 1, 2025

aldy505 commented Apr 7, 2025

Explain data flow of self-hosted Sentry's architecture #3585

Explain data flow of self-hosted Sentry's architecture #3585

Comments

aldy505 commented Feb 24, 2025

Problem Statement

Solution Brainstorm

aldy505 commented Mar 31, 2025 • edited Loading

BYK commented Mar 31, 2025

markstory commented Mar 31, 2025

hubertdeng123 commented Mar 31, 2025

aldy505 commented Apr 1, 2025

aldy505 commented Apr 7, 2025

aldy505 commented Mar 31, 2025 •

edited

Loading