-
-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Explain data flow of self-hosted Sentry's architecture #3585
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Question: What do you think would best to explain this, since a diagram would probably result in duplicated info with https://develop.sentry.dev/application-architecture/overview/ Probably it's better to explain what each container does, instead of the event flow? ....or probably... just update this frequently? https://github.com/getsentry/event-ingestion-graph -- and make it more detailed, since I don't think Relay only publish to "ingest-events" topic. |
@hubertdeng123, thoughts? (or just tag other people? Maybe @untitaker or @markstory) |
I agree we could do a better job on documenting how the various consumers and tasks interact for ingestion. The event-ingestion-graph diagram/document might be a good place to add the additional context of which topics and consumer pods are involved. |
IMO, if we want to explain the data flow here of Sentry's architecture in a diagram we should do so in terms of groups of containers, otherwise the diagram could get pretty overwhelming. Ingest consumers, snuba consumers, post-process-forwarders, etc. Like Mark mentioned above, mapping of topic to consumers would be really useful context to add |
@markstory Would the better idea is to move the event-ingestion-graph into the sentry-docs (dev section) instead? That way, it'll get noticed relatively quick by the employee if they know some parts of the ingestion pipeline is changing.
@hubertdeng123 I agree with this one. |
Saw @markstory's thumb of approval. I'll work on this sometime next week. |
Problem Statement
From @kanadaj on Discord:
We need a step by step description of what data goes where and last time I checked we kinda lacked that. It's hard to know which container you have to debug if something isn't ingesting, since we have dozens of Kafka queues and processing services, and I know some of the services fees data back from Kafka into Kafka.
And yes, there is a rough outline with a chart somewhere, but it's not really specific enough at this point
Solution Brainstorm
No response
The text was updated successfully, but these errors were encountered: