The ML and Infrastructure Architecture Behind striff.io
A walkthrough of the async Kafka-staged pipeline, Triton-based inference serving, and degradation hierarchy that powers striff.io’s architectural review system. Covers why the pipeline moved from synchronous to event-driven, how three independent Kafka worker tiers decouple graph construction, GNN scoring, and LLM annotation, the distributed systems problems that Triton separation introduces, and the three-tier degradation strategy…