September 4, 2023
Compute-Compute Separation
↯ One big idea to spark your curiosity
In first-generation database architectures, including Elasticsearch and Druid, clusters contain the compute and memory for both streaming ingestion and queries, causing compute contention. Elasticsearch attempted to address compute contention by creating dedicated ingest nodes to transform and enrich the document, but this happens before indexing, which still occurs on data nodes alongside queries. Indexing and compaction are compute-intensive, and putting those workloads on every data node negatively impacts query performance.
In contrast, Rockset enables multiple virtual instances for compute isolation. Rockset places compute-intensive ingest operations, including indexing and handling updates, on the streaming ingest virtual instance and then uses a RocksDB CDC log to send the updates, inserts, and deletes to query virtual instances. As a result, Rockset is now the only real-time analytics database to isolate streaming ingest from query compute without needing to create replicas.
A Breakthrough Architecture for Real-Time Analytics — An Overview of Compute-Compute Separation in Rockset, Rockset
Funding
↯ Developer-friendly companies that announced new funding rounds
Agave, provider of a unified API to access data across all the construction software platforms like Procore, BIM 360, and QuickBooks, raised $3m in Seed funding.
Qualiti, a platform for automatically creating and maintaining application test coverage using generative AI, raised $6.5m in Seed funding.
Ivy, an open banking platform building APIs for instant bank payments, raised $20m in Series A funding.
Rockset, a real-time analytics database which enables queries on massive, semi-structured data, raised $44m in funding.
AI121 Labs, creator of AI21 Studio that provides API access to Jurassic-2 and Task-Specific language models, raised $155m in Series C funding.