1. https://appdevelopermagazine.com/big-data
  2. https://appdevelopermagazine.com/a-serverless-search-engine-for-data-scientists-and-developers-emerges/
3/22/2019 9:03:59 AM
A serverless search engine for Data Scientists and Developers emerges
Data Scientist,Rockset,Cloud Data,Analytics Engine
/Developers-Using-Complex-Data-Sets-App-Developer-Magazine_0khp0yx7.jpg
App Developer Magazine
A serverless search engine for Data Scientists and Developers emerges

Big Data

A serverless search engine for Data Scientists and Developers emerges


Friday, March 22, 2019

Richard Harris Richard Harris

Serverless search and analytics engines, such as Rockset can ingest data in all popular semi-structured formats and automatically turn them into fast SQL tables.

Rockset announced the general availability of a cloud service that allows developers and data scientists to put complex data sets to use in minutes instead of weeks.

Current trends in technology  -  smart devices, digital lending, fraud detection, omnichannel retail, machine learning, microservices, and real-time business dashboards - all require clean data from a multitude of sources. However, businesses are struggling with tons of high-value, low-quality data in fragmented systems like data lakes, NoSQL databases, and data streams. Increasingly, data comes from many different sources and tends to be machine data in JSON, CSV, XML and Parquet formats or business data like XLSX and PDF. Rockset empowers teams to serve data-intensive applications using a wide range of data formats from multiple sources, without having to model schemas or manage fragile extract-transform-load (ETL) pipelines.

“Modern data sets are messy and unstructured. Traditional data management systems force teams to spend more than 50 percent of their time just preparing and loading these data sets,” said Venkat Venkataramani, co-founder and CEO of Rockset. “What's the point of collecting new data if you cannot use it immediately and iterate quickly? Rockset has reimagined the stack in a new developer-friendly way, with fast SQL on raw data -- so you can go from data to applications in a few minutes.”

Rockset is a serverless data backend that continuously ingests raw data as it is generated and delivers real-time SQL queries at scale. Its patent-pending Converged Indexing, which combines an inverted index, columnar index, and document index, is optimized for key-value, time-series, document, search, aggregation and graph type queries out of the box. It is backed by RocksDB, which combined with its Cloud Native Distributed Query Engine, delivers the performance and scale necessary for real-time operational applications and interactive data science.

This release delivers new functionalities including:

  • Continuous schema-less ingest from data lakes, databases and streams including Apache Kafka, Amazon Kinesis, Amazon DynamoDB, Amazon S3, and Google Cloud Storage
  • Smart schemas for JSON, XML, CSV, Parquet, Excel, and PDF
  • SQL, including JOINs, with 1,000+ QPS
  • SDKs for Java, Python, Node.js, Golang and REST API
  • Delivered as-a-service with self-serve access

Developer and Data Scientist Communities Embrace Rockset

“We need to carefully monitor our growth in real-time. Is a certain product suddenly selling more? Is there a fraudulent transaction? We easily generate 20-30 million events per day, all captured in Kafka streams. Our applications query the data every few seconds. By sending our raw event data directly from Kafka to Rockset, we save a lot of time and energy. We track over 40 metrics in real time and constantly take immediate actions.” says Amboj Goyal, principal engineer at Fynd, a leader in online-to-offline retail with a fashion e-commerce portal that helps millions of customers find fashion from local stores.

“We receive data from many different sources in many different formats - each with different delimiters, headers and escape characters. It is time-consuming to load the data to multiple systems, individually infer the schema and handle malformed data. We also deal with constantly changing schemas as new versions of the data are received. Rockset has the potential to free our data scientists to run ad-hoc SQL queries on raw data while minimizing ETL work and maintenance.”  says Alex Izydorczyk, head of data science at Coatue, a technology sector hedge fund - known for its data prowess — that invests in public and private equity markets.






Subscribe to App Developer Magazine

Become a subscriber of App Developer Magazine for just $5.99 a month and take advantage of all these perks.

MEMBERS GET ACCESS TO

  • - Exclusive content from leaders in the industry
  • - Q&A articles from industry leaders
  • - Tips and tricks from the most successful developers weekly
  • - Monthly issues, including all 90+ back-issues since 2012
  • - Event discounts and early-bird signups
  • - Gain insight from top achievers in the app store
  • - Learn what tools to use, what SDK's to use, and more

    Subscribe here



Featured Stories


Tether QVAC SDK Powers AI Across Devices and Platforms
Tether QVAC SDK Powers AI Across Devices and Platforms Wednesday, April 22, 2026


APAC 5G expansion to fuel 347B mobile market by 2030
APAC 5G expansion to fuel 347B mobile market by 2030 Tuesday, April 21, 2026


How AI is causing app litter everywhere
How AI is causing app litter everywhere Tuesday, April 21, 2026




The App Economy Is Thriving
The App Economy Is Thriving Monday, April 20, 2026


NIKKE 3.5 anniversary update livestream coming soon
NIKKE 3.5 anniversary update livestream coming soon Friday, April 17, 2026


New AI tool targets early dementia detection
New AI tool targets early dementia detection Thursday, April 16, 2026


Jentic launch gives AI agents api access
Jentic launch gives AI agents api access Wednesday, April 15, 2026


Experts warn ai-generated health content risks misinterpretation without human oversight
Experts warn ai-generated health content risks misinterpretation without human oversight Wednesday, April 15, 2026


Ludo.ai Unveils API and MCP Beta to Power AI Game Asset Pipelines
Ludo.ai Unveils API and MCP Beta to Power AI Game Asset Pipelines Tuesday, April 14, 2026


AccuWeather Launches ChatGPT Integration for Live Weather Updates
AccuWeather Launches ChatGPT Integration for Live Weather Updates Tuesday, April 14, 2026


Stay Updated

Sign up for our newsletter for the headlines delivered to you

SuccessFull SignUp

Get More App News



/sites/themes/prod/assets/js/less.js"> ' ' %>