There are numerous companies of various sizes in production with Druid. This list is incomplete and we hope to add everyone's use cases in the near future.
At Alibaba Search Group, we use Druid for real-time analytics of users' interaction with its popular e-commerce site.
Druid is the major player in the real-time analytics pipeline at Appsflyer, and it serves as customer facing analytics database for dashboard reporting.
Druid is used to power dynamic analytics and charting in Archive-It Reports. Reports help Archive-It partners understand what content they captured, why some content did not get captured, and assists with quality assurance and crawl scoping to ensure they are building the highest quality collections.
Cisco uses Druid to power a real-time analytics platform for network flow data.
Criteo is using druid to provide its customers with user-centric analytics & reporting. With more than 1 billion unique users reached per month, 3 billion ads displayed per day, and a 70% growth in 2014, Criteo's previous stack was hard pressed keeping with the load. Replacing it with druid helped us achieved linear scalability while letting our customers explore their data in new and interactive ways.
Didi Chuxing is the world’s largest mobile transportation platform, offering a full range of commuting options to 400 cities in China. Didi uses Druid as a core component of our real-time bigdata processing pipeline. Druid powers Didi's real-time monitor system which has hundreds of key business metrics. We are deep impressed by Druid's fast aggregations and sub-second OLAP queries. With Druid, we can get insights from data in real-time.
DripStat uses Druid as a timeseries database to power the DripStat GUI. DripStat ingests transactional data for Java, Scala, and Groovy applications into Druid.
Easemob Technologies, Inc. is the leading provider of open IM platform, cloud computing customer services. We enable PaaS services for Instant Messaging and SaaS services for Mobile Customer Service.
eBay uses Druid to aggregate multiple data streams for real-time user behavior analytics by ingesting up at a very high rate(over 100,000 events/sec), with the ability to query or aggregate data by any random combination of dimensions, and support over 100 concurrent queries without impacting ingest rate and query latencies.
Druid powers a dashboard used internally to visualize real-time analytics on GumGum's Real Time Bidding platform. GumGum runs Druid on Amazon EC2 and can ingest up to 300,000 events per second at peak time.
At Hulu, we use Druid to power our analytics platform that enables us to interactively deep dive into the behaviors of our users and applications in real-time.
Imply uses Druid to power public demos and to power our internal monitoring and metrics.
Italiaonline exploits Druid for Internet trends and analytics management inside its new Data-Driven Contents Management System. Italiaonline is the first Italian internet company, with the two most visited web portals, Libero and Virgilio, and the most used email service of the country @libero.it. Italiaonline features 16.8 million unique users per month*, 4.8 billion impressions per month**, 10.2 million active email accounts** and a 58% active reach*.
* Source: Audiweb View, powered by Nielsen, TDA 2H 2015; ** Internal data, December 2015
Jolata leverages Druid as the analytics data store for the realtime network perfomance managment platform. Injesting over 35 billion events per day, Jolata calculates a billion metrics every minute to visualize precise network metrics in real-time, and enable operators to quickly drill down and perform root cause analysis.
LDMobile is a mobile DSP for the RTB. We use Druid to aggregate some metrics in order to propose to our customers a real-time dashboard showing performance indicators of their campaigns.
LifeBuzz is a popular web property that serves tens of millions of pageviews per day. We use Druid for all our advanced analytics needs, including in-house analysis and professional realtime analytics for our sponsored media partners.
LiquidM uses Druid for real-time drill-down reporting. LiquidM is also contributing back to the community by creating and maintaining a ruby client library for interacting with Druid located at http://github.com/liquidm/ruby-druid.
Druid is the primary data store for Metamarkets’ full stack visual analytics service for the RTB (real time bidding) space. Ingesting over 30 billion events per day, Metamarkets is able to provide insight to its customers using complex ad-hoc queries at a 95th percentile query time of around 1 second.
Druid is a critical component in Monetate's personalization platform, where it acts as the serving layer of a lambda architecture. As such, Druid powers numerous real-time dashboards that provide marketers valuable insights into campaign performance and customer behavior.
N3TWORK uses Druid for real-time analysis of its Internet of Interests social entertainment network. It uses Druid analytics both to optimize user experiences and to guide the evolution of its product.
Netflix engineers use Druid to aggregate multiple data streams, ingesting up to two terabytes per hour, with the ability to query data as its being ingested. They use Druid to pinpoint anomalies within their infrastructure, endpoint activity and content flow.
OneAPM http://oneapm.com is an IT service compmay focusing on Application Performance Management (APM). In OneAPM, Druid is used to power clients' interactive queries on performance data collected from their applications in realtime.
Optimizely uses Druid to power the results dashboard for Optimizely Personalization. Druid enables Optimizely to provide our customers with in-depth, customizable metrics in real time, allowing them to monitor, test and improve their Personalization campaigns with greater ease and flexibility than before.
The Druid production deployment at PayPal processes a very large volume of data and is used for internal exploratory analytics by business analytic teams. Here is what they have to say:
Around early Feb, 2014, the Paypal Tracking Platform team, lead by Suresh Kumar, stumbled upon an article talking about a new upcoming kid in Real Time Analytics world. After first glance it seemed just like any other new cool looking technology. But after reading little deeper into the papers(they had referred) and few blogs, it was clear it is different. The fundamental approach to query the data itself looked very different and refreshing.
Coincidently, at the same time, the team was struggling to create a very high volume real-time data query system. We had already explored Drill, Hive, Cassandra, TSDB, Shark etc. Dating back at least a year, none of these technologies were fulfilling our low latency needs for very high volumes of data.
So, as an option we started the Druid prototype and within couple of weeks it was looking like a very promising alternate. Very soon with great help from Core Druid development team our prototype was doing great.
We then started the prototype with large 7-10 billion records and see the response time for query. It was quite amazing.
Today our Druid implementation in PayPal processes a very large volume of Data and is used for our internal exploratory analytics by business analytic teams.
The thing we liked the most was amazing support provided by core Druid team. I have never seen a Open Source Community providing such a very high level of responsiveness for ANY issue related to Druid setup and tuning.
PubNative uses Druid for its real-time reports and analysis of millions of daily ad views, clicks, conversions and other events.
redBorder is an open source, scale out, cybersecurity analytics platform based on Druid. We hope its full-blown web interface, dashboard and report systems, and ready-to-use real-time pipeline foster other Druid users to create a strong community around it. To see more, please visit redborder.org
Weibo UVE(Unified Value Evaluation) team of Advertising Platform is using Druid as the realtime analysis tool of the data insight system, which processing billions events everyday.
SK Telecom is the leading telecommunication and platform solution company. Druid enable us to discover the business insight interactively from telecommunication, manufacturing big data.
Skyport Systems provides zero-effort, low-touch secure servers that help organizations to rapidly deploy and compartmentalize security-critical workloads. We use Druid as part of our analytics backend to provide real-time insight to our customers about their workload behavior.
Smyte provides an API and UI for detecting and blocking bad actors on the internet. Druid powers the analytics portion of our user interface providing insight into what users are doing on the website, and specifically which features are unique between different sets of users.
Streamlyzer uses Druid as a next generation online video analytics for online video companies or publishers. Streamlyzer is gathering information from real end-users of our customers and provides visualized real-time analytics in dashboard showing how video contents are delivered and how end-users are experiencing the streaming service.
TWC uses Druid for exploratory analytics.
TripleLift uses Druid to provide insights into performance aspects of its native programmatic exchange for sales/business development opportunities, and to provide reporting used by advertisers and publishers.
At VideoAmp, Druid is a key component of our Big Data stack. It powers our real-time video advertising analytics at low granularity and huge scale. Druid has helped us minimized the time between event, insight and action.
Vigiglobe turns the noise of Social Media into real-time Smart Content. To this end, Druid enables us to maintain high request throughput coupled with huge data absorption capacity.
ViralGains uses Druid for real-time analysis of millions of viral video views, shares, and conversations.
Xiaomi uses Druid as an analytics tool to analyze online advertising data.
Yahoo uses Druid to power various customer-facing audience and advertising analytics products.
YeahMobi uses Druid to power a dashboard used for ad-tech analytics such as impression and conversion tracking, unique IP statistics, and aggregating metrics such as costs and revenues.
Youku Tudou employs Druid for real-time advertising analysis of huge volumes of data.