s o l u t i o n s g u i d e
Splunk® for Big Data
Turn massive machine data into real-time operational intelligence
Making Machine-generated Data Accessible,
Usable and Valuable to Everyone
Splunk Enterprise is the leading platform for collecting,
analyzing and visualizing machine data. It provides a unified way
to organize and extract real-time insights from massive amounts
of machine data from virtually any source. This includes data
from websites, business applications, social media platforms,
app servers, hypervisors, sensors, traditional databases and
open source data stores.
Once your data is in Splunk, you can search, monitor, report and
analyze it, no matter how unstructured, large or diverse it may
be. Splunk software gives you real-time understanding of what’s
happening and deep analysis of what’s happened, driving new
levels of visibility and insight. This is called operational intelligence.
Enterprise-scale big data. Splunk software scales to collect and
index tens of terabytes of data per day, across multi-geography,
multi-datacenter and hybrid cloud infrastructures. Because the
insights from your data are mission-critical, Splunk provides
the resilience you need, even as you scale out your low-cost,
distributed computing environment.
Robust platform for developing big data apps. Developer
teams will find a whole host of ways to leverage Splunk and
maximize enterprise technology investments. Built-in SDKs for
JavaScript and JSON with additional downloadable SDKs for
Java, Python, PHP, C# and Ruby make it easy to customize and
extend the power of Splunk.
Powerful connectivity. Most organizations maintain a diverse
set of data stores – machine data, relational data and other
unstructured data. Splunk DB Connect delivers real-time
connectivity to one or many relational databases and Splunk
Hadoop Connect delivers bi-directional connectivity to Hadoop.
Both Splunk apps enable you to drive more meaningful insights
from all of your data.
Real-time monitoring of the entire Hadoop stack. The Splunk App
for HadoopOps provides real-time monitoring and analysis of the
health and performance of the end-to-end Hadoop environment,
encompassing all layers of the supporting infrastructure.
Proven results. Splunk Enterprise is proven at over 5,600
enterprise customers. These organizations are using
 Splunk to
improve service levels, reduce operations costs, mitigate security
risks, enable compliance, enhance DevOps collaboration and create
new product and service offerings. Splunk customers typically
achieve a return on investment (ROI) measured in weeks or months,
sometimes even before being deployed into production.
Big Data Comes from Machines
All your IT applications, systems and technology infrastructure
generate data every millisecond of every day. This machine data
is one of the fastest growing, most complex areas of big data. It’s
also one of the most valuable, containing a definitive record of user
transactions, customer behavior, sensor activity, machine behavior,
security threats, fraudulent activity and more.
Machine data holds critical insights useful across the enterprise.
Here are a few examples:
•	 Monitor end-to-end transactions for online businesses
providing 24x7 operations
•	 Understand customer experience, behavior and usage of
services in real time
•	 Fulfill internal SLAs and monitor service provider agreements
•	 Identify spot trends and sentiment analysis on social
platforms
•	 Map and visualize threat scenario behavior patterns to
improve security posture 

Making use of machine data is challenging. It’s difficult to
process and analyze by traditional data management methods
or in a timely manner.
•	 Machine data is generated by a multitude of disparate sources;
correlating meaningful events across these is complex
•	 The data is unstructured and difficult to fit into a pre-
defined schema
•	 Machine data is high-volume and time-series based,
requiring new approaches for management and analysis
•	 The most valuable insights from this data are often needed
in real time 

Traditional business intelligence, data warehouse or IT analytics
solutions are simply not engineered for this class of high-
volume, dynamic and unstructured data. Emerging open source
technologies
 can provide part of the answer, but require
expensive, highly-trained developers who possess specialized
skill sets. When requirements change, these brittle solutions
typically lack the agility to quickly respond. 
Today’s enterprises
can’t wait. Key stakeholders across the organization need to
keep pace and adapt quickly to rapidly changing business
environments. They need a technology that supports real-time
analysis, data mining and ad hoc reporting—a solution that can
give them answers as fast as they think of questions.
S o l u t i o n s G u i d e
www.splunk.com
250 Brannan St, San Francisco, CA, 94107 info@splunk.com | sales@splunk.com 866-438-7758 | 415-848-8400 www.splunkbase.com
Copyright © 2013 Splunk Inc. All rights reserved. Splunk Enterprise is protected by U.S. and international copyright and intellectual property laws. Splunk is a registered trademark
or trademark of Splunk Inc. in the United States and/or other jurisdictions. All other marks and names mentioned herein may be trademarks of their respective companies. Item # SG-Splunk-BigData-108
What Makes Splunk Unique
Splunk Enterprise is an integrated, end-to-end, real-time solution
for machine data delivering the following core capabilities:
•	 Universal collection and indexing of machine data, from
virtually any source
•	 Powerful search processing language to search and
analyze real-time and historical data
•	 Real-time monitoring for patterns and thresholds; real-time
alerts when specific conditions arise
•	 Powerful reporting and analysis
•	 Custom dashboards and views for different roles
•	 Resilience and scale on commodity hardware
•	 Granular role-based security and access controls
•	 Support for multi-tenancy and flexible, distributed
deployments
•	 Connectivity with other data stores includes scalable,
real-time integration with relational databases and bi-
directional connectivity with Hadoop
•	 Robust, flexible platform for big data apps
Deploying Hadoop?
Hunk: Splunk Analytics on Hadoop, is a new software product
to explore, analyze and visualize data in Hadoop. Building upon
Splunk’s years of experience with big data analytics technology
deployed at thousands of customers, Hunk drives dramatic
improvements in the speed and simplicity of interacting with
and analyzing data in Hadoop without programming, costly
integrations or forced data migrations. Hunk is currently in beta.
Find out more at www.splunk.com/bigdata.
Customer Success with Splunk
5,600+ licensed customers are the best examples of machine-
generated big data in action.
Salesforce.com®
Salesforce.com, the industry-leading enterprise cloud computing
company, uses Splunk software to mine large quantities of data
generated from across their entire technology stack. 
Salesforce.
com has over 500 users of Splunk dashboards from IT users
monitoring customer experience to product managers
performing analytics on new services like ‘Chatter.’ 

“The fact that we had a data treasure chest was not obvious
 until
Splunk came in to the picture. Splunk has augmented our ability to
make data-driven decisions.”
Director Product Management, Salesforce.com
NPR®
NPR, the award winning, multimedia news organization reaching
26.8 million listeners per week, uses Splunk software to gain
better visibility and insight of their digital asset infrastructure.
NPR initially used Splunk to monitor and troubleshoot their end-
to-end asset delivery infrastructure. Before Splunk, there were
critical business metrics they couldn’t get from their traditional
web analytics solutions. They expanded their deployment
of Splunk and now measure program popularity, views by
device, reconcile royalty payments for digital rights, measure
abandonment rates and more.
“Only Splunk easily gives us the business reports about our web-
based digital assets that we need.”

Online Metrics Analyst, NPR
Cricket Communications
As a prepay telecommunications provider, Cricket
Communications is driven by web and point of sale channels. At
the heart of this is a homegrown CRM application, coupled with
third-party applications for middleware, billing and rating and
PoS (Point of Sale) support. Using Splunk to harness terabytes

of machine data generated by these systems and infrastructure,
the operations team calculates that Splunk helped them reduced
outage frequency by about 15%, translating into an annual ROI

of $1,200,000. In addition, analytics from this machine data has
enabled Cricket to provide executives real-time sales dashboards
that deliver sales by store, product type, device, rate plans,
z
ip code, etc. Machine data is being used to enrich data from
structured sources within the data management infrastructure.
Online Travel Company
One of the world’s leading online travel companies initially
used Splunk software to avoid website outages, saving
millions of dollars in lost revenue. They quickly expanded
their use of Splunk and within 10 months were monitoring
98% of their infrastructure. Today, over 2,700 users at this
organization 
use Splunk to gain real-time insights of not only
their IT infrastructure, but also online bookings, performance of
air- travel coupons and optimizing SEM.
“We achieved real-time visibility and insights across a wide range 
of
critical areas from server and application health and performance
monitoring to bookings trends, coupon use and deal analysis with
Splunk. We gained the ability to perform rapid, real-time analysis on
tens of terabytes of unstructured, time-sensitive machine data.”

Sr. Director Infrastructure Architecture
Free Download
Download Splunk for free. You’ll get a Splunk Enterprise
license for 60 days and you can index up to 500 megabytes
of data per day. After 60 days, or anytime before then,
you can convert to a perpetual Free license or purchase an
Enterprise license by contacting sales@splunk.com.

Splunk for big_data

  • 1.
    s o lu t i o n s g u i d e Splunk® for Big Data Turn massive machine data into real-time operational intelligence Making Machine-generated Data Accessible, Usable and Valuable to Everyone Splunk Enterprise is the leading platform for collecting, analyzing and visualizing machine data. It provides a unified way to organize and extract real-time insights from massive amounts of machine data from virtually any source. This includes data from websites, business applications, social media platforms, app servers, hypervisors, sensors, traditional databases and open source data stores. Once your data is in Splunk, you can search, monitor, report and analyze it, no matter how unstructured, large or diverse it may be. Splunk software gives you real-time understanding of what’s happening and deep analysis of what’s happened, driving new levels of visibility and insight. This is called operational intelligence. Enterprise-scale big data. Splunk software scales to collect and index tens of terabytes of data per day, across multi-geography, multi-datacenter and hybrid cloud infrastructures. Because the insights from your data are mission-critical, Splunk provides the resilience you need, even as you scale out your low-cost, distributed computing environment. Robust platform for developing big data apps. Developer teams will find a whole host of ways to leverage Splunk and maximize enterprise technology investments. Built-in SDKs for JavaScript and JSON with additional downloadable SDKs for Java, Python, PHP, C# and Ruby make it easy to customize and extend the power of Splunk. Powerful connectivity. Most organizations maintain a diverse set of data stores – machine data, relational data and other unstructured data. Splunk DB Connect delivers real-time connectivity to one or many relational databases and Splunk Hadoop Connect delivers bi-directional connectivity to Hadoop. Both Splunk apps enable you to drive more meaningful insights from all of your data. Real-time monitoring of the entire Hadoop stack. The Splunk App for HadoopOps provides real-time monitoring and analysis of the health and performance of the end-to-end Hadoop environment, encompassing all layers of the supporting infrastructure. Proven results. Splunk Enterprise is proven at over 5,600 enterprise customers. These organizations are using
 Splunk to improve service levels, reduce operations costs, mitigate security risks, enable compliance, enhance DevOps collaboration and create new product and service offerings. Splunk customers typically achieve a return on investment (ROI) measured in weeks or months, sometimes even before being deployed into production. Big Data Comes from Machines All your IT applications, systems and technology infrastructure generate data every millisecond of every day. This machine data is one of the fastest growing, most complex areas of big data. It’s also one of the most valuable, containing a definitive record of user transactions, customer behavior, sensor activity, machine behavior, security threats, fraudulent activity and more. Machine data holds critical insights useful across the enterprise. Here are a few examples: • Monitor end-to-end transactions for online businesses providing 24x7 operations • Understand customer experience, behavior and usage of services in real time • Fulfill internal SLAs and monitor service provider agreements • Identify spot trends and sentiment analysis on social platforms • Map and visualize threat scenario behavior patterns to improve security posture 
 Making use of machine data is challenging. It’s difficult to process and analyze by traditional data management methods or in a timely manner. • Machine data is generated by a multitude of disparate sources; correlating meaningful events across these is complex • The data is unstructured and difficult to fit into a pre- defined schema • Machine data is high-volume and time-series based, requiring new approaches for management and analysis • The most valuable insights from this data are often needed in real time 
 Traditional business intelligence, data warehouse or IT analytics solutions are simply not engineered for this class of high- volume, dynamic and unstructured data. Emerging open source technologies
 can provide part of the answer, but require expensive, highly-trained developers who possess specialized skill sets. When requirements change, these brittle solutions typically lack the agility to quickly respond. 
Today’s enterprises can’t wait. Key stakeholders across the organization need to keep pace and adapt quickly to rapidly changing business environments. They need a technology that supports real-time analysis, data mining and ad hoc reporting—a solution that can give them answers as fast as they think of questions.
  • 2.
    S o lu t i o n s G u i d e www.splunk.com 250 Brannan St, San Francisco, CA, 94107 [email protected] | [email protected] 866-438-7758 | 415-848-8400 www.splunkbase.com Copyright © 2013 Splunk Inc. All rights reserved. Splunk Enterprise is protected by U.S. and international copyright and intellectual property laws. Splunk is a registered trademark or trademark of Splunk Inc. in the United States and/or other jurisdictions. All other marks and names mentioned herein may be trademarks of their respective companies. Item # SG-Splunk-BigData-108 What Makes Splunk Unique Splunk Enterprise is an integrated, end-to-end, real-time solution for machine data delivering the following core capabilities: • Universal collection and indexing of machine data, from virtually any source • Powerful search processing language to search and analyze real-time and historical data • Real-time monitoring for patterns and thresholds; real-time alerts when specific conditions arise • Powerful reporting and analysis • Custom dashboards and views for different roles • Resilience and scale on commodity hardware • Granular role-based security and access controls • Support for multi-tenancy and flexible, distributed deployments • Connectivity with other data stores includes scalable, real-time integration with relational databases and bi- directional connectivity with Hadoop • Robust, flexible platform for big data apps Deploying Hadoop? Hunk: Splunk Analytics on Hadoop, is a new software product to explore, analyze and visualize data in Hadoop. Building upon Splunk’s years of experience with big data analytics technology deployed at thousands of customers, Hunk drives dramatic improvements in the speed and simplicity of interacting with and analyzing data in Hadoop without programming, costly integrations or forced data migrations. Hunk is currently in beta. Find out more at www.splunk.com/bigdata. Customer Success with Splunk 5,600+ licensed customers are the best examples of machine- generated big data in action. Salesforce.com® Salesforce.com, the industry-leading enterprise cloud computing company, uses Splunk software to mine large quantities of data generated from across their entire technology stack. 
Salesforce. com has over 500 users of Splunk dashboards from IT users monitoring customer experience to product managers performing analytics on new services like ‘Chatter.’ 
 “The fact that we had a data treasure chest was not obvious
 until Splunk came in to the picture. Splunk has augmented our ability to make data-driven decisions.” Director Product Management, Salesforce.com NPR® NPR, the award winning, multimedia news organization reaching 26.8 million listeners per week, uses Splunk software to gain better visibility and insight of their digital asset infrastructure. NPR initially used Splunk to monitor and troubleshoot their end- to-end asset delivery infrastructure. Before Splunk, there were critical business metrics they couldn’t get from their traditional web analytics solutions. They expanded their deployment of Splunk and now measure program popularity, views by device, reconcile royalty payments for digital rights, measure abandonment rates and more. “Only Splunk easily gives us the business reports about our web- based digital assets that we need.”
 Online Metrics Analyst, NPR Cricket Communications As a prepay telecommunications provider, Cricket Communications is driven by web and point of sale channels. At the heart of this is a homegrown CRM application, coupled with third-party applications for middleware, billing and rating and PoS (Point of Sale) support. Using Splunk to harness terabytes
 of machine data generated by these systems and infrastructure, the operations team calculates that Splunk helped them reduced outage frequency by about 15%, translating into an annual ROI
 of $1,200,000. In addition, analytics from this machine data has enabled Cricket to provide executives real-time sales dashboards that deliver sales by store, product type, device, rate plans,
z ip code, etc. Machine data is being used to enrich data from structured sources within the data management infrastructure. Online Travel Company One of the world’s leading online travel companies initially used Splunk software to avoid website outages, saving millions of dollars in lost revenue. They quickly expanded their use of Splunk and within 10 months were monitoring 98% of their infrastructure. Today, over 2,700 users at this organization 
use Splunk to gain real-time insights of not only their IT infrastructure, but also online bookings, performance of air- travel coupons and optimizing SEM. “We achieved real-time visibility and insights across a wide range 
of critical areas from server and application health and performance monitoring to bookings trends, coupon use and deal analysis with Splunk. We gained the ability to perform rapid, real-time analysis on tens of terabytes of unstructured, time-sensitive machine data.”
 Sr. Director Infrastructure Architecture Free Download Download Splunk for free. You’ll get a Splunk Enterprise license for 60 days and you can index up to 500 megabytes of data per day. After 60 days, or anytime before then, you can convert to a perpetual Free license or purchase an Enterprise license by contacting [email protected].