Hortonworks Logo

Hortonworks is the only 100-percent open source software provider to develop, distribute and support an Apache Hadoop platform explicitly architected, built and tested for enterprise-grade deployments. Developed by the original architects, builders and operators of Hadoop, Hortonworks stewards the core and delivers the critical services required by the enterprise to reliably and effectively run Hadoop at scale. Our distribution, Hortonworks Data Platform, provides an open and stable foundation for enterprises and a growing ecosystem to build and deploy big data solutions. Hortonworks also provides unmatched technical support, training and certification programs.


Yahoo is a guide focused on making users’ daily habits inspiring and entertaining. By creating highly personalized experiences for our users, we keep people connected to what matters most to them, across devices and around the world. In turn, we create value for advertisers by connecting them with the audiences that build their businesses. Yahoo is headquartered in Sunnyvale, California, and has offices located throughout the Americas, Asia Pacific (APAC) and the Europe, Middle East and Africa (EMEA) regions. For more information, visit the pressroom (pressroom.yahoo.net) or the Company’s blog (yahoo.tumblr.com).


Diamond Sponsors


EMC Corporation is a global leader in enabling businesses and service providers to transform their operations and deliver IT as a service. Fundamental to this transformation is cloud computing. Through innovative products and services, EMC accelerates the journey to cloud computing, helping IT departments to store, manage, protect and analyze their most valuable asset – information – in a more agile, trusted and cost-efficient way. Additional information about EMC can be found at www.EMC.com.


HP offers solutions that provide data analytics without compromise. HP Haven includes core software technologies like Vertica, SQL on Hadoop, IDOL and Distributed R to harness 100% of your data in delivering analytics.  It is backed by HP hardware and services to offer a complete platform for organizations who won’t compromise when making use of virtually all information sources in the enterprise.


Unlock actionable insights from structured, unstructured, and streaming data with Microsoft Big Data Solutions. Deploy Hadoop with HDInsight Service in the cloud with the simplicity and manageability of Windows Azure. Combine your structured and unstructured data with PolyBase for SQL Server 2012 Parallel Data Warehouse. Glean new insights from your data with familiar tools like Excel and SharePoint. Enrich your data by combining with the world’s data through Windows Azure Marketplace. Learn more at www.microsoft.com/hdinsight


As market leader in enterprise application software, SAP (NYSE: SAP) helps companies of all sizes and industries run better. From back office to boardroom, warehouse to storefront, desktop to mobile device – SAP empowers people and organizations to work together more efficiently and use business insight more effectively to stay ahead of the competition. SAP applications and services enable more than 261,000 customers to operate profitably, adapt continuously, and grow sustainably. For more information, visit www.sap.com.


Teradata helps companies get more value from data than any other company. Our big data analytic solutions, integrated marketing applications, and team of experts can help your company gain a sustainable competitive advantage with data.  Teradata helps organizations leverage all their data so they can know more about their customers and business and do more of what’s really important. Visit teradata.com.


Platinum Sponsors


Actian is facilitating innovation like no other company, and has made huge strides in democratizing big data with the Actian Analytics Platform. Recognizing the market opportunity for Hadoop, Actian introduced the highest-performing, most industrialized SQL on Hadoop solution – Actian Vortex.  By opening up Hadoop to the millions of SQL programmers around the globe, Actian helps its tens of thousands of customers get one step closer to leveraging their data investments for transformational business value.


BMC delivers software solutions that help IT transform digital enterprises for the ultimate competitive business advantage. We have worked with thousands of leading companies to create and deliver powerful IT management services that pair high-speed digital innovation with robust IT industrialization for optimized IT performance, cost, compliance, and productivity.


Cloudera is revolutionizing enterprise data management with the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop™. Cloudera offers enterprises one place to store, process and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data.


Your Paths. Our Platforms. Great Partnerships.  For more than 30 years, Dell has played a critical role in transforming computing, enabling more affordable and more pervasive access to technology worldwide. When Michael Dell founded the company back in 1984, he revolutionized the PC industry. Today, we’re still applying our entrepreneurial drive to the next-generation of technology solutions like cloud computing, big data, security and mobility.


IBM BigInsights: Hadoop is great for managing many types of data with its ability to scale easily for large volumes and its schema on read for many data types. To fully extract the value of Hadoop, you need enterprise-ready features including analytics, visualization, and security.   IBM reduces complexity and is an integral part of the Logical Data Warehouse. Integrated with existing infrastructure, IBM gets you started quickly and confidently to address more complex problems tomorrow.


Informatica is the world’s number one independent provider of data integration software. This software is available on Hadoop, for fivefold productivity gains that transform more data into more accurate, insightful analysis in less time. Organizations around the world rely on Informatica to realize their information potential, drive their top business imperatives and fully leverage their information assets from devices to mobile to social to big data residing on-premise, in the Cloud and across social networks.


Intel, the world leader in silicon innovation, delivers hardware and software technologies to continually advance how people work and live. For over two decades, Intel’s contributions to open-source projects have helped ensure that a breadth of solutions run exceptionally well on Intel® architecture helping to unlock business opportunities, connect people, and enhance lives. Open source is bringing amazing experiences to life-and Intel is helping power these experiences as a Sponsor of Tomorrow.

MapR Logo (Red Background & White Text)-1

MapR delivers on the promise of Hadoop with a proven, enterprise-grade platform that supports a broad set of mission-critical and real-time production uses. MapR brings unprecedented dependability, ease-of-use, and world-record speed to Hadoop, NoSQL, database and streaming applications in one unified Big Data platform. MapR is used by more than 500 customers across financial services, retail, media, healthcare, manufacturing, telecommunications and government organizations as well as by leading Fortune 100 and Web 2.0 companies.


Oracle provides the world’s most complete, open, and integrated business software and hardware systems representing a variety of sizes and industries in more than 145 countries. Big data is revolutionizing the way businesses and government operate virtually overnight. As you explore how to leverage the power of big data in your business, let Oracle experts show you how to maximize value from big data and transform your world. Learn about our Big Data Solutions at http://www.oracle.com/us/technologies/big-data/index.html


Pivotal offers a modern approach to technology that organizations need to thrive in a new era of business innovation. Our solutions intersect cloud, big data and agile development, creating a framework that increases data leverage, accelerates application delivery, and decreases costs, while providing enterprises the speed and scale they need to compete.


Not only is SAS the only provider to offer an integrated platform for data management, visualization and analytics, but our support for Hadoop spans the entire analytics life cycle, from data to decision.

With SAS®, you can:

  • Execute Pig, Hive and MapReduce data transformations.
  • Automatically execute models.
  • Integrate with R and Python.
  • Score data in Hadoop or lift data into memory.
  • Choose from many hardware and database vendors.



Trifacta, the pioneer in data transformation, significantly enhances the value of an enterprise’s Big Data by enabling users to easily transform raw, complex data into clean and structured inputs for analysis. Leveraging decades of innovative work in human-computer interaction, scalable data management and machine learning, Trifacta’s unique technology creates a bi-directional partnership between user and machine, with each component learning from the other and becoming smarter through use.


WANdisco is a provider of enterprise-ready, non-stop software solutions that enable globally distributed organizations to meet today’s data challenges of secure storage, scalability and availability. WANdisco’s products are differentiated by its patented, active-active data replication technology, serving critical requirements for enterprise Hadoop deployments. Fortune Global 1000 companies, including Juniper Networks, Motorola, and Halliburton, rely on WANdisco for performance, reliability, security and availability. For additional information, visit www.wandisco.com.


 Gold Sponsors


Cisco is the worldwide leader in IT that helps companies seize the opportunities of tomorrow by providing that amazing things can happen when you connect the previously unconnected. For further details, please go to http://www.cisco.com


Datameer is the only end-to-end big data analytics application purpose-built for the Hadoop ecosystem, designed to make big data easy for everyone. More than 200 companies use Datameer to integrate, prepare, analyze and visualize all of their data to get actionable insights in hours instead of weeks. Founded in 2009 by Hadoop veterans, Datameer is headquartered in San Francisco with offices in New York and Germany. For more information, please visit http://www.datameer.com.


DataTorrent RTS is the industry’s only solution to have a high performing, fault tolerant unified architecture for both data in motion and data at rest. Proven in production environments to reduce time to market, development costs and operational expenditures for Fortune 100 and leading Internet companies.  DataTorrent is backed by leading investors including August Capital, GE Ventures, Singtel Innov8, Morado Ventures, and Yahoo co-founder Jerry Yang. For more information, visit our website or follow us on Twitter.


JethroData accelerates BI-on-Hadoop performance. Our SQL engine uniquely combines columnar storage with full-indexing. Queries then use indexes to access only the data they need instead of performing a full-scan of the entire dataset. The result is fast queries and reduced cluster load. Jethro is optimal for using BI tools on large datasets in Hadoop, where queries are typically selective and users require interactive response time. Jethro supports most BI tools and Hadoop distributions. www.jethrodata.com


Pentaho is building the future of business analytics. Pentaho’s open source heritage drives our continued innovation in a modern, integrated, embeddable platform built for accessing all data sources. With support for all of the leading Hadoop distributions, NoSQL databases and high performance analytic databases, Pentaho provides the broadest support for big data analytics, as well as integration and orchestration of big data and traditional sources. For more information visit pentaho.com or call +1 866-660-7555.

Platfora instantly transforms raw data in Hadoop into interactive, in-memory business intelligence with none of the friction or complexity of traditional approaches. Platfora is a complete solution – seamlessly connecting data to end-users, with no separate data warehouse or ETL software required. The Platfora solution is comprised of a web-based BI application, scale-out, in-memory data mart engine, and an automated hadoop data refinery.Visit www.platfora.com and follow us on Twitter @platfora.


RedPoint Global offers a comprehensive set of world-class ETL, data quality and data integration applications that operate in and across both traditional and Hadoop 2.0/YARN environments.  RedPoint also offers data-driven customer engagement solutions helping companies derive insights from customer behaviors and create consistent and relevant messages. All RedPoint applications offer a unique visual user interface, allowing enterprises to utilize all data to achieve their strategic business goals. For more information visit www.redpoint.net or email: contact.us@redpoint.net.


The SnapLogic Elastic Integration Platform allows enterprise IT organizations to connect big data, cloud applications and APIs faster. With an easy-to-use cloud-based designer, hybrid execution engine that respects data gravity, and 300+ connectors, called Snaps, SnapLogic’s modern platform is built handle big data ingestion, preparation and delivery at scale. Go beyond hand coding and legacy ETL tools and get a better return on all of your big data and cloud application investments with SnapLogic.


Syncsort provides enterprise software to collect, integrate, sort and distribute more data in less time, with fewer resources and lower costs.  Thousands of customers in more than 85 countries, including 87 of the Fortune 100 companies, use our fast and secure software to optimize and offload data processing workloads. Powering over 50% of the world’s mainframes, Syncsort provides specialized solutions spanning “Big Iron to Big Data,” including Hadoop, Windows, Linux, Unix, Cloud, and Splunk.www.syncsort.com


Talend’s integration solutions allow data-driven organizations to gain instant value from all their data. Through native support of modern big data platforms, Talend takes the complexity out of integration efforts and equips IT departments to be more responsive to the demands of the business, at a predictable cost. Based on open source technologies, Talend’s future-proof solutions address all existing and emerging integration requirements. For more information, please visit www.talend.com and follow us on Twitter: @Talend


VMware is a leader in cloud infrastructure and business mobility. Built on VMware’s industry-leading virtualization technology, our solutions deliver a brave new model of IT that is fluid, instant and more secure. Customers can innovate faster by rapidly developing, automatically delivering and more safely consuming any application. With 2014 revenues of $6 billion, VMware has more than 500,000 customers and 75,000 partners. The company is headquartered in Silicon Valley with offices throughout the world.


Zoomdata, developers of the world’s fastest big data exploration, visualization and analytics platform, lets business users see and interact with data in all new ways. Designed mobile and touch first, its patented micro-query architecture delivers results on billions of records in seconds and gives users a single plane of access for bridging old data and new data.


 Silver Sponsors


Arcadia Data delivers unified visual analytics and BI for Hadoop. With our fully converged platform, business users extract granular insights from all of their Hadoop data at speeds they never thought possible, and share them in interactive data-driven applications built with drag-and-drop ease. And, they do all of this without summarization, duplication or sampling. Learn more about the Converged Analytics PlatformTM difference and download a free version of the Arcadia Instant product at arcadiadata.com.


Arista Networks was founded to deliver software driven cloud networking solutions for large data center and high-performance computing environments. With more than three million cloud networking ports deployed worldwide, Arista delivers 1/10/40 and 100GbE products that redefine network architectures, bring extensibility to networking, and dramatically change the performance of data center networks. At the core of Arista’s platform is the Extensible Operating System, a ground-breaking network operating system with single-image consistency across hardware platforms.


Ataccama Corporation combines data quality, master data management, and data governance in a single technology platform ready for operational, analytical and BigData deployments. Ataccama Big Data Engine offers an easy-to-use development interface (GUI), shared metadata, and rich data integration layer — often replacing specialized ETL technologies. Understand the quality and value of your Hadoop data with Ataccama Big Data Analyzer or kick off your Hadoop initiative now and request a complimentary Big Data Test Drive.

atscale-700px-color (1)

We Make BI work on Hadoop. With AtScale, business users get interactive and multi-dimensional analysis capabilities, directly on Hadoop, at maximum speed, using the tools they already know, own and love – from Microsoft Excel to Tableau Software to QlikView. Built by Big Data Veterans from Yahoo!, Google and Oracle, AtScale is already enabling the BI on Hadoop revolution at major corporations. To see how AtScale can help you, go to www.atscale.com


Attunity software solutions enable access, management, sharing and distribution of data across heterogeneous enterprise platforms, organizations, and the cloud. Our software solutions include data replicationdata flow managementtest data managementchange data capture (CDC), data connectivityenterprise file replication (EFR), managed file transfer (MFT), data warehouse automation, and cloud data delivery. Using Attunity’s solutions, our customers enjoy significant business benefits by enabling real-time access and availability of data and files where and when needed.


BlueData is transforming Big Data infrastructure. The BlueData EPIC software platform leverages virtualization and patent-pending innovations to make it easier, faster, and more cost-effective to deploy Hadoop or Spark infrastructure. With BlueData, our customers can provide Hadoop-as-a-Service in an on-premises deployment model. They can spin up virtual Hadoop or Spark clusters within minutes – providing their data scientists with on-demand access to the applications, data and infrastructure they need. Learn more at www.bluedata.com


Bright Cluster Manager is enterprise-grade software for deploying, monitoring and managing Hadoop clusters of all sizes. From its bare-metal provisioning of the entire software stack (including Spark) to its beautiful graphical user interface, Bright provides the most advanced cluster management solution for Hadoop available. Dell, Cisco, Amazon and Intel are part of Bright’s partner ecosystem, and our customers include leading Fortune 100 companies


Cask is an open source big data software company providing simple access to powerful technology. Based in Palo Alto, CA and funded by leading investors such as Battery Ventures, Ignition Partners, and Andreessen Horowitz, Cask was founded by developers to build solutions for developers.  Cask’s flagship offering, the Cask Data Application Platform, makes it possible to quickly and easily develop and deploy more powerful applications for Hadoop.


Cloudwick (www.cloudwick.com) is the leading big data service provider to the Global 1000. As a certified systems integration partner to Cloudera, Hortonworks, DataStax, Databricks and AWS, our Hadoop, Spark and NoSQL administrators, developers and data scientists build, operate, monitor and manage on-premise and cloud big data systems for leading enterprises including 3M, Bank of America, Comcast, Home Depot, Intuit, JP Morgan, NetApp, Target, Visa, Walmart and more.


Supercomputing leader Cray provides high-performance data analytics and discovery platforms that help organizations derive value from big data. Cray’s analytics products accelerate time to insight by unifying hardware, software and management into turnkey, converged platforms. Organizations enjoy superior analytic productivity through Cray’s implementation of innovative memory and storage strategies and use of performance networking in its systems. Pre-installed software includes industry-standard Hadoop and Spark frameworks, and Cray’s graph engine for complex relationship analytics.


Databricks was founded out of the UC Berkeley AMPLab by the creators of Apache Spark. We’ve been working for the past six years on cutting-edge systems to extract value from Big Data. We believe that Big Data is a huge opportunity that is still largely untapped, and we’re working to revolutionize what you can do with it.


Dataguise helps enterprises safely unlock the benefits of big data with the most precise security solution that detects, audits, protects, and monitors sensitive data assets in real time wherever they live and move across all repositories. We deliver the only one-stop, out-of-the-box solution that provides the highest level of protection. We’re proud to secure the data of some of the largest, industry leading companies that are committed to being responsible data stewards. www.dataguise.com.


Elastic is a company that believes getting insight from data matters. Built around three open source products — Elasticsearch, Logstash, and Kibana — Elastic is extending what’s possible with data, delivering on the promise that good things come from connecting the dots. Designed to help users take data from any source and search, analyze, and visualize it in real time, Elastic products are changing the way organizations get value from data. To learn more, visit www.elastic.co.


H2O rewrites the rules of data science by bringing model design and scoring into a single platform. Leaders like PayPal, Nielsen, Cisco, and MarketShare rely on H2O’s extensible machine learning platform to power CPU-intensive predictions, high volume data analyses, and complex machine learning algorithms. H2O’s platform features a visual dashboard for non-technical users and easy-to-use JSON and Java APIs for R, Python, Excel, JavaScript and Tableau integrations.


HGST is a leader in data storage, unlocking greater potential by helping the world harness the power of data. Building on its world-class reputation, HGST’s smarter storage solutions are everywhere, touching lives and enabling possibilities for the enterprise, cloud computing, and sophisticated infrastructures in healthcare, energy, finance and government.


Impetus Technologies is a provider of innovative Big Data solutions and services that empower large enterprises to unlock the full value of Big Data opportunities.  Our proven methodologies and solutions span the full life-cycle of architecture advisory, proof of value, data science, application development and implementation services. We have launched solutions for Data Warehouse Modernization and StreamAnalytix for rapid development of real-time streaming data analytics applications using open source technologies.  For more info visit http://bigdata.impetus.com.


Kyvos is committed to unlock the power of Big Data Analytics with its unique “OLAP on Hadoop” technology. This allows you to build cubes in-place on Hadoop with linear scalability, eliminating the limitations of traditional OLAP solutions, and enabling interactive multi-dimensional analytics on your Big Data. Users can visualize, explore and analyze their data interactively on Hadoop with no programming required. Come and explore Kyvos to experience OLAP on Hadoop at unprecedented scale.


Pepperdata enables enterprises to rely on Hadoop in production. Pepperdata’s real-time cluster optimizer dynamically adjusts cluster utilization based on customer priorities so that jobs run faster, more reliably, and more efficiently. Pepperdata installs on existing clusters and works with any Hadoop distribution, including Cloudera, Hortonworks, IBM, MapR, and Apache.


QCT is a global datacenter solution provider extending the power of hyperscale datacenter design in standard and open SKUs to all datacenter customers.
Product lines include servers, storage, network switches, integrated rack systems and cloud solutions, all delivering hyperscale efficiency, scalability, reliability, manageability, serviceability and optimized performance for each workload. QCT offers a full spectrum of datacenter products and services from engineering, integration and optimization to global supply chain support, all under one roof.


Qubole delivers a Self-Service Platform for Big Data Analytics built on Amazon, Microsoft and Google Clouds. We were started by the team that built and ran Facebook’s Data Service and authored Apache Hive. With Qubole, a data scientist can now spin up hundreds of clusters on their public cloud of choice and begin creating ad hoc and/or batch queries in under five minutes and have the system autoscale to the optimal compute levels as needed.


Rackspace® is the #1 managed cloud company, the leader in hybrid cloud, and the founder of OpenStack®. Its technical expertise, multiple technology platforms, and Fanatical Support® allow companies to tap the power of the cloud without the pain of hiring experts in dozens of complex technologies.  www.rackspace.com.

Saama Technologies is one of the largest pure-play data science solutions and services companies focused on solving the data management and advanced analytics challenges of the world’s leading brands. Saama has over 15 years of experience implementing business analytics, big data, predictive analytics and data management solutions for global clients.


Simba Technologies (www.simba.com, @SimbaTech) connects the world. As the recognized leader in standards-based data connectivity for both relational and multi-dimensional data sources, Simba connects the world’s leading companies across multiple platforms, including Windows, Mac, UNIX, Linux and mobile OSes. Simba’s customers are industry leaders and Big Data innovators, and include Alteryx®, Altiscale®, Cloudera®, Couchbase®, Databricks®, DataStax®, Google®, Hortonworks®, Informatica®, MapR®, Microsoft®, Oracle®, Qubole®, SAP®, Splunk®, Tableau®, and Teradata®.


Skytree®—The Machine Learning Company® is disrupting the Advanced Analytics market with a Machine Learning platform that gives organizations the power to discover deep analytic insights, predict future trends, make recommendations and reveal untapped market and customer opportunities. Skytree’s flagship product—Skytree Infinity™— is an enterprise-ready scalable Machine Learning platform, built from the ground up to work on massive and fast changing datasets with the highest accuracy at unprecedented speed and scale.


StreamSets offers an innovative, new approach for managing data in motion. We help companies automatically reduce the risk and increase the efficiency of continuous data ingestion into big data stores. Today, much of this work is either done manually or by repurposing legacy technologies, leading to great difficulty in ensuring best practices.  StreamSets works with any Hadoop distribution to provide automatic integration, continuous data preparation and intelligent monitoring of ingested data.


Supermicro® (NASDAQ: SMCI), the leading innovator in high-performance, high-efficiency server technology is a premier provider of advanced server Building Block Solutions® for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, HPC and Embedded Systems worldwide. Supermicro is committed to protecting the environment through its “We Keep IT Green®” initiative and provides customers with the most energy-efficient, environmentally-friendly solutions available on the market.


Tableau Software helps people see and understand data. Tableau’s award-winning software delivers fast analytics, visualization and rapid-fire business intelligence on data of any size, format, or subject. The result? Anyone can get answers from data quickly, with no programming required. From executive dashboards to ad-hoc reports, Tableau lets you share mobile and browser-based, interactive analytics in a few clicks. More than 26,000 companies and organizations, including some of the world’s largest enterprises, rely on Tableau Software.


Tech Mahindra is a USD 3.1 billion company with 92,729 professionals across 51 countries, providing services to 632 global customers including Fortune 500 companies. Our BI portfolio consists of full life cycle system integration services available across 9 industries and 45 BI technologies. These technologies include BI and PM Consulting, Performance Management, Big Data and Analytics, Data Management, Data Warehousing (DW), Data Warehouse Appliances, Mobile BI, In-Memory Computing, Business Analytics, Social Media Analytics, and On-Demand BI Solutions.


Unravel optimizes big data applications and clusters automatically for maximum performance. It also provides big data developers and operations teams an easy-to-understand 360° view into applications, data, and resource usage for the quickest root-cause analysis and intelligent planning. With Unravel, enterprises have increased cluster utilization by 50%, application speed by 300%, and reduced problem troubleshooting time by 90%. Unravel installs in minutes, supports all popular systems including Hadoop, Spark, and NoSQL both on-premises and cloud.


WebAction is the most comprehensive, realtime stream analytics platform. Quickly build tailored enterprise-scale Big Data applications that assimilate, correlate and analyze disparate, high-velocity data. The continuous integration of realtime and historical information provides up-to-the-millisecond visibility into both customer and business health. Identify issues instantaneously and in-time to effectively resolve them.


Zettaset, the leader in Big Data security, is an ISV delivering proven enterprise-class data protection that is compatible with any Hadoop or NoSQL database. Zettaset data encryption, access-control, and authentication solutions are uniquely designed and optimized for scale and performance in today’s complex and demanding distributed-computing environments. Customers can rely on Zettaset to deliver advanced Big Data security solutions that are simple to deploy and easy to fit into existing IT security and policy frameworks.


Exhibitor Sponsors









InfoObjects Transparent Logo













Interested in Sponsoring Hadoop Summit?

Hadoop Summit North America 2015 is the premier industry event for Apache Hadoop users, developers and vendors. It is an ideal opportunity to showcase your Hadoop-related products and services to organizations interested in leveraging Apache Hadoop technology to solve their big data challenges.

A variety of sponsorship options are available depending on the exposure you want at Hadoop Summit. Some levels are limited in numbers so act now. Special deals are available if you sponsor both Hadoop Summit Europe and Hadoop Summit North America. For more information on becoming a sponsor, please contact:

Jeff Taylor
Hadoop Summit Sponsorship Sales
US Direct: 1.925.997.7831


2014 Hadoop Summit Community Showcase