Hortonworks is the only 100-percent open source software provider to develop, distribute and support an Apache Hadoop platform explicitly architected, built and tested for enterprise-grade deployments. Developed by the original architects, builders and operators of Hadoop, Hortonworks stewards the core and delivers the critical services required by the enterprise to reliably and effectively run Hadoop at scale. Our distribution, Hortonworks Data Platform, provides an open and stable foundation for enterprises and a growing ecosystem to build and deploy big data solutions. Hortonworks also provides unmatched technical support, training and certification programs.


Yahoo is a guide focused on making users’ daily habits inspiring and entertaining. By creating highly personalized experiences for our users, we keep people connected to what matters most to them, across devices and around the world. In turn, we create value for advertisers by connecting them with the audiences that build their businesses. Yahoo is headquartered in Sunnyvale, California, and has offices located throughout the Americas, Asia Pacific (APAC) and the Europe, Middle East and Africa (EMEA) regions. For more information, visit the pressroom (pressroom.yahoo.net) or the Company’s blog (yahoo.tumblr.com).


Diamond Sponsors


EMC Corporation is a global leader in enabling businesses and service providers to transform their operations and deliver IT as a service. Fundamental to this transformation is cloud computing. Through innovative products and services, EMC accelerates the journey to cloud computing, helping IT departments to store, manage, protect and analyze their most valuable asset – information – in a more agile, trusted and cost-efficient way. Additional information about EMC can be found at www.EMC.com.


HP offers solutions that provide data analytics without compromise. HP Haven includes core software technologies like Vertica, SQL on Hadoop, IDOL and Distributed R to harness 100% of your data in delivering analytics.  It is backed by HP hardware and services to offer a complete platform for organizations who won’t compromise when making use of virtually all information sources in the enterprise.


Unlock actionable insights from structured, unstructured, and streaming data with Microsoft Big Data Solutions. Deploy Hadoop with HDInsight Service in the cloud with the simplicity and manageability of Windows Azure. Combine your structured and unstructured data with PolyBase for SQL Server 2012 Parallel Data Warehouse. Glean new insights from your data with familiar tools like Excel and SharePoint. Enrich your data by combining with the world’s data through Windows Azure Marketplace. Learn more at www.microsoft.com/hdinsight


Teradata helps companies get more value from data than any other company. Our big data analytic solutions, integrated marketing applications, and team of experts can help your company gain a sustainable competitive advantage with data.  Teradata helps organizations leverage all their data so they can know more about their customers and business and do more of what’s really important. Visit teradata.com.


Platinum Sponsors


Actian is facilitating innovation like no other company, and has made huge strides in democratizing big data with the Actian Analytics Platform. Recognizing the market opportunity for Hadoop, Actian introduced the highest-performing, most industrialized SQL on Hadoop solution – Actian Vortex.  By opening up Hadoop to the millions of SQL programmers around the globe, Actian helps its tens of thousands of customers get one step closer to leveraging their data investments for transformational business value.


Cloudera is revolutionizing enterprise data management with the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop™. Cloudera offers enterprises one place to store, process and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data.


Your Paths. Our Platforms. Great Partnerships.  For more than 30 years, Dell has played a critical role in transforming computing, enabling more affordable and more pervasive access to technology worldwide. When Michael Dell founded the company back in 1984, he revolutionized the PC industry. Today, we’re still applying our entrepreneurial drive to the next-generation of technology solutions like cloud computing, big data, security and mobility.


Informatica is the world’s number one independent provider of data integration software. This software is available on Hadoop, for fivefold productivity gains that transform more data into more accurate, insightful analysis in less time. Organizations around the world rely on Informatica to realize their information potential, drive their top business imperatives and fully leverage their information assets from devices to mobile to social to big data residing on-premise, in the Cloud and across social networks.


Intel, the world leader in silicon innovation, delivers hardware and software technologies to continually advance how people work and live. For over two decades, Intel’s contributions to open-source projects have helped ensure that a breadth of solutions run exceptionally well on Intel® architecture, helping to unlock business opportunities, connect people, and enhance lives. Open source is bringing amazing experiences to life, and Intel is helping power these experiences as a Sponsor of Tomorrow.

MapR delivers on the promise of Hadoop with a proven, enterprise-grade platform that supports a broad set of mission-critical and real-time production uses. MapR brings unprecedented dependability, ease-of-use, and world-record speed to Hadoop, NoSQL, database and streaming applications in one unified Big Data platform. MapR is used by more than 500 customers across financial services, retail, media, healthcare, manufacturing, telecommunications and government organizations as well as by leading Fortune 100 and Web 2.0 companies.


Pivotal offers a modern approach to technology that organizations need to thrive in a new era of business innovation. Our solutions intersect cloud, big data and agile development, creating a framework that increases data leverage, accelerates application delivery, and decreases costs, while providing enterprises the speed and scale they need to compete.


Not only is SAS the only provider to offer an integrated platform for data management, visualization and analytics, but our support for Hadoop spans the entire analytics life cycle, from data to decision.

With SAS®, you can:

  • Execute Pig, Hive and MapReduce data transformations.
  • Automatically execute models.
  • Integrate with R and Python.
  • Score data in Hadoop or lift data into memory.
  • Choose from many hardware and database vendors.



Trifacta, the pioneer in data transformation, significantly enhances the value of an enterprise’s Big Data by enabling users to easily transform raw, complex data into clean and structured inputs for analysis. Leveraging decades of innovative work in human-computer interaction, scalable data management and machine learning, Trifacta’s unique technology creates a bi-directional partnership between user and machine, with each component learning from the other and becoming smarter through use.


WANdisco is a provider of enterprise-ready, non-stop software solutions that enable globally distributed organizations to meet today’s data challenges of secure storage, scalability and availability. WANdisco’s products are differentiated by its patented, active-active data replication technology, serving critical requirements for enterprise Hadoop deployments. Fortune Global 1000 companies, including Juniper Networks, Motorola, and Halliburton, rely on WANdisco for performance, reliability, security and availability. For additional information, visit www.wandisco.com.


Gold Sponsors



Cisco is the worldwide leader in IT that helps companies seize the opportunities of tomorrow by proving that amazing things can happen when you connect the previously unconnected. For further details, please go to http://www.cisco.com


Datameer is the only proven big data insights platform that quickly transforms businesses into agile, insights-driven organizations. Datameer provides an intuitive platform for fluid data discovery that reveals insights in hours instead of months. More than 200 companies including Citibank, Telefonica, Workday, and VISA use Datameer to integrate, prepare, analyze, and visualize all their data, driving significant competitive advantage and unprecedented ROI. For more information, please visit http://www.datameer.com.



JethroData accelerates BI-on-Hadoop performance. Our SQL engine uniquely combines columnar storage with full-indexing. Queries then use indexes to access only the data they need instead of performing a full-scan of the entire dataset. The result is fast queries and reduced cluster load. Jethro is optimal for using BI tools on large datasets in Hadoop, where queries are typically selective and users require interactive response time. Jethro supports most BI tools and Hadoop distributions. www.jethrodata.com


Pentaho is building the future of business analytics. Pentaho’s open source heritage drives our continued innovation in a modern, integrated, embeddable platform built for accessing all data sources. With support for all of the leading Hadoop distributions, NoSQL databases and high performance analytic databases, Pentaho provides the broadest support for big data analytics, as well as integration and orchestration of big data and traditional sources. For more information visit pentaho.com or call +1 866-660-7555.

Platfora instantly transforms raw data in Hadoop into interactive, in-memory business intelligence with none of the friction or complexity of traditional approaches. Platfora is a complete solution – seamlessly connecting data to end-users, with no separate data warehouse or ETL software required. The Platfora solution comprises a web-based BI application, a scale-out, in-memory data mart engine, and an automated Hadoop data refinery. Visit www.platfora.com and follow us on Twitter @platfora.



Syncsort provides enterprise software to collect, integrate, sort and distribute more data in less time, with fewer resources and lower costs. Thousands of customers in more than 85 countries, including 87 of the Fortune 100 companies, use our fast and secure software to optimize and offload data processing workloads. Powering over 50% of the world’s mainframes, Syncsort provides specialized solutions spanning “Big Iron to Big Data,” including Hadoop, Windows, Linux, Unix, Cloud, and Splunk. www.syncsort.com


Talend’s integration solutions allow data-driven organizations to gain instant value from all their data. Through native support of modern big data platforms, Talend takes the complexity out of integration efforts and equips IT departments to be more responsive to the demands of the business, at a predictable cost. Based on open source technologies, Talend’s future-proof solutions address all existing and emerging integration requirements. For more information, please visit www.talend.com and follow us on Twitter: @Talend


VMware is a leader in cloud infrastructure and business mobility. Built on VMware’s industry-leading virtualization technology, our solutions deliver a brave new model of IT that is fluid, instant and more secure. Customers can innovate faster by rapidly developing, automatically delivering and more safely consuming any application. With 2014 revenues of $6 billion, VMware has more than 500,000 customers and 75,000 partners. The company is headquartered in Silicon Valley with offices throughout the world.


Zoomdata, developers of the world’s fastest big data exploration, visualization and analytics platform, lets business users see and interact with data in all new ways. Designed mobile and touch first, its patented micro-query architecture delivers results on billions of records in seconds and gives users a single plane of access for bridging old data and new data.


Silver Sponsors


Arista Networks was founded to deliver software driven cloud networking solutions for large data center and high-performance computing environments. With more than three million cloud networking ports deployed worldwide, Arista delivers 1/10/40 and 100GbE products that redefine network architectures, bring extensibility to networking, and dramatically change the performance of data center networks. At the core of Arista’s platform is the Extensible Operating System, a ground-breaking network operating system with single-image consistency across hardware platforms.



Attunity software solutions enable access, management, sharing and distribution of data across heterogeneous enterprise platforms, organizations, and the cloud. Our software solutions include data replication, data flow management, test data management, change data capture (CDC), data connectivity, enterprise file replication (EFR), managed file transfer (MFT), data warehouse automation, and cloud data delivery. Using Attunity’s solutions, our customers enjoy significant business benefits by enabling real-time access and availability of data and files where and when needed.


BlueData is transforming Big Data infrastructure. The BlueData EPIC software platform leverages virtualization and patent-pending innovations to make it easier, faster, and more cost-effective to deploy Hadoop or Spark infrastructure. With BlueData, our customers can provide Hadoop-as-a-Service in an on-premises deployment model. They can spin up virtual Hadoop or Spark clusters within minutes – providing their data scientists with on-demand access to the applications, data and infrastructure they need. Learn more at www.bluedata.com


Bright Cluster Manager is enterprise-grade software for deploying, monitoring and managing Hadoop clusters of all sizes. From its bare-metal provisioning of the entire software stack (including Spark) to its beautiful graphical user interface, Bright provides the most advanced cluster management solution for Hadoop available. Dell, Cisco, Amazon and Intel are part of Bright’s partner ecosystem, and our customers include leading Fortune 100 companies.


Cask is an open source big data software company providing simple access to powerful technology. Based in Palo Alto, CA and funded by leading investors such as Battery Ventures, Ignition Partners, and Andreessen Horowitz, Cask was founded by developers to build solutions for developers.  Cask’s flagship offering, the Cask Data Application Platform, makes it possible to quickly and easily develop and deploy more powerful applications for Hadoop.


Supercomputing leader Cray provides high-performance data analytics and discovery platforms that help organizations derive value from big data. Cray’s analytics products accelerate time to insight by unifying hardware, software and management into turnkey, converged platforms. Organizations enjoy superior analytic productivity through Cray’s implementation of innovative memory and storage strategies and use of performance networking in its systems. Pre-installed software includes industry-standard Hadoop and Spark frameworks, and Cray’s graph engine for complex relationship analytics.

Cloudwick


Databricks was founded out of the UC Berkeley AMPLab by the creators of Apache Spark. We’ve been working for the past six years on cutting-edge systems to extract value from Big Data. We believe that Big Data is a huge opportunity that is still largely untapped, and we’re working to revolutionize what you can do with it.


Elastic is a company that believes getting insight from data matters. Built around three open source products — Elasticsearch, Logstash, and Kibana — Elastic is extending what’s possible with data, delivering on the promise that good things come from connecting the dots. Designed to help users take data from any source and search, analyze, and visualize it in real time, Elastic products are changing the way organizations get value from data. To learn more, visit www.elastic.co.



Kyvos is committed to unlocking the power of Big Data Analytics with its unique “OLAP on Hadoop” technology. This allows you to build cubes in-place on Hadoop with linear scalability, eliminating the limitations of traditional OLAP solutions, and enabling interactive multi-dimensional analytics on your Big Data. Users can visualize, explore and analyze their data interactively on Hadoop with no programming required. Come and explore Kyvos to experience OLAP on Hadoop at unprecedented scale.


QCT is a global datacenter solution provider extending the power of hyperscale datacenter design in standard and open SKUs to all datacenter customers. Product lines include servers, storage, network switches, integrated rack systems and cloud solutions, all delivering hyperscale efficiency, scalability, reliability, manageability, serviceability and optimized performance for each workload. QCT offers a full spectrum of datacenter products and services from engineering, integration and optimization to global supply chain support, all under one roof.


Qubole delivers a Self-Service Platform for Big Data Analytics built on Amazon, Microsoft and Google Clouds. We were started by the team that built and ran Facebook’s Data Service and authored Apache Hive. With Qubole, a data scientist can now spin up hundreds of clusters on their public cloud of choice and begin creating ad hoc and/or batch queries in under five minutes and have the system autoscale to the optimal compute levels as needed.


Saama Technologies is one of the largest pure-play data science solutions and services companies focused on solving the data management and advanced analytics challenges of the world’s leading brands. Saama has over 15 years of experience implementing business analytics, big data, predictive analytics and data management solutions for global clients.


SK Telecom is the largest mobile operator in South Korea, with over 50% market share. Throughout its 30-year history, SK Telecom has led the evolution of mobile networks by being the first in the world to commercialize CDMA, WCDMA, HSDPA, 150Mbps LTE-A, and 225Mbps LTE-A via Carrier Aggregation (CA). SK Telecom continues to innovate in Big Data solutions as well, utilizing Hadoop technologies such as Apache Tajo.


Skytree®—The Machine Learning Company® is disrupting the Advanced Analytics market with a Machine Learning platform that gives organizations the power to discover deep analytic insights, predict future trends, make recommendations and reveal untapped market and customer opportunities. Skytree’s flagship product—Skytree Infinity™— is an enterprise-ready scalable Machine Learning platform, built from the ground up to work on massive and fast changing datasets with the highest accuracy at unprecedented speed and scale.



StreamSets offers an innovative, new approach for managing data in motion. We help companies automatically reduce the risk and increase the efficiency of continuous data ingestion into big data stores. Today, much of this work is either done manually or by repurposing legacy technologies, leading to great difficulty in ensuring best practices.  StreamSets works with any Hadoop distribution to provide automatic integration, continuous data preparation and intelligent monitoring of ingested data.


Exhibitor Sponsors






Interested in Sponsoring Hadoop Summit?

Hadoop Summit North America 2015 is the premier industry event for Apache Hadoop users, developers and vendors. It is an ideal opportunity to showcase your Hadoop-related products and services to organizations interested in leveraging Apache Hadoop technology to solve their big data challenges.

A variety of sponsorship options are available depending on the exposure you want at Hadoop Summit. Some levels are limited in number, so act now. Special deals are available if you sponsor both Hadoop Summit Europe and Hadoop Summit North America. For more information on becoming a sponsor, please contact:

Jeff Taylor
Hadoop Summit Sponsorship Sales
US Direct: 1.925.997.7831


2014 Hadoop Summit Community Showcase