Intersect360 HPC500 HPC Job Bank
HPC in the Cloud


Dedicated to covering high-end cloud computing
in science, industry and the datacenter

Language Flags

Datanami
Digital Manufacturing Report
HPCwire

DataStax Rewires Hadoop for Low-Latency Applications With Apache Cassandra


BURLINGAME, Calif., March 24, 2011 -- GIGAOM STRUCTURE CONF. -- Today, industry leaders in the big data community converged on New York City to discuss the best technologies for managing and harnessing ever-increasing volumes of data. At the Structure Big Data conference today, DataStax the commercial leader in Apache Cassandra, unveiled Brisk, a new distribution that enhances the Hadoop and Hive platform with scalable low-latency data capabilities. This results in a single platform that can act as the low-latency database for extremely high-volume web and real-time applications while providing tightly coupled Hadoop and Hive analytics. 

"The challenge of 'big data' is twofold. The analytical side is well understood and served by Hadoop and Hive. However we live in a real-time world, and the ability for applications to interact with big data at low-latency is equally important," said Matt Pfeil, CEO and co-founder, DataStax.  "Apache Cassandra was bred for big data, real time scenarios, and using it to power Apache Hive and Apache Hadoop gives users a single solution that serves both needs."

DataStax' Brisk is an enhanced open-source Hadoop and Hive distribution that utilizes Cassandra for many of its core services. Brisk provides integrated Hadoop MapReduce, Hive and job and task tracking capabilities, while providing an HDFS-compatible storage layer powered by Cassandra. It also exposes the full power of Cassandra for real-time applications. The result is a single integrated solution that provides increased reliability, simpler deployment and lower TCO than traditional Hadoop solutions.

A key benefit of DataStax' Brisk is the tight feedback loop it allows between real-time application and the analytics that follow. Traditionally, users would be forced to move data between systems via complex ETL processes, or perform both functions on the same system with the risk of one impacting the other.

"By marrying the power of Cassandra – including its simplicity, scalability and speedy reads / writes – to Hadoop, DataStax has created a powerful system that speeds up the time between data creation and analysis." said Tim Estes, CEO of Digital Reasoning. "We can count on some of Cassandra's unique capabilities to aid projects that have multiple datacenter locations and large and complex bulk ingest demands. We've been thrilled to work with the DataStax team to push its capabilities into some of the most demanding customers- particularly in the Defense and Intelligence Community."

DataStax' Brisk Uses:

High-volume websites – Provide real-time data access and storage for millions of simultaneous users. Directly perform Hive analysis on the latest data, and immediately feed analytic insights back into the application behavior.

Finance and capital markets – Process, store and trigger actions based on a high-volume real-time event stream. Perform analytics on historical data, and update models directly into the application.

Retail - Maintain real-time summaries and aggregates to allow a continuously up-to-date view of important business metrics. Alert when anomalies occur.

High-volume event processing - Track and react instantly to millions of sensors or other distributed feeds, while allowing deeper analytic questions to be asked of the historical data at any moment.

DataStax' Brisk, a new Hadoop and Hive distribution, will be available under Apache open-source license within 45 days of this announcement.

About DataStax

DataStax, the commercial leader in Apache Cassandra, offers products and services that make it easy for customers to build, deploy and operate elastically scalable and cloud-optimized applications and data services. The company has over 65 customers, including leaders such as Netflix, Cisco, Rackspace and Constant Contact, and spanning verticals including web, financial services, telecommunications, logistics and government. DataStax is backed by industry leading investors, including Lightspeed Venture Partners, Sequoia Capital and Rackspace Hosting, and is based in Burlingame, CA with offices in Austin, TX and Stamford, CT. For more information, visit www.datastax.com.

About Apache Cassandra

Apache Cassandra™ is an open source distributed database management system. It is designed to store and allow very low-latency access to very large amounts of data spread out across many commodity servers while providing a highly available service with no single point of failure. This next-generation data platform evolved from work at Google, Amazon and Facebook, and is an Apache Software Foundation top-level project.

For more information, visit http://cassandra.apache.org/.

------

Source: DataStax

May 18, 2012

May 17, 2012

May 16, 2012

May 15, 2012

May 14, 2012

May 11, 2012

May 10, 2012

May 09, 2012

May 08, 2012


Most Read Features

Most Read Around the Web

Most Read This Just In

Most Read Blogs

Arkeia

Feature Articles

Cloud Services Satisfy a Higher Calling

Higher education involves many collaborative projects that lend themselves to cloud services, however often those services are not tailored to the uniqueness of an academic environment. That's where the Internet2 NET+ project comes in. By partnering with 16 major cloud providers, the networking consortium is seeking to expedite the delivery of cloud services and by doing so advance research and innovation in the United States.
Read more...

Around the Web

NVIDIA Raises Its Game to the Cloud

May 17, 2012 | NVIDIA GeForce GRID, a cloud gaming platform announced at the 2012 GPU Technology Conference (GTC), seeks to reduce the the latency associated with cloud gaming.
Read more...

Breaking the Cloud Barrier

May 15, 2012 | New Microsoft report shows that beyond the expected financial benefits, cloud services may offer more comprehensive security features compared to in-house IT operations.
Read more...

Vendors Demo Next-Gen Sequencing Platforms for Pharma

May 14, 2012 | During the second annual Pistoia Alliance conference, three teams demonstrated their newly-implemented cloud-based next-generation sequencing platforms.
Read more...

Zunicore Offers Bare Metal by the Hour

May 10, 2012 | PEER1's cloud division, Zunicore, will soon be offering GPU-equipped servers on-demand.
Read more...

US Cloud Providers Struggle With Data Privacy Laws

May 08, 2012 | The Patriot Act leads foreign governments to question the security of US cloud services.
Read more...

Sponsored Whitepapers

Appro White Paper: Enabling Performance-per-Watt Gains in HPC

04/05/2012 | Appro | Designed to meet the growing global demand for HPC solutions, Appro's Xtreme-X™ Supercomputer delivers superior performance-per-watt and reduced I/O latency while bringing significant flexibility to HPC workload configurations including capacity, hybrid, data intensive and capability computing.

Exploring the Potential of Heterogeneous Computing

04/02/2012 | AMD | Developers today are just beginning to explore the potential of heterogeneous computing, but the potential for this new paradigm is huge. This brief article reviews how the technology might impact a range of application development areas, including client experiences and cloud-based data management. As platforms like OpenCL continue to evolve, the benefits of heterogeneous computing will become even more accessible. Use this quick article to jump-start your own thinking on heterogeneous computing.

Sponsored Multimedia

Newsletters

Intersect360 HPC500

HPC Job Bank


Featured Events









HPC in the Cloud Conferences & Events