October 02, 2012
Utility supercomputing leader and stem cell researcher team up to index gene expression
NEW YORK, Oct. 2 — Cycle Computing today announced that its inaugural Big Science Challenge winner, Victor Ruotti of the Morgridge Institute for Research, has successfully completed his breakthrough utility supercomputing run. A computational biologist, Ruotti applied his $10,000 of CycleCloud computation time and $9,500 of credit from Amazon Web Services (AWS) to begin constructing a knowledge base indexing system for stem cells and their derivatives. Big Data met Big Compute as Cycle's software ran one million compute hours against 78 terabytes of data for Ruotti on AWS: more than a compute century of run-time in just one week, for roughly the cost of four servers, or about one-hundredth the cost of buying an equivalent cluster.
Ruotti works at the Morgridge Institute for Research as part of the regenerative biology team, in the laboratory of stem cell researcher James Thomson, who in 1998 was the first to successfully isolate human embryonic stem cells. Using the power of utility supercomputing, Ruotti is furthering research on stem cells and their derivatives by creating an indexing system of the cells. The index will allow researchers to quickly classify cells based on their expression patterns and to identify genes and regions of the genome that are critical for establishing and maintaining cell states with potential clinical applications.
"By using Cycle's utility supercomputing software, and infrastructure from Amazon Web Services, we were able to run 115 years of computation in just one week," said Ruotti. "We now have the components needed to build an index to help identify cells in a laboratory setting, based upon the genes that have been expressed. The goal is to use these results to build a database to speed development of potential therapies using stem cells. The emergence of utility supercomputing as an available and affordable research tool could completely transform the class of problem we can solve, enabling larger breakthroughs than were possible before."
Ruotti's run totaled 1,003,404 core-hours and processed 11,955 pairs of samples. Compute was priced at $0.0175 per core-hour, and the run cost $19,555 in total. Buying 400 servers to build an equivalent cluster would have cost roughly 100 times as much as this run, not including the cost of 78 TB of storage. The run harnessed 5,000 cores on average and 8,000 cores at peak, and used 78 TB of storage in the AWS cloud.
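As a rough sanity check of the figures quoted above, the short Python sketch below reconstructs the cost and run-time arithmetic. The per-server purchase price is not stated in the release and is only implied by the "cost of four servers" comparison, and the gap between the compute-only product and the $19,555 total is presumed here to cover storage and other charges; both are assumptions for illustration, not figures from the announcement.

```python
# Back-of-the-envelope check of the figures quoted in the release.
# NOTE: the per-server price is an assumption, implied only by the
# "cost of four servers" comparison in the announcement.

core_hours = 1_003_404          # total core-hours reported for the run
price_per_core_hour = 0.0175    # quoted compute price, USD
total_run_cost = 19_555         # quoted total cost of the run, USD
cluster_servers = 400           # size of the equivalent purchased cluster

# "115 years of computation in just one week"
hours_per_year = 24 * 365.25
print(f"Compute years: {core_hours / hours_per_year:.0f}")   # ~114-115 years

compute_only = core_hours * price_per_core_hour
print(f"Compute-only cost: ${compute_only:,.0f}")             # ~$17,560
print(f"Remainder (presumably storage and other charges): "
      f"${total_run_cost - compute_only:,.0f}")

# The release says the run cost about as much as four servers, and that a
# 400-server cluster would cost ~100x the run; the two claims are consistent,
# since 400 servers / 4 servers = 100.
implied_price_per_server = total_run_cost / 4                  # assumed, ~$4,900
implied_cluster_cost = implied_price_per_server * cluster_servers
print(f"Implied cluster cost: ${implied_cluster_cost:,.0f} "
      f"(~{implied_cluster_cost / total_run_cost:.0f}x the run)")
```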
The Big Science Challenge winner was selected based on the project's long-term benefit to humanity and its originality, creativity and suitability to run on CycleCloud clusters launched within AWS. The finalists were judged by Jason Stowe, CEO of Cycle Computing, and a panel of industry luminaries, including Kevin Davies, editor-in-chief of Bio-IT World; Matt Wood, technology evangelist for AWS; and Peter S. Shenkin, vice president at Schrödinger.
AWS provided an additional $9,500 in credits for the winner of the Big Science Challenge, and Matt Wood, product manager for big data and high performance computing at AWS, participated as a judge. He said, "AWS provides the resources to allow scientists to deliver on the vision of their research, by removing the constraints of traditional IT with a low cost, productive, utility environment. We congratulate Victor Ruotti on this great accomplishment."
"Cycle launched the Big Science Challenge to give researchers like Victor the opportunity to take advantage of utility supercomputing technology to do science that could benefit humanity," said Stowe. "This million hour run exemplifies the classes of breakthroughs possible with utility supercomputing, made possible by the cloud and Cycle's software. We are very excited about Victor's inventive work to move forward the state of the art in IPS experimentation, and applaud his efforts to make stem cells more accessible for disease treatment."
Cycle Computing is also pleased to announce the next Big Science Challenge, which will open for applications in the coming weeks. For more information, please see the Cycle Computing blog at http://blog.cyclecomputing.com.
About Cycle Computing:
Cycle Computing is the leader in Utility Supercomputing software. A bootstrapped, profitable software company, Cycle has delivered proven, secure and flexible high performance computing (HPC) and data solutions since 2005. Cycle helps clients maximize existing infrastructure and speed computations on servers, VMs, and on-demand in the cloud. Cycle's products help clients maximize internal infrastructure and scale computing power as research demands grow, as with the 10,000-core cluster for Genentech and the 30,000+ core cluster for a Top 5 pharmaceutical company that were covered in Wired, The Register, BusinessWeek, Bio-IT World, and Forbes. Starting with three initial Fortune 100 clients, Cycle has grown to deploy proven implementations at Fortune 500s, SMBs, and government and academic institutions, including JP Morgan Chase, Purdue University, Pfizer and Lockheed Martin.
Source: Cycle Computing