HPC in the Cloud


Dedicated to covering high-end cloud computing
in science, industry and the datacenter

Language Flags

Universities Receive Number Crunching On Demand from Penguin


Compute intensive workloads can prove taxing to an institution's HPC resources. When applications require more horsepower, the process of upgrading an existing cluster or deploying a new system altogether can slow down research and productivity. There is another alternative, though. Cloud service providers offer scalable on-demand resources, giving organizations an anytime boost to their computational capacity.

Last week, Penguin Computing revealed partnerships between a number of universities and its cloud computing division. Penguin On Demand's (POD) academic clouds make local and remote HPC resources available to these institutions. In the official announcement, Tom Coull, senior vice president and general manager of software and services at Penguin, explained that the model adds compute capacity while reducing upfront costs.

"Penguin Computing has traditionally been very successful with HPC deployments in academic environments with widely varying workloads, many departments competing for resources and very limited budgets for capital expenses," stated Coull.

HPC in the Cloud caught up with Mr. Coull to discuss the POD service and Penguin's partnerships with these institutions.

In each scenario, the university houses computing equipment, which is owned and operated by Penguin. How the resources are used determine the nature of the agreement. In some cases, cycles can be resold to outside users, creating a new revenue stream for IT departments. 

The academic partnerships typically follow one of three models:

Channel Partnership

A channel partnership between academic institutions and Penguin essentially means the user becomes a distributor of the POD service. In this configuration, departments that need to obtain HPC resources can access Penguin's virtual cycles on demand. Again, this is a "pay as you go" model, in which upfront capital costs are transformed into operational overhead. The university receives an umbrella invoice from Penguin and charges individual departments for their usage of compute cycles. 

The agreement may seem exclusive to off-site infrastructures, but it has been in place at Cal Tech with local hardware for nearly two years. When workloads become too compute-intensive for the local cluster, Penguin allows system admins to burst jobs to a company datacenter in Salt Lake City. Coull explained how Cal Tech developed a software suite with Penguin called POD tools, which manages cloud resource interactions.  

"The connection to the cloud is only a small part of the problem," notes Coull. "To really make this useful, you have to be able to migrate data reliably and quickly. You have to be able to migrate applications [ISV software like MathWorks] if you want to run your own applications, and you have to be able to migrate job scripts."

The management software is designed to securely handle workload migration, monitoring and reporting from behind a firewall. Tracking usage from the various departments can be complicated, so Penguin created custom portals to help universities keep up to date with user access.

Hybrid / Channel Model

This is the model currently in place at Indiana University (IU). The public-private partnership engenders a symbiotic relationship wherein Penguin provides and operates a cluster on campus. The university provides the facilities, power, cooling and the Internet connection for the system. In return for their contribution, the university gets to use cycles free of charge.

The system also works in the case of repackaging cycles to external federally-funded research and development centers (FFRDC). Coull gave an example where IU would be able to send cycles to another facility:

UC Berkeley could contact IU and say 'hey, we'd like some cycles.' We can turn that on literally with just a white list.

Hybrid Model

This configuration involves an on-site cluster using a prepaid model. When an institution deploys a system, they typically purchase a number of core hours. In most cases, those hours can be used over the next three years. Penguin also includes tools that enable cloud bursting if more compute power is needed. This is sometimes utilized in the early stages before the local cluster is fully deployed.

"What's more common is that they'll size their cluster based on an average workload and then they'll let the peaks burst out to the cloud. That's a nice model because it's actually a little more cost effective to do that," said Coull.

The menu of offerings gives academic institutions access to essentially infinite compute power in a way that best fits their needs.

Most Read Blogs

Aspen

Feature Articles

CometCloud: Using a Federated HPC-Cloud to Understand Fluid Flow in Microchannels

The ever-growing complexity of scientific and engineering problems continues to pose new computational challenges. Thus, we present a novel federation model that enables end-users with the ability to aggregate heterogeneous resource scale problems. The feasibility of this federation model has been proven, in the context of the UberCloud HPC Experiment, by gathering the most comprehensive information to date on the effects of pillars on microfluid channel flow.
Read more...

CERN, Google, and the Future of Global Science Initiatives

Large-scale, worldwide scientific initiatives rely on some cloud-based system to both coordinate efforts and manage computational efforts at peak times that cannot be contained within the combined in-house HPC resources. Last week at Google I/O, Brookhaven National Lab’s Sergey Panitkin discussed the role of the Google Compute Engine in providing computational support to ATLAS, a detector of high-energy particles at the Large Hadron Collider (LHC).
Read more...

Avoiding Scientific Computing Bottlenecks in the Cloud

Frank Ding, engineering analysis & technical computing manager at Simpson Strong-Tie, discussed the advantages of utilizing the cloud for occasional scientific computing, identified the obstacles to doing so, and proposed workarounds to some of those obstacles.
Read more...

Sponsored Whitepapers

Best Practices in Big Data Storage

05/10/2013 | Cleversafe, Cray, DDN, NetApp, & Panasas | From Wall Street to Hollywood, drug discovery to homeland security, companies and organizations of all sizes and stripes are coming face to face with the challenges – and opportunities – afforded by Big Data. Before anyone can utilize these extraordinary data repositories, however, they must first harness and manage their data stores, and do so utilizing technologies that underscore affordability, security, and scalability.

Exploring the Potential of Heterogeneous Computing

04/02/2012 | AMD | Developers today are just beginning to explore the potential of heterogeneous computing, but the potential for this new paradigm is huge. This brief article reviews how the technology might impact a range of application development areas, including client experiences and cloud-based data management. As platforms like OpenCL continue to evolve, the benefits of heterogeneous computing will become even more accessible. Use this quick article to jump-start your own thinking on heterogeneous computing.

Sponsored Multimedias

Newsletters

Stay informed! Subscribe to HPC in the Cloud email Newsletters.

HPC in the Cloud Update
HPCwire Weekly Update
Digital Manufacturing Report
Datanami
HPCwire Conferences & Events
Job Bank
HPCwire Product Showcases


ISC

HPC Job Bank


Featured Events



  • June 16, 2013 - June 20, 2013
    ISC'13
    Leipzig,
    Germany

  • June 17, 2013 - June 18, 2013
    Forecast 2013
    San Francisco, CA
    United States




HPC in the Cloud Conferences & Events