March 21, 2012
Direct access from databases, files, data warehouses and applications
NEW YORK, March 21 - MapR Technologies, Inc., the provider of the industry's most advanced distribution for Apache Hadoop, today announced at GigaOm's Structure:Data event a comprehensive set of data connection options for Hadoop, enabling a wide range of data ingress and egress alternatives for customers. These include direct file-based access using standard tools and file-based applications; direct database connectivity; Hadoop-specific connectors via Sqoop, Flume and Hive; and direct access to popular data warehouses and applications using custom connectors.
Additionally, technology providers Pentaho and Talend have embraced MapR with alliance partnerships to provide direct integration with MapR's Distribution. MapR has also entered into a partnership with leading data warehouse and business intelligence platform vendor Tableau Software to ensure customers benefit from MapR's comprehensive data connection options. These partners join Informatica to further deliver a wide range of connectivity alternatives for Big Data analytics by extending the dependability, ease of use and performance of the MapR Distribution.
With the exponential growth in unstructured data, fast, dependable and easy access to relevant data is crucial to extracting business insight from Big Data analysis. MapR's Distribution enables customers using Hadoop Big Data software to realize significant improvements in performance, reliability and ease of use.
"These partnerships deliver a wide range of connectivity capabilities for Big Data analytics," said Jack Norris, vice president of marketing for MapR. "Customers benefit from the ability to easily integrate leading data warehousing and business intelligence platforms with Hadoop and the deep integration, dependability, ease of use and performance of the MapR Distribution."
Partners Accelerate Business Goals by Leveraging Data Connection Options
Technology companies are collaborating with MapR and choosing the MapR Distribution as their distribution of choice to build, deploy and sell differentiated offerings for Hadoop. This enables customers to access data via standard tools and file-based applications.
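As an illustration of file-based access, once cluster storage is mounted over NFS, ordinary file APIs can read and write it. The following is a minimal Python sketch under stated assumptions: the mount path (for example /mnt/cluster), the file name and the helper functions are hypothetical, and a temporary directory stands in for the mount so the snippet runs anywhere.

```python
import csv
import tempfile
from pathlib import Path


def write_records(mount_point, relative_path, rows):
    """Write rows as CSV under the mount point using plain file I/O."""
    target = Path(mount_point) / relative_path
    target.parent.mkdir(parents=True, exist_ok=True)
    with target.open("w", newline="") as handle:
        csv.writer(handle).writerows(rows)
    return target


def read_records(path):
    """Read the CSV back with the same standard-library tools."""
    with Path(path).open(newline="") as handle:
        return list(csv.reader(handle))


# A temporary directory stands in for a hypothetical NFS mount such as /mnt/cluster.
with tempfile.TemporaryDirectory() as mount_point:
    written = write_records(
        mount_point,
        "ingest/events.csv",
        [["id", "event"], ["1", "login"]],
    )
    records = read_records(written)
```

No Hadoop-specific client library appears in the sketch; the same code would apply to any directory, which is the point of exposing cluster storage as a standard file system.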
Pentaho and MapR
With MapR's partnership with Pentaho, the MapR Distribution is integrated natively with Pentaho Kettle. Distributed under the Apache License, Pentaho Kettle adds visual tools to input, extract, manipulate, report, visualize and explore data in Hadoop. Certified and tested with MapR, Pentaho Kettle also provides scheduling, job orchestration, workflow, scalable deployment across entire Hadoop clusters, and native connectivity to HDFS, HBase, Hive, Pig and MapR's NFS. Users will benefit from a seamless on-ramp to Pentaho Business Analytics, a complete end-to-end solution that enables users to intuitively access, discover and analyze their data, empowering them to make information-driven decisions that positively impact their organization's performance.
"The partnership between Pentaho and MapR technologies is a natural match to ensure maximized performance and provide easy to use tools to get value from data via exploration, data discovery, reporting and analytics," said Eddie White, executive vice president, business development, Pentaho. "Together, Pentaho and MapR provide a dramatic boost in Hadoop developer productivity and extend usability to a much broader spectrum of developers, data scientists and business analysts."
Talend Integration with MapR
MapR also directly integrates with Talend Open Studio for Big Data, the leading Apache-licensed big data integration solution. Talend Open Studio for Big Data has recently been certified for use with the MapR Distribution and offers native connectivity to HDFS, HBase, Hive, Pig and Sqoop, as well as to MapR's Direct Access NFS for high-performance streaming. The MapR integration extends to the Talend Platform for Big Data, a comprehensive enterprise solution for data integration and data quality in big data environments, and provides integration of Hadoop with a broad variety of databases, packaged applications, files, legacy systems, and SaaS and social media platforms.
"We are excited to be contributing the breadth of our integration technology to facilitate Hadoop's deployment and comprehensiveness," said Keith Goldstein, vice president of worldwide channels and alliances for Talend. "By providing a graphical development layer that abstracts the technical complexity of Hadoop, we are taking the difficulty out of Hadoop integration. Talend makes it easy to bring relevant enterprise data into Hadoop, cleanse this data and move it to analytics platforms – all of this without coding."
Tableau Software and MapR
The new MapR ODBC driver will enable Tableau to quickly and easily access data stored in Hadoop. Customers will simply point Tableau to the MapR Distribution and instantly create reports, data visualizations and dashboards without any programming or coding. Tableau, positioned as a "challenger" by Gartner in its Business Intelligence platforms report, helps anyone quickly and easily analyze, visualize and share information. Native Tableau connectivity to MapR will be generally available later this spring.
"Our partnership with MapR will further Tableau's mission to help people see and understand data, no matter where it resides," said Dan Jewett, vice president of product management at Tableau Software. "Customers will be able to gain valuable insights from their data without needing any special configuration or technical skills typically required to operate Hadoop."
MapR's Open Database Connector
In addition to specific connectors and partnerships, MapR includes the most complete ODBC driver for the widest support of applications for data analysis and reporting. The driver was developed with Simba Technologies, the recognized industry expert in ODBC that originally co-developed the ODBC specification with Microsoft, and it enables virtually any ODBC-enabled application to connect to and access data within MapR's Hadoop/Hive implementation. No special add-ins or optimizations are needed, unlike ODBC drivers supplied with other distributions.
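To give a rough sense of what connecting an ODBC-enabled application involves, the Python sketch below assembles an ODBC connection string and a HiveQL query. Every specific here is an assumption for illustration, not taken from MapR or Simba documentation: the driver name, host, port, table name and the `HOST`/`PORT` keywords are hypothetical placeholders, since real keywords vary by driver. The actual connection step relies on the third-party pyodbc package and an installed driver, so it is shown only as a comment.

```python
def build_connection_string(driver, host, port, **extra):
    """Assemble a semicolon-delimited ODBC connection string."""
    parts = {"DRIVER": "{" + driver + "}", "HOST": host, "PORT": str(port)}
    parts.update(extra)
    return ";".join(f"{key}={value}" for key, value in parts.items())


# All values below are hypothetical placeholders, not documented defaults.
CONN_STR = build_connection_string(
    driver="Hypothetical Hive ODBC Driver",
    host="hive.example.com",
    port=10000,
)

# A HiveQL query against a hypothetical table.
QUERY = "SELECT page, COUNT(*) AS hits FROM web_logs GROUP BY page"

if __name__ == "__main__":
    print(CONN_STR)
    # With a driver installed and pyodbc available, an application could run:
    #   import pyodbc
    #   connection = pyodbc.connect(CONN_STR, autocommit=True)
    #   rows = connection.cursor().execute(QUERY).fetchall()
```

Reporting and visualization tools typically hide this step behind a data-source dialog, but the underlying mechanism is the same: a driver name plus connection attributes, followed by SQL-style queries.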
About MapR Technologies
MapR delivers on the promise of Hadoop, making managing and analyzing Big Data a reality for more business users. The award-winning MapR Distribution brings unprecedented dependability, speed and ease-of-use to Hadoop. Combined with data protection and business continuity, MapR enables customers to harness the power of Big Data analytics. The company is headquartered in San Jose, Calif. Investors include Lightspeed Venture Partners, NEA and Redpoint Ventures. To download the latest MapR Distribution for Apache Hadoop, visit http://www.mapr.com/products/download.
-----
Source: MapR