EMC Corporation has announced the Federation Business Data Lake.
The fully engineered solution includes leading storage and big data analytics technologies from EMC Information Infrastructure, Pivotal, and VMware to help customers leverage the new world of big data, thereby clearing the path for new insights and disruptive differentiation.
Implemented in as little as seven days, the Federation Business Data Lake greatly simplifies the massively complex task of building a Data Lake and is designed for speed, self-service and scalability for the enterprise, enabling organisations to begin making better-informed business decisions using big data analytics. The Federation Business Data Lake joins the Enterprise Hybrid Cloud Solution as a converged solution from the EMC Federation that will redefine infrastructure to maximise the speed and agility for IT organisations deploying Hybrid Clouds and Data Lakes.
The incredible potential of big data is being driven first and foremost by the growth of data from traditional applications, modern applications, sensors and intelligent devices along with masses of new public data such as social media feeds. The ability to capture and process that data is now possible because of the growth of inexpensive storage and limitless compute, along with the invention of new technologies that enable real-time analysis and a direct connection to action through new applications and products. These storage and analytics technologies, along with the massive data sets comprise the Business Data Lake.
Business Data Lakes are becoming a top corporate priority because they fill a critical gap left by traditional data warehousing. A Business Data Lake contains structured and unstructured data from a wide variety of sources and the analytics are focused on building models to predict the future. Companies with successful Data Lakes are leveraging the data and predictive models to build new products, applications and business models to redefine their industry, taking or extending the “Market Leader” role.
A highly effective Business Data Lake will provide three critical functions:
* Store: Stores structured and unstructured data for all types of analytics, from many different sources, blending capacity and performance as needed for the analytics use case.
* Analyse: Provides modern data management and analytics tools for all types of analytics including Hadoop-based, In-Memory No-SQL and Scale-out MPP.
* Surface & Act: Provides data to users and applications to enable real-time changes in outcomes and to influence critical decisions.
Until now, building an effective Data Lake has been difficult and complex. IT organisations seeking to deploy a Data Lake must deploy and configure the right analytics platform and the right corresponding storage for each analytics use case, from Hadoop to real-time. Once the environment is created, data must be loaded with all the right access rights and governance applied to the data sets. Deployment of the environment and data sets is a complex and time-consuming task, preventing IT from meeting the needs of business users.
The Federation Business Data Lake Solution makes it easy to deploy a Business Data Lake. Core products from the EMC Federation of Companies, EMC Information Infrastructure, Pivotal and VMware, provide the core functionality of the Federation Business Data Lake meeting the critical functional needs – Store, Analyse, Surface and Act.
The Federation Business Data Lake is a fully engineered solution that can be rapidly and automatically provisioned, enabling IT organisations to lead the needs of the business. The analytics layer is completely virtualised with VMware running on Vblocks with predefined analytics use cases and automated provisioning and configuration. EMC Isilon provides the Data Lake Storage Foundation, delivering the ideal balance of capacity and performance.
The analytics layer is comprised of the Pivotal big data Suite, including PivotalHD, featuring the world’s leading SQL-on-Hadoop engine, HAWQ. Pivotal big data Suite provides enterprise-class SQL, which allows for seamless integration and interoperability with top analytics platforms such as SAS, Tableau and others, over data stored in Hadoop. EMC is also delivering two additional Business Data Lakes to enable integration with customer choice of Hadoop distribution including Cloudera and Hortonworks, along with any future Open Data Platform-based Hadoop distribution.
EMC Data Lake Services:
A full suite of services and education is available with the Federation Business Data Lake to enable customers at varying stages of their Data Lake journey to implement the solution, prove out the value of the solution and quickly identify strategic big data use cases, including:
* EMC Technology Onboarding Service – for customers who are ready to deploy a Data Lake, the EMC Technology Onboarding Service offers full consulting services to install and deploy the Federation Business Data Lake, optimise the analytics environment and configure and customise data requirements.
* EMC Proof of Value Service – for customers who know the use case they want to address but are looking for help implementing the latest big data analytic and rapid application development tools and techniques, the Proof of Value Service demonstrates the ROI of a targeted use case using real customer data.
* EMC big data Vision Workshop – for customers who are undecided about how to start infusing big data into its business strategy, the EMC big data Vision Workshop analyses an organisation’s strategy, business goals and then prioritises a target use case for the start of its big data journey.
* Education Services – in addition to the service offerings above, EMC offers training and certification to develop fundamental as well as advanced big data and Data Science understanding and skills required by business leaders and big data practitioners.