DeepCube, the award-winning deep learning pioneer, today announced the launch of a new suite of products and services to help drive enterprise adoption of deep learning, at scale, on intelligent edge devices and in data centers.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20210224005217/en/
DeepCube end-to-end workflow (Graphic: Business Wire)
The offerings build on DeepCube’s patented platform, which is the industry’s first software-based deep learning accelerator that drastically improves performance on any existing hardware. Now, DeepCube will offer solutions for neural network training and inference, allowing users to leverage DeepCube’s technology to address challenges in their deep learning pipeline. Additionally, a new service offering will make available DeepCube’s team of leading AI experts to support deep learning projects.
DeepCube’s new offerings include:
- CubeIQ: the first fully automated training framework that can take a model and surgically eliminate unnecessary parameters to ensure that physical, real-world constraints and profiles are met. CubeIQ trains models with a significant reduction in size, and with pre-knowledge of the end target and environment. This leads to a drastic speed increase, minimized compute footprint, and efficient edge deployment – all while maintaining accuracy.
- CubeEngine: an inference engine designed to run next-generation deep learning models for optimal performance. CubeEngine is designed to accelerate CubeIQ generated models by dynamically assigning optimal kernels suitable for the specific hardware and model execution. CubeEngine is architected as a composable inference engine, unlike prior generation monolithic inference engines.
- CubeAdvisor: an expert-level service offered to leverage DeepCube’s wide-ranging ML experience, with guidance from some of the world’s leading AI experts and PhDs. It helps customers design, optimize, and deploy deep learning models, ensuring that customers achieve the best performing model that fits their strict cost, performance, power, and latency requirements.
To trial the new suite of products, DeepCube utilized 2nd Gen AMD EPYC™ based cloud instances and the new DeepCube solutions to showcase high levels of inference performance on a multitude of popular neural networks, including ResNet-50, BERT-Large, and DLRM. To read more about this sweep of AI benchmarking on 2nd Gen AMD EPYC™ based cloud instances, stay tuned for an upcoming blog post.
“The offerings announced by DeepCube today are the culmination of decades of work and research by some of the world’s leading experts in deep learning,” said Michael Zimmerman, CEO at DeepCube. “We have long been focused on solving the technical challenges of training and inference for next-generation deep learning models, which is no easy feat – this is proven by the fact that so many enterprises are still unable to take their AI models out of the research stage. But we’re confident in the power of our patented technology, and by commercializing it through CubeIQ, CubeEngine and CubeAdvisor, we’re taking steps toward democratizing deep learning across industries.”
“AMD worked with DeepCube to preview the new solutions on cloud instances using 2nd Gen AMD EPYC processors, and saw fantastic performance for deep learning workloads,” said Kumaran Siva, corporate vice president, EPYC Cloud Business, AMD. “We believe that DeepCube’s innovative CubeIQ and CubeEngine products, coupled with optimizations specific to current and future generation AMD EPYC CPUs, will set a new bar for performance and business metrics, such as inference throughput, latency, and performance/dollar.”
For qualifying parties, DeepCube is offering a free trial license to CubeIQ and CubeEngine, with CubeAdvisor as an option.
For more information on 2nd Gen AMD EPYC processors, visit https://www.amd.com/en/processors/epyc-7002-series.
DeepCube is an award-winning deep learning pioneer that provides the industry’s first software-based deep learning acceleration platform that drastically improves performance on existing hardware. Modeled after the way the human brain develops during childhood, DeepCube’s patented technology is the first to be purpose-built for deployment of deep learning models in data centers and on intelligent edge devices. Its proprietary framework can be deployed on top of any existing hardware, resulting in drastic speed improvement and memory reduction. Led by a team of experienced deep learning researchers and developers, DeepCube has patented numerous innovations, including methods for faster and more accurate training of deep learning models, and drastically improved inference performance.
AMD, the AMD arrow logo, EPYC, and combinations thereof are trademarks of Advanced Micro Devices, Inc.