Microsoft, Tencent and Baidu are adopting CV-CUDA for pc imaginative and prescient AI.
NVIDIA CEO Jensen Huang highlighted work in content material understanding, visible search and deep studying Tuesday as he introduced the beta launch for NVIDIA’s CV-CUDA — an open-source, GPU-accelerated library for pc imaginative and prescient at cloud scale.
“Eighty p.c of web site visitors is video, user-generated video content material is driving vital development and consuming large quantities of energy,” stated Huang in his keynote at NVIDIA’s GTC know-how convention. “We must always speed up all video processing and reclaim the facility.”
CV-CUDA guarantees to assist firms the world over construct and scale end-to-end, AI-based pc imaginative and prescient and picture processing pipelines on GPUs.
Optimizing Web-Scale Visible Computing With AI
Nearly all of web site visitors is video and picture knowledge, driving unbelievable scale in purposes reminiscent of content material creation, visible search and advice, and mapping.
These purposes use a specialised, recurring set of pc imaginative and prescient and image-processing algorithms to course of picture and video knowledge earlier than and after they’re processed by neural networks.

to seek for pictures (pet food, for instance) inside pictures on the Web.
Whereas neural networks are usually GPU accelerated, the pc imaginative and prescient and picture processing algorithms that help them are sometimes CPU bottlenecks in right this moment’s AI purposes.
CV-CUDA helps course of 4x as many streams on a single GPU by transitioning the pre- and post-processing steps from CPU to GPU. In impact, it processes the identical workloads at 1 / 4 of the cloud-computing price.
The CV-CUDA library supplies builders greater than 30 high-performance pc imaginative and prescient algorithms with native Python APIs and zero-copy integration with the PyTorch, TensorFlow2, ONNX and TensorRT machine studying frameworks.
The result’s increased throughput, lowered computing price and a smaller carbon footprint for cloud AI companies.
International Adoption for Pc Imaginative and prescient AI
Adoption by business leaders across the globe highlights the advantages and flexibility of CV-CUDA for a rising variety of large-scale visible purposes. Firms with large picture processing workloads can save tens to lots of of thousands and thousands of {dollars}.
Microsoft is working to combine CV-CUDA into Bing Visible Search, which lets customers search the net utilizing a picture as an alternative of textual content to search out comparable pictures, merchandise and net pages.
In 2019, Microsoft shared at GTC how they’re utilizing NVIDIA applied sciences to assist convey speech recognition, clever solutions, textual content to speech know-how and object detection collectively seamlessly and in actual time.
Tencent has deployed CV-CUDA to speed up its advert creation and content material understanding pipelines, which course of greater than 300,000 movies per day.
The Shenzhen-based multimedia conglomerate has achieved a 20% discount in vitality and price for picture processing over their earlier GPU-optimized pipelines.
And Beijing-based search large Baidu is integrating CV-CUDA into FastDeploy, one of many open-source deployment toolkits of the PaddlePaddle Deep Studying Framework, which allows seamless pc imaginative and prescient acceleration to builders within the open-source neighborhood.
From Content material Creation to Automotive Use Circumstances
Purposes for CV-CUDA are rising. Greater than 500 firms have reached out with over 100 use instances in simply the primary few months of the alpha launch.
In content material creation and e-commerce, pictures use pre- and post-processing operators to assist recommender engines acknowledge, find and curate content material.
In mapping, video ingested from mapping survey automobiles requires preprocessing and post-processing operators to coach neural networks within the cloud to establish infrastructure and street options.
In infrastructure purposes for self-driving simulation and validation software program, CV-CUDA allows GPU acceleration for algorithms which are already occurring within the automobile, reminiscent of shade conversion, distortion correction, convolution and bilateral filtering.
Trying to the long run, generative AI is reworking the world of video content material creation and curation, permitting creators to achieve a worldwide viewers.
New York-based startup Runway has built-in CV-CUDA, assuaging a important bottleneck in preprocessing high-resolution movies of their video object segmentation mannequin.
Implementing CV-CUDA led to a 3.6x speedup, enabling Runway to optimize real-time, click-to-content responses throughout its suite of creation instruments.
“For creators, each second it takes to convey an thought to life counts,” stated Cristóbal Valenzuela, co-founder and CEO of Runway. “The distinction CV-CUDA makes is extremely significant for the thousands and thousands of creators utilizing our instruments.”
To entry CV-CUDA, go to the CV-CUDA GitHub.
Or study extra by testing the GTC classes that includes CV-CUDA. Registration is free.