The Core Engine Behind Intelligent Computing
Experience an infrastructure designed to build high-performance AI and HPC applications
Key Features
Key Feature 1
Blazing-Fast Matrix Operators for Large-Scale Real-Time Data
A high-performance matrix computation framework for AI and HPC that delivers optimized operators for image and signal processing, linear algebra, and more, outperforming open-source alternatives.
Guarantees reliable, low-latency performance even in the most compute-intensive AI and HPC environments by performing complex computations on massive data in real time.
Key Feature 2
Optimized for Diverse Hardware and Platforms
Provides optimized operators for NVIDIA and AMD GPUs and for x64 and ARM CPUs, and supports multiple operating systems, including Windows and Linux, for exceptional performance on each supported platform.
Develop and scale applications across diverse environments while staying agile as infrastructures evolve.
Key Feature 3
Developer-Friendly APIs and Tools for Multimedia Workloads
A single integrated toolkit that provides video streaming and image processing, image and video decoding, FFT, plot rendering, profiling, and other multimedia capabilities, along with easy-to-use developer tools.
Build, deploy, and run high-performance, stable applications through a single unified interface, without assembling extra libraries.
Key Feature 4
Compiler-Based Optimization for Custom Operators
JETWAVE™ leverages advanced compiler technology* so that developers can easily implement custom operators that are automatically optimized for each target hardware architecture, just like built-in operators.
Effectively address complex, specialized application requirements and enable the development of advanced AI and HPC solutions that deliver high performance and full functionality.
In Progress...
Morph Compiler
Polyhedral-Based Optimization
Self-Definable & Multi-Level Intermediate Representation
Modern Programming Language Features
Hardware-Aware Compilation and Acceleration (CPU, GPU, and NPU)
Comparisons
Vectorized CPU, 16K Image Resize: elapsed time in milliseconds for PyTorch, OpenCV, and JETWAVE (chart)
GPGPU, 16K Image Resize: elapsed time in milliseconds for PyTorch, OpenCV, and JETWAVE (chart)
GPGPU, Matrix Convolution (Float32, 512x512 * 3x3): elapsed time in milliseconds for PyTorch and JETWAVE (chart)
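For readers unfamiliar with the workload in the convolution comparison, the following is a minimal reference sketch of a 3x3 filter applied to a 512x512 matrix, timed in milliseconds. It is not JETWAVE's or PyTorch's API. The names `conv2d_valid` and `elapsed_ms` are hypothetical, the "valid" (no padding) boundary handling is an assumption because the page does not state it, and plain Python floats are 64-bit rather than Float32. The intent is only to show what is being computed and how an elapsed-time figure might be taken.

```python
import time


def conv2d_valid(image, kernel):
    """Direct sliding-window filter in 'valid' mode (no padding).

    As in most deep-learning "convolution" operators, the kernel is not
    flipped, so this is technically cross-correlation.
    """
    ih, iw = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    oh, ow = ih - kh + 1, iw - kw + 1
    out = [[0.0] * ow for _ in range(oh)]
    for y in range(oh):
        for x in range(ow):
            acc = 0.0
            for dy in range(kh):
                row = image[y + dy]
                krow = kernel[dy]
                for dx in range(kw):
                    acc += row[x + dx] * krow[dx]
            out[y][x] = acc
    return out


def elapsed_ms(fn, *args):
    """Run fn(*args) once and return (result, wall-clock milliseconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, (time.perf_counter() - start) * 1000.0


if __name__ == "__main__":
    size = 512  # matches the 512x512 input named in the comparison
    image = [[float((x + y) % 7) for x in range(size)] for y in range(size)]
    box = [[1.0 / 9.0] * 3 for _ in range(3)]  # 3x3 box-blur kernel
    out, ms = elapsed_ms(conv2d_valid, image, box)
    print(f"{len(out)}x{len(out[0])} output in {ms:.1f} ms")
```

An interpreted loop like this is far slower than any optimized operator, so it is useful as a correctness reference to check results against, not as a performance baseline.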
Tracking and Mosaicizing People Application Benchmark
Python
PyTorch, FFmpeg, OpenCV, NumPy
Elapsed Time: 90,022 ms
Average FPS: 27.7 fps

Optimized Python
NVIDIA TensorRT, NVIDIA Decoder, NumPy, OpenCV
Elapsed Time: 58,349 ms
Average FPS: 42.8 fps

JETWAVE™
JETWAVE Standard Libraries
Elapsed Time: 15,525 ms (x5.7 faster)
Average FPS: 161.1 fps
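The benchmark figures above can be cross-checked with a little arithmetic. The sketch below uses only the numbers printed on this page. The helper names `speedup` and `implied_frames` are hypothetical, and the assumption that average FPS equals frames processed divided by elapsed time is ours, since the page does not describe its measurement method.

```python
# Figures copied from the benchmark above: (elapsed ms, average fps).
results = {
    "Python": (90_022, 27.7),
    "Optimized Python": (58_349, 42.8),
    "JETWAVE": (15_525, 161.1),
}


def speedup(baseline_ms, candidate_ms):
    """How many times faster the candidate is than the baseline."""
    return baseline_ms / candidate_ms


def implied_frames(elapsed_ms, fps):
    """Frame count implied by fps * seconds (assumes fps = frames / time)."""
    return fps * elapsed_ms / 1000.0


jetwave_ms = results["JETWAVE"][0]
for name, (ms, fps) in results.items():
    print(
        f"{name}: JETWAVE is {speedup(ms, jetwave_ms):.2f}x faster, "
        f"~{implied_frames(ms, fps):.0f} frames processed"
    )
```

Under these assumptions the ratio against the plain Python pipeline comes out at about 5.8, in line with the quoted "x5.7 faster", and about 3.8 against the optimized Python pipeline. All three runs imply roughly 2,500 frames, which suggests the same input video was used for each.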