Unlock the Potential of AI
Deploy your AI with Maximal Efficiency
Bringing NPUs into Production: Our Journey with Intel Gaudi
SqueezeBits has partnered with Intel to make Gaudi NPUs more usable in practice. We optimized LLMs and diffusion models for Gaudi-2 and created yetter, a generative AI API service.
[Intel Gaudi] #5. FLUX.1 on Gaudi-2
This article discusses inference efficiency when running the FLUX.1 models on Intel Gaudi-2 hardware.
[Intel Gaudi] #4. FP8 Quantization
In this blog series, we thoroughly evaluate Intel's AI accelerator, the Gaudi series, focusing on its performance, features, and usability.
[Intel Gaudi] #3. Performance Evaluation with SynapseAI v1.19
In this blog series, we thoroughly evaluate Intel's AI accelerator, the Gaudi series, focusing on its performance, features, and usability.