|
Blog
OwLite
Fits on Chips
SqueezeBits
Subscribe
Open main menu
Search posts...
Internal Traffic (traffic_type=internal)
Accessed from the dashboard.
This session is not logged.
SqueezeBits
Subscribe
Eunik Park
[vLLM vs TensorRT-LLM] #7. Weight-Activation Quantization
This article provides a comparative analysis of the effects of weight-activation quantization on vLLM and TensorRT-LLM frameworks.
Nov 11, 2024
Tech
SqueezeBits
RSS
ยท
Powered by Inblog