
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs

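To illustrate what "inference engine" means in practice, here is a minimal sketch of offline batch inference with vLLM's Python API. The model name is only an example, and the sampling settings are assumptions; any Hugging Face model supported by vLLM works.

```python
# Minimal sketch of offline batch inference with vLLM.
from vllm import LLM, SamplingParams

prompts = ["The capital of France is"]
# Example sampling settings (values here are assumptions, not defaults).
sampling_params = SamplingParams(temperature=0.8, max_tokens=32)

# LLM loads the model weights and manages GPU memory for the KV cache.
llm = LLM(model="facebook/opt-125m")  # example model

# generate() batches prompts together for high-throughput decoding.
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(output.outputs[0].text)
```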