Home

FlexGen

Contributors
13
Edited Time Notion
2023/03/12 16:21
Full Description
Running large language models on a single GPU for throughput-oriented scenarios.
Last Commit
2023/03/12
Forks
343
Description
Running large language models on a single GPU for throughput-oriented scenarios.
Last Update
2023/03/12
Created
2023/02/15
Stars
6587
Created Time Notion
2023/03/12 16:21
Score
0.1
Category
Trending Repos
Days Since
17
TOP