valknar/runpod
cc0f55df38b24e5bccb7c98903e057b2dc4ef101
runpod/vllm/server.py
Sebastian Krüger
cc0f55df38
fix: reduce max_model_len to 20000 to fit in 24GB VRAM
2025-11-23 15:43:37 +01:00
10 KiB