Екатерина Ештокина
# https://developer.download.nvidia.com/cg/asin.html
You can change --threads 32 to set the number of CPU threads, --ctx-size 16384 to set the context length, and --n-gpu-layers 2 to set how many layers are offloaded to the GPU. If your GPU runs out of memory, try lowering --n-gpu-layers. For CPU-only inference, remove --n-gpu-layers entirely.
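As a rough illustration of how these three flags combine, the sketch below assembles a command line in Python and drops --n-gpu-layers for the CPU-only case. The binary name ./llama-cli, the --model flag, and the file name model.gguf are assumptions for illustration only and do not come from the text above; substitute whatever your setup actually uses.

```python
import shlex


def build_command(binary, model_path, threads=32, ctx_size=16384, n_gpu_layers=2):
    """Build an argument list from the three flags discussed above.

    binary and model_path are hypothetical placeholders.
    Pass n_gpu_layers=None for CPU-only inference.
    """
    args = [
        binary,
        "--model", model_path,          # assumed flag, not named in the text
        "--threads", str(threads),      # number of CPU threads
        "--ctx-size", str(ctx_size),    # context length
    ]
    # Leave the offload flag out entirely when running on CPU only.
    if n_gpu_layers is not None:
        args += ["--n-gpu-layers", str(n_gpu_layers)]
    return args


gpu_cmd = build_command("./llama-cli", "model.gguf")
cpu_cmd = build_command("./llama-cli", "model.gguf", n_gpu_layers=None)

# Print shell-quoted command lines for inspection; nothing is executed.
print(shlex.join(gpu_cmd))
print(shlex.join(cpu_cmd))
```

If the GPU runs out of memory, calling build_command with a smaller n_gpu_layers value is the equivalent of lowering the flag by hand.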