Skip to main content

Search This Blog

Tech Bites

Posts

Showing posts with the label uantized model

September 05, 2025

ValueError: Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modules in 32-bit, you need to set llm_int8_enable_fp32_cpu_offload=True and pass a custom device_map to from_pretrained. Check https://huggingface.co/docs/transformers/main/en/main_classes/quantization#offload-between-cpu-and-gpu for more details.

Get link
Facebook
X
Pinterest
Email
Other Apps

Older Posts Home

Powered by Blogger

Theme images by Matt Vince

R.M.Phani Kumar

Archive

2025 41
- September 9
- January 32

2023 41
- July 1
- April 2
- March 38
2022 1
- January 1
2018 2
- March 2
2016 1
- February 1
2015 2
- November 1
- September 1
2014 4
- December 1
- July 2
- May 1
2013 6
2012 2
- July 2
2011 1
- February 1
2009 4
- October 2
- July 2

Show more Show less

Labels

$().SPServices1
2016 features1
Adapters1
Advanced DAX7
Advanced Measures1
Advanced Power BI2
Aggregated Columns1
ALL Function1
amazon ALB1
Amazon AMI1

Show more Show less

Report Abuse