It seems like register_offload_parameter is trying to offload the parameter to CPU or some other non-GPU device, but it may not actually be working. Possibly the offloading framework isn't set up properly, some condition isn't met, or the dict it offloads to is itself still in GPU memory. Either way, let's try the simple thing first: skip creating the parameter and explicitly delete weight_data.
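Here's a rough sketch of what that simple thing could look like, mostly so we can check whether GPU memory actually drops. It does not call register_offload_parameter at all. The helper names (gpu_bytes, stash_on_cpu), the cpu_store dict, the "layer0.weight" key, and the tensor shape are all made up for illustration, and it assumes weight_data is an ordinary torch.Tensor. On a machine without a GPU it falls back to CPU and the byte counts just read 0.

```python
import gc

import torch


def gpu_bytes() -> int:
    """Bytes currently held by live tensors on the GPU (0 when no GPU is present)."""
    return torch.cuda.memory_allocated() if torch.cuda.is_available() else 0


def stash_on_cpu(store: dict, name: str, weight_data: torch.Tensor) -> None:
    """Hypothetical helper: keep a CPU copy in a plain dict, without wrapping it in nn.Parameter."""
    store[name] = weight_data.detach().to("cpu")


device = "cuda" if torch.cuda.is_available() else "cpu"
cpu_store = {}  # assumption: a plain dict stands in for wherever the weights should end up

before = gpu_bytes()

# Stand-in for the real weight; in practice this comes from the loading code.
weight_data = torch.empty(1024, 1024, device=device)
stash_on_cpu(cpu_store, "layer0.weight", weight_data)

# Drop our reference explicitly, then ask the caching allocator to release unused blocks.
del weight_data
gc.collect()
if torch.cuda.is_available():
    torch.cuda.empty_cache()

after = gpu_bytes()
print(f"GPU bytes before: {before}, after: {after}")
```

If the after number stays well above the before number in the real code, something else is still holding a reference to the GPU tensor, which would point back at the offloading setup rather than at weight_data itself.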