modelscope Qwen/Qwen3-VL-8B-Instruct ValueError: Image features and image tokens do not match: tok

高老师 · 5 months ago (10-20) · 大杂烩 (Miscellany)

Running inference on Qwen/Qwen3-VL-8B-Instruct with modelscope fails with the error below. The official example triggers it as well:

ValueError: Image features and image tokens do not match: tokens: 2752, features 2752
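
For context, an error of this kind usually means the image placeholder tokens in the prompt and the features produced by the vision encoder do not line up. The following is only a toy sketch of such a consistency check, not the library's actual code: `IMAGE_TOKEN_ID` and `check_image_alignment` are hypothetical names made up for illustration. Note that the two counts in the message above are equal (2752 and 2752), which suggests the real check compares more than these two numbers.

```python
# Illustrative only: a toy placeholder/feature consistency check.
# IMAGE_TOKEN_ID and check_image_alignment are hypothetical, not library API.
IMAGE_TOKEN_ID = -1  # made-up placeholder id for this sketch


def check_image_alignment(input_ids, num_features):
    """Raise ValueError if placeholder tokens and feature rows differ in count."""
    num_tokens = sum(1 for tok in input_ids if tok == IMAGE_TOKEN_ID)
    if num_tokens != num_features:
        raise ValueError(
            f"Image features and image tokens do not match: "
            f"tokens: {num_tokens}, features {num_features}"
        )
    return num_tokens
```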

Adjusted code:

from modelscope import Qwen3VLForConditionalGeneration, AutoProcessor
from PIL import Image
import torch

# Load the model on GPU for better performance
model = Qwen3VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen3-VL-8B-Instruct",
    dtype=torch.bfloat16,
    device_map="cuda",
)

processor = AutoProcessor.from_pretrained("Qwen/Qwen3-VL-8B-Instruct")

# Use the officially recommended message format
messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "image",
                "image": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VL/assets/demo.jpeg",
            },
            {"type": "text", "text": "请介绍下这张图片"},
        ],
    }
]

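# Build the model inputs with apply_chat_template.
# Note: without return_dict=True this call may return only the input_ids
# tensor, in which case no image pixels are passed to the model.
inputs = processor.apply_chat_template(
    messages,
    tokenize=True,
    add_generation_prompt=True,
    return_tensors="pt"
)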

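# Check the type of inputs and handle each case
if isinstance(inputs, torch.Tensor):
    # inputs is a tensor: move it to the model's device
    inputs = inputs.to(model.device)
    # Wrap it in the input dict that generate() expects
    inputs_dict = {"input_ids": inputs}
else:
    # inputs is a dict: move every tensor to the model's device
    inputs_dict = {k: v.to(model.device) for k, v in inputs.items()}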

# Inference: Generation of the output
generated_ids = model.generate(**inputs_dict, max_new_tokens=128)
generated_ids_trimmed = [
    out_ids[len(in_ids) :] for in_ids, out_ids in zip(inputs_dict["input_ids"], generated_ids)
]
output_text = processor.batch_decode(
    generated_ids_trimmed, skip_special_tokens=True, clean_up_tokenization_spaces=False
)
print(output_text)
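
Because the code above handles both a bare tensor and a dict, it can help to check which of the two the processor actually returned, and whether any image data came with it. Below is a minimal sketch of such a check. `summarize_inputs` is a hypothetical helper, and the key name `pixel_values` is an assumption about what the processor emits for images.

```python
def summarize_inputs(inputs):
    """Report which entries a processor call produced.

    Accepts either a mapping (a dict-like batch) or a bare object such as
    a tensor of token ids, mirroring the two cases handled above.
    """
    if hasattr(inputs, "keys"):
        keys = sorted(inputs.keys())
    else:
        keys = ["input_ids"]  # a bare tensor carries token ids only
    # Assumption: image data, when present, is stored under "pixel_values".
    has_image = "pixel_values" in keys
    return {"keys": keys, "has_image": has_image}
```

Calling `print(summarize_inputs(inputs))` right after `apply_chat_template` shows whether the model will receive image data. If `has_image` is `False`, generation runs on the text tokens alone.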


Permalink: https://blog.20230611.cn/post/911.html


© 2023 高久峰个人博客 - https://blog.20230611.cn . All rights reserved 粤ICP备20061021号-2
