Mindspore报错：the dimension of logits must be equal to 2, but got 3.-昇腾社区

Mindspore报错：the dimension of logits must be equal to 2, but got 3.

2023/05/23

181

暂无评分

我要评分

问题信息

问题来源	产品大类	产品子类	关键字
官方	模型训练	MindSpore	无

问题现象描述

系统环境：

Hardware Environment(Ascend/GPU/CPU): Ascend Software Environment: -- MindSpore version (source or binary): 1.6.0 -- Python version (e.g., Python 3.7.5): 3.7.6 -- OS platform and distribution (e.g., Linux Ubuntu 16.04): Ubuntu 4.15.0-74-generic -- GCC/Compiler version (if compiled from source):

训练脚本是通过构建SoftmaxCrossEntropyWithLogits的单算子网络，计算两个变量softmax 交叉熵的例子。脚本如下：

class Net(nn.Cell):
  def __init__(self):
    super(Net, self).__init__()
    self.loss = nn.SoftmaxCrossEntropyWithLogits(sparse=False)

  def construct(self, logits, labels):
    output = self.loss(logits, labels)
    return output

net = Net()
logits = Tensor(np.array([[[2, 4, 1, 4, 5], [2, 1, 2, 4, 3]]]), mindspore.float32)
labels = Tensor(np.array([[[0, 0, 0, 0, 1], [0, 0, 0, 1, 0]]]).astype(np.float32))
out = net(logits, labels)
print('out',out)

报错信息：

Traceback (most recent call last):
File 'demo.py', line 13, in &lt;module&gt;
    out = net(logits, labels)
…
ValueError: mindspore/core/utils/check_convert_utils.cc:395 CheckInteger] For primitive[SoftmaxCrossEntropyWithLogits], the dimension of logits must be equal to 2, but got 3.
The function call stack (See file ' rank_0/om/analyze_fail.dat' for more details):
\# 0 In file demo.py(04)
    output = self.loss(logits, labels)

原因分析

在报错信息ValueError中，For primitive[SoftmaxCrossEntropyWithLogits], the dimension of logits must be equal to 2, but got 3，表示传入logits应该等于2维，但实际传入的logits的Shape是3维，查看官网对logits的描述可知，支持的Shape为(N,C)。如下图所示。

放大

对于3维数据，建议先reshape成2维(N*L, C)，然后再调用nn.SoftmaxCrossEntropyWithLogits接口，执行完后再reshape回 (N, 1)。

参考文档：SoftmaxCrossEntropyWithLogits算子API接口

解决措施

基于以上原因，修改训练脚本：

class Net(nn.Cell):
    def __init__(self):
        super(Net, self).__init__()
        self.loss = nn.SoftmaxCrossEntropyWithLogits(sparse=False)

    def construct(self, logits, labels):
        output = self.loss(logits, labels)
        return output

net = Net()
logits = Tensor(np.array([[[2, 4, 1, 4, 5], [2, 1, 2, 4, 3]]]), mindspore.float32)
labels = Tensor(np.array([[[0, 0, 0, 0, 1], [0, 0, 0, 1, 0]]]).astype(np.float32))
L, N, C = logits.shape
logits,labels = logits.reshape(L*N, C),labels.reshape(L*N, C)
out = net(logits, labels)
out = out.reshape(N,1)
print('out',out)

执行训练。成功后输出如下：

out [[0.5899297 ]
[0.52374405]]

本页内容

问题信息
问题现象描述
原因分析
解决措施

问题现象描述

原因分析

解决措施

关于昇腾

新闻与活动

交流与资讯

支持与服务

开源社区

About Ascend

Communication and Information

Links