IndexError: too many indices for tensor of dimension 1 #8

mfelice · 2021-03-24T18:59:37Z

Hi there,

I'm using the PyTorch implementation with bert-base-uncased and I get the following error when the sentence contains only one token:

Traceback (most recent call last):
  File "bert.py", line 28, in <module>
    print(scorer.score_sentences(["Hello"]))
  File ".../mlm-scoring/src/mlm/scorers.py", line 167, in score_sentences
    return self.score(corpus, **kwargs)[0]
  File ".../mlm-scoring/src/mlm/scorers.py", line 757, in score
    out = out[list(range(split_size)), token_masked_ids]
IndexError: too many indices for tensor of dimension 1

It works fine with MXNet MLMs, but I need to use a community model from HuggingFace.

Thanks!

The text was updated successfully, but these errors were encountered:

mfelice · 2021-03-25T13:23:55Z

OK, I think I found the problem.

mlm-scoring/src/mlm/scorers.py

Line 727 in 6727297

out = out[0].squeeze()

should be changed to:

out = torch.reshape(out[0], (out[0].shape[0], -1))

squeeze() was removing a dimension that should be preserved.

DarrenAbramson · 2021-03-25T13:38:25Z

Hurray for publicly licensed software and donation of labour to the public good!

awslabs#8

miidas added a commit to miidas/mlm-scoring that referenced this issue Jan 31, 2022

Fix IndexError

78281db

awslabs#8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IndexError: too many indices for tensor of dimension 1 #8

IndexError: too many indices for tensor of dimension 1 #8

mfelice commented Mar 24, 2021

mfelice commented Mar 25, 2021 •

edited

Loading

DarrenAbramson commented Mar 25, 2021

IndexError: too many indices for tensor of dimension 1 #8

IndexError: too many indices for tensor of dimension 1 #8

Comments

mfelice commented Mar 24, 2021

mfelice commented Mar 25, 2021 • edited Loading

DarrenAbramson commented Mar 25, 2021

mfelice commented Mar 25, 2021 •

edited

Loading