RuntimeError: The size of tensor a (51) must match the size of tensor b (53) at non-singleton dimension 1 #131

yinchyu · 2021-12-06T08:51:48Z

File "/data/home/scv1106/FastSpeech2/model/modules.py", line 121, in forward
x = x + pitch_embedding
RuntimeError: The size of tensor a (51) must match the size of tensor b (53) at non-singleton dimension 1

use biaobei dataset to training， but found the problem that the tensor is not in same dimension what should i do to reslove the problems?

yanzhuangzhuang-beep · 2021-12-09T11:16:32Z

我之前也是这样，我觉得是词库不是完整导致有的索引为空数据不对齐。
但是当我补充完词汇后发现了新的问题不知道是不是采样率的问题你可以先打印缺少的字符在text/system中

yanzhuangzhuang-beep · 2021-12-10T04:23:26Z

使用MFA后的数据我的实验就正常可以跑了

yinchyu · 2021-12-10T07:05:03Z

我使用了mfa 生成的词汇表和声学模型，但是还是有这个问题，

chenming6615 · 2021-12-14T08:33:14Z

Maybe your lexicon generated by MFA contains some phones that are not included in the phone list in the text/pinyin.py file.

finals = [
    "a1",
    "a2",
    "a3",
    "a4",
    "a5",
    "ai1",
    "ai2",
    "ai3",
    "ai4",
...
]

You can print out the missing phones by modifying the _symbols_to_sequence function in text/__init__.py into

def _symbols_to_sequence(symbols):
    missing=[s for s in symbols if not _should_keep_symbol(s)]
    if missing:
        print(missing)
    return [_symbol_to_id[s] for s in symbols if _should_keep_symbol(s)]

And add the missing phones into text/pinyin.py

everschen · 2022-10-02T12:03:09Z

if chinese, please try PR: #153

AkiKagura · 2022-11-13T07:31:38Z

我也遇到了类似的问题，但是我是刚训练的，mfa生成的音节已经不是拼音了，是一些我看不懂的符号

tuntun990606 · 2023-02-11T07:55:53Z

我也遇到了类似的问题，但是我是刚训练的，mfa生成的音节已经不是拼音了，是一些我看不懂的符号

你需要把音素加到，text/symbols里

jobanpreet495 · 2023-08-24T17:28:09Z

Maybe your lexicon generated by MFA contains some phones that are not included in the phone list in the text/pinyin.py file.
finals = [
    "a1",
    "a2",
    "a3",
    "a4",
    "a5",
    "ai1",
    "ai2",
    "ai3",
    "ai4",
...
]
You can print out the missing phones by modifying the _symbols_to_sequence function in text/__init__.py into
def _symbols_to_sequence(symbols):
    missing=[s for s in symbols if not _should_keep_symbol(s)]
    if missing:
        print(missing)
    return [_symbol_to_id[s] for s in symbols if _should_keep_symbol(s)]
And add the missing phones into text/pinyin.py

This worked

laTH380 · 2023-11-08T17:04:09Z

MFA によって生成された辞書には、ファイル内の電話リストに含まれていない電話が含まれている可能性がありますtext/pinyin.py。
finals = [
    "a1",
    "a2",
    "a3",
    "a4",
    "a5",
    "ai1",
    "ai2",
    "ai3",
    "ai4",
...
]
_symbols_to_sequenceの関数をtext/__init__.py次のように変更することで、不足している電話機を印刷できます。
def _symbols_to_sequence(symbols):
    missing=[s for s in symbols if not _should_keep_symbol(s)]
    if missing:
        print(missing)
    return [_symbol_to_id[s] for s in symbols if _should_keep_symbol(s)]
不足している電話を追加します text/pinyin.py

this is useful when also training japanese model!
Thank you!!!

Lakhjeet1082 · 2024-05-11T16:35:06Z

Maybe your lexicon generated by MFA contains some phones that are not included in the phone list in the text/pinyin.py file.
finals = [
    "a1",
    "a2",
    "a3",
    "a4",
    "a5",
    "ai1",
    "ai2",
    "ai3",
    "ai4",
...
]
You can print out the missing phones by modifying the _symbols_to_sequence function in text/__init__.py into
def _symbols_to_sequence(symbols):
    missing=[s for s in symbols if not _should_keep_symbol(s)]
    if missing:
        print(missing)
    return [_symbol_to_id[s] for s in symbols if _should_keep_symbol(s)]
And add the missing phones into text/pinyin.py
I am getting list of many missing phones, where should I exactly add them. Please guide me on this

chenming6615 · 2024-05-13T05:46:02Z

Maybe your lexicon generated by MFA contains some phones that are not included in the phone list in the text/pinyin.py file.
finals = [
    "a1",
    "a2",
    "a3",
    "a4",
    "a5",
    "ai1",
    "ai2",
    "ai3",
    "ai4",
...
]
You can print out the missing phones by modifying the _symbols_to_sequence function in text/__init__.py into
def _symbols_to_sequence(symbols):
    missing=[s for s in symbols if not _should_keep_symbol(s)]
    if missing:
        print(missing)
    return [_symbol_to_id[s] for s in symbols if _should_keep_symbol(s)]
And add the missing phones into text/pinyin.py
I am getting list of many missing phones, where should I exactly add them. Please guide me on this

add them into to the finals list in the file text/pinyin.py

FastSpeech2/text/pinyin.py

Line 211 in d4e79eb

"vn5",

chenming6615 mentioned this issue Dec 14, 2021

Mismatch tensor when training custom data #129

Closed

tuntun990606 mentioned this issue Jul 27, 2023

aishell3处理：使用mfa官方dict和声学模型处理aishell3 #188

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RuntimeError: The size of tensor a (51) must match the size of tensor b (53) at non-singleton dimension 1 #131

RuntimeError: The size of tensor a (51) must match the size of tensor b (53) at non-singleton dimension 1 #131

yinchyu commented Dec 6, 2021

yanzhuangzhuang-beep commented Dec 9, 2021

yanzhuangzhuang-beep commented Dec 10, 2021

yinchyu commented Dec 10, 2021

chenming6615 commented Dec 14, 2021

everschen commented Oct 2, 2022

AkiKagura commented Nov 13, 2022

tuntun990606 commented Feb 11, 2023

jobanpreet495 commented Aug 24, 2023

laTH380 commented Nov 8, 2023

Lakhjeet1082 commented May 11, 2024

chenming6615 commented May 13, 2024

RuntimeError: The size of tensor a (51) must match the size of tensor b (53) at non-singleton dimension 1 #131

RuntimeError: The size of tensor a (51) must match the size of tensor b (53) at non-singleton dimension 1 #131

Comments

yinchyu commented Dec 6, 2021

yanzhuangzhuang-beep commented Dec 9, 2021

yanzhuangzhuang-beep commented Dec 10, 2021

yinchyu commented Dec 10, 2021

chenming6615 commented Dec 14, 2021

everschen commented Oct 2, 2022

AkiKagura commented Nov 13, 2022

tuntun990606 commented Feb 11, 2023

jobanpreet495 commented Aug 24, 2023

laTH380 commented Nov 8, 2023

Lakhjeet1082 commented May 11, 2024

chenming6615 commented May 13, 2024