MirrorYuChen/MNNTaoAvatarLearn

MNNTaoAvatarLearn

  • Study notes on MNNTaoAvatar, the project open-sourced by the MNN team
  • Author: Chenjingyu
  • Date: 20250808

1. Project overview

  • MNNTaoAvatar consists of five parts:
    • MNN-LLM (local chatbot): an LLM that runs locally, for real-time conversation with the digital human
    • Sherpa-MNN-ASR (smarter speech recognition): a built-in ASR model that transcribes speech to text as you speak
    • MNN-TTS (speech synthesis on demand): a TTS model that gives your digital human a natural, realistic voice
    • MNN-A2BS (audio-driven expressions and motion): A2BS technology that automatically generates rich facial expressions and movements for the digital human from audio
    • MNN-NNR (real-time neural rendering): makes the digital human's expressions detailed and lifelike, for a stronger sense of interaction

Four modules have been ported so far: MNN-LLM as MnnLLMSession, Sherpa-MNN-ASR as MnnASRSession, MNN-TTS as MnnTTSSession, and MNN-A2BS as MnnA2BSSession. The ASR module is a trimmed-down version of the original sherpa-mnn, which took the most effort. The other three were copied over almost directly, with some code cleanup. Checking whether the A2BS output is correct requires porting the NNR module, but NNR has not been open-sourced, so there is no point in going further for now.
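To make the relationship between the four ported modules concrete, one conversational turn can be pictured as a chain: recognized speech feeds the LLM, the reply is synthesized to audio, and that audio drives the blend shapes. The sketch below only illustrates this data flow as inferred from the module descriptions above. `run_turn`, `TurnResult`, and the four callables are hypothetical stand-ins, not the actual interfaces of MnnASRSession, MnnLLMSession, MnnTTSSession, or MnnA2BSSession.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class TurnResult:
    """Everything produced during one user/assistant exchange."""
    user_text: str
    reply_text: str
    reply_audio: List[float]
    blend_shapes: List[List[float]]


def run_turn(
    mic_audio: Sequence[float],
    asr: Callable[[Sequence[float]], str],
    llm: Callable[[str], str],
    tts: Callable[[str], List[float]],
    a2bs: Callable[[List[float]], List[List[float]]],
) -> TurnResult:
    """Chain the four stages; each callable is a hypothetical stand-in."""
    user_text = asr(mic_audio)        # speech -> text
    reply_text = llm(user_text)       # text -> reply text
    reply_audio = tts(reply_text)     # reply text -> waveform samples
    blend_shapes = a2bs(reply_audio)  # waveform -> per-frame blend shapes
    return TurnResult(user_text, reply_text, reply_audio, blend_shapes)
```

Passing the stages in as callables keeps the sketch independent of any real session class. In the actual project each stage would wrap the corresponding session object.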

2. Usage

First, the project structure, including where the models are stored:

.
+-- 3rdLibs
| +-- MNN
| | +-- include
| | +-- lib
| +-- nlohmann
| | +-- json.hpp
| +-- spdlog
+-- CMakeLists.txt
+-- README.MD
+-- build
+-- cmake
| +-- kaldi-native-fbank.cmake
+-- data
| +-- a2bs
| | +-- README.md
| | +-- audio2verts.mnn
| | +-- body_converter.mnn
| | +-- body_params.bin
| | +-- configuration.json
| | +-- idle_speech_slices.json
| +-- asr
| | +-- 1.wav
| | +-- README.md
| | +-- configuration.json
| | +-- decoder.mnn
| | +-- encoder.mnn
| | +-- joiner.mnn
| | +-- tokens.txt
| +-- llm
| | +-- README.md
| | +-- config.json
| | +-- configuration.json
| | +-- llm.mnn
| | +-- llm.mnn.json
| | +-- llm.mnn.weight
| | +-- llm_config.json
| | +-- tokenizer.txt
| +-- tts
|   +-- 38acd89e9b396e6b
|   +-- b4da26028007a684
|   +-- common
|   | +-- mnn_models
|   | | +-- chinese_bert.mnn
|   | | +-- chinese_bert.mnn.weight
|   | | +-- english_bert.mnn
|   | | +-- english_bert.mnn.weight
|   | +-- text_processing_jsons
|   |   +-- char_state.bin
|   |   +-- cn_bert_token.bin
|   |   +-- default_tone_words.json
|   |   +-- en_bert_token.json
|   |   +-- eng_dict.bin
|   |   +-- hotwords_cn.bin
|   |   +-- hotwords_cn.json
|   |   +-- phrases_dict.bin
|   |   +-- pinyin_dict.bin
|   |   +-- pinyin_to_symbol_map.bin
|   |   +-- prob_emit.bin
|   |   +-- prob_start.bin
|   |   +-- prob_trans.bin
|   |   +-- tokenizer.txt
|   |   +-- word_freq.bin
|   |   +-- word_tag.bin
|   +-- config.json
|   +-- configuration.json
|   +-- tokenizer.txt
|   +-- tts_generator_w_bert_chenxi_0310_int8.mnn
+-- include
+-- source
+-- test

The main thing to note here is the directory layout of the model files.
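Since the test programs load models by path from this layout, it can help to check that the files are in place before running them. The following is a minimal sketch, assuming the data directory matches the tree above. `missing_model_files` and `EXPECTED_MODEL_FILES` are illustrative names, and the list is a hand-picked subset of the files in the tree, not an authoritative manifest.

```python
from pathlib import Path
from typing import Iterable, List

# Illustrative subset of the files shown in the tree above (not exhaustive).
EXPECTED_MODEL_FILES = (
    "asr/encoder.mnn",
    "asr/decoder.mnn",
    "asr/joiner.mnn",
    "asr/tokens.txt",
    "llm/llm.mnn",
    "llm/llm.mnn.weight",
    "llm/llm_config.json",
    "tts/tts_generator_w_bert_chenxi_0310_int8.mnn",
    "tts/common/mnn_models/chinese_bert.mnn",
    "tts/common/mnn_models/english_bert.mnn",
    "a2bs/audio2verts.mnn",
    "a2bs/body_converter.mnn",
)


def missing_model_files(
    data_dir: str, expected: Iterable[str] = EXPECTED_MODEL_FILES
) -> List[str]:
    """Return the expected paths (relative to data_dir) that are not regular files."""
    root = Path(data_dir)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    # Run from the project root, where the data directory lives.
    for rel in missing_model_files("data"):
        print(f"missing: data/{rel}")
```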

  • (1) Build the latest version of MNN
>> git clone git@github.com:alibaba/MNN.git
>> cd MNN && mkdir build && cd build
>> cmake .. -DMNN_LOW_MEMORY=true -DMNN_CPU_WEIGHT_DEQUANT_GEMM=true \
-DMNN_BUILD_LLM=true -DMNN_SUPPORT_TRANSFORMER_FUSE=true -DMNN_SEP_BUILD=OFF \
-DMNN_BUILD_CONVERTER=ON -DCMAKE_INSTALL_PREFIX=.
>> make install
  • (2) Replace the corresponding files under 3rdLibs/MNN with the newly built lib and include
  • (3) Build this project
>> mkdir build && cd build && cmake .. && make -j 16
  • (4) Test the results

All MNNTaoAvatar models can be found at: https://modelscope.cn/collections/TaoAvatar-68d8a46f2e554a

  • (a) Test the MNN-TTS module: first download the model, bert-vits2-MNN. Here it is unpacked into the data/tts directory under the project root; see the code for details.
>> ./TestMnnTTSSession
[2025-08-08 20:32:18.785] [mirror] [info] [Pinyin.cc:64] Pinyin initialization started...
[2025-08-08 20:32:18.841] [mirror] [info] [Pinyin.cc:96] Pinyin initialized successfully, timecost: 55ms
[2025-08-08 20:32:18.841] [mirror] [info] [WordSpliter.cc:43] WordSpliter initialization started...
[2025-08-08 20:32:19.154] [mirror] [info] [WordSpliter.cc:55] WordSpliter initialization finished, timecost: 313ms
[2025-08-08 20:32:19.154] [mirror] [info] [ToneAdjuster.cc:14] ToneAdjuster initialization started...
[2025-08-08 20:32:19.154] [mirror] [info] [ToneAdjuster.cc:36] ToneAdjuster initialization finished, timecost: 0 ms
[2025-08-08 20:32:19.154] [mirror] [info] [ChineseG2p.cc:28] ChineseG2P initialization started...
[2025-08-08 20:32:19.155] [mirror] [info] [ChineseG2p.cc:49] ChineseG2P initialized successfully, timecost: 0 ms
The device supports: i8sdot:0, fp16:0, i8mm: 0, sve2: 0, sme2: 0
Can't open file:.cachefile
Load Cache file error.
[2025-08-08 20:32:19.851] [mirror] [info] [ChineseBert.cc:53] bert model loaded successfully: ../data/tts/./common/mnn_models/chinese_bert.mnn
Can't open file:.cachefile
Load Cache file error.
[2025-08-08 20:32:20.383] [mirror] [info] [EnglishBert.cc:39] en_bert model loaded successfully: ../data/tts/./common/mnn_models/english_bert.mnn
Can't open file:.tts_generator_cachefile
Load Cache file error.
[2025-08-08 20:32:20.811] [mirror] [info] [TTSGenerator.cc:42] tts model loaded successfully: ../data/tts/tts_generator_w_bert_chenxi_0310_int8.mnn
[2025-08-08 20:32:20.811] [mirror] [info] [TTSGenerator.cc:43] ### tts load memory increase : 44.42533
[2025-08-08 20:32:26.467] [mirror] [info] [TTSGenerator.cc:99] ### tts forward memory increase : 61.616356.
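  • (b) Test the Sherpa-MNN-ASR module, with the model files placed in the data/asr directory under the project root. The text recognized from 1.wav in the output below reads, in English, "Welcome, everyone, to try the speech recognition model released by DAMO Academy".
>> ./TestMnnASRSession \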
--tokens=../data/asr/tokens.txt \
--encoder=../data/asr/encoder.mnn \
--decoder=../data/asr/decoder.mnn \
--joiner=../data/asr/joiner.mnn \
--num-threads=2 \
../data/asr/1.wav
[2025-08-08 20:41:08.084] [mirror] [info] [ParseOptions.cc:310] ./TestMnnASRSession --tokens=../data/asr/tokens.txt --encoder=../data/asr/encoder.mnn --decoder=../data/asr/decoder.mnn --joiner=../data/asr/joiner.mnn --num-threads=2 ../data/asr/1.wav

OnlineMnnASRSessionConfig(feature_extractor_config=FeatureExtractorConfig(sampling_rate=16000, feature_dim=80, low_freq=20, high_freq=-400, dither=0, normalize_samples=True, snip_edges=False), model_config=ModelConfig(encoder="../data/asr/encoder.mnn", decoder="../data/asr/decoder.mnn", joiner="../data/asr/joiner.mnn"), endpoint_config=EndpointConfig(rule1=EndpointRule(must_contain_nonsilence=False, min_trailing_silence=2.4, min_utterance_length=0), rule2=EndpointRule(must_contain_nonsilence=True, min_trailing_silence=1.2, min_utterance_length=0), rule3=EndpointRule(must_contain_nonsilence=False, min_trailing_silence=0, min_utterance_length=20)), enable_endpoint=True, blank_penalty=0, temperature_scale=2")
The device supports: i8sdot:0, fp16:0, i8mm: 0, sve2: 0, sme2: 0
[2025-08-08 20:41:10.449] [mirror] [info] [MnnASRSession.cc:115] processed result: Elapsed seconds: 0.36, Audio duration (s): 5.5, Real time factor (RTF) = 0.36/5.5 = 0.065
Huan Ying Da Jia Lai Ti Yan Da Mo Yuan Tui Chu De Yu Yin Shi Bie Mo Xing
{ "text": "Huan Ying Da Jia Lai Ti Yan Da Mo Yuan Tui Chu De Yu Yin Shi Bie Mo Xing ", "tokens": ["Huan ", "Ying ", "Da ", "Jia ", "Lai ", "Ti ", "Yan ", "Da ", "Mo ", "Yuan ", "Tui ", "Chu ", "De ", "Yu ", "Yin ", "Shi ", "Bie ", "Mo ", "Xing "], "timestamps": [0.96, 1.20, 1.60, 1.80, 2.12, 2.36, 2.60, 2.88, 3.00, 3.20, 3.40, 3.68, 3.88, 4.16, 4.36, 4.56, 4.84, 5.12, 5.40], "segment": 0, "words": [], "start_time": 0.00, "is_final": false}


processed succeed.
{ "text": "Huan Ying Da Jia Lai Ti Yan Da Mo Yuan Tui Chu De Yu Yin Shi Bie Mo Xing ", "tokens": ["Huan ", "Ying ", "Da ", "Jia ", "Lai ", "Ti ", "Yan ", "Da ", "Mo ", "Yuan ", "Tui ", "Chu ", "De ", "Yu ", "Yin ", "Shi ", "Bie ", "Mo ", "Xing "], "timestamps": [0.96, 1.20, 1.60, 1.80, 2.12, 2.36, 2.60, 2.88, 3.00, 3.20, 3.40, 3.68, 3.88, 4.16, 4.36, 4.56, 4.84, 5.12, 5.40], "segment": 0, "words": [], "start_time": 0.00, "is_final": false}
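The JSON result printed above carries parallel "tokens" and "timestamps" arrays of equal length. A minimal sketch of pairing them follows. `token_timings` is an illustrative helper, and it assumes each timestamp is the time in seconds at which the token at the same index was emitted, which the output itself does not state. The sample string is a shortened copy of the result above.

```python
import json
from typing import List, Tuple


def token_timings(result_json: str) -> List[Tuple[str, float]]:
    """Pair each token with the timestamp at the same index (assumed to be seconds)."""
    result = json.loads(result_json)
    tokens = result["tokens"]
    timestamps = result["timestamps"]
    if len(tokens) != len(timestamps):
        raise ValueError("tokens and timestamps must have the same length")
    # Strip the padding whitespace that surrounds each token in the printed output.
    return [(tok.strip(), float(ts)) for tok, ts in zip(tokens, timestamps)]


# Shortened copy of the result above: first three tokens only.
sample = (
    '{"text": "Huan Ying Da ", "tokens": ["Huan ", "Ying ", "Da "], '
    '"timestamps": [0.96, 1.20, 1.60]}'
)
for token, start in token_timings(sample):
    print(f"{start:5.2f}s  {token}")
```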
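  • (c) Test the MNN-LLM module: first download the model, Qwen2.5-1.5B-Instruct-MNN. Here it is unpacked into the data/llm directory under the project root; see the code for details. In the session below, the user says hello and then asks which large model it is; the model replies that it is Tongyi Qianwen from Alibaba Cloud, a pre-trained language model that can help with text generation, translation, summarization, and question answering.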
>> ./TestMnnLLMSession
[2025-08-08 20:43:29.898] [mirror] [info] [TestMnnLLMSession.cc:38] cfg: {"is_r1":false,"max_new_tokens":2048,"minP":0.05000000074505806,"mixed_samplers":["topK","topP","minP","temperature"],"penalty":1.2,"precision":"high","sampler_type":"mixed","system_prompt":"You are a helpful assistant.","temperature":0.6000000238418579,"topK":20,"topP":0.949999988079071}.
[2025-08-08 20:43:29.898] [mirror] [info] [TestMnnLLMSession.cc:44] extra cfg: {"mmap_dir":"./tmp","use_mmap":false}
The device supports: i8sdot:0, fp16:0, i8mm: 0, sve2: 0, sme2: 0
[2025-08-08 20:43:29.899] [mirror] [info] [MnnLLMSession.cc:178] extra_cfg: {"is_r1":false,"max_new_tokens":2048,"minP":0.05000000074505806,"mixed_samplers":["topK","topP","minP","temperature"],"penalty":1.2,"precision":"high","sampler_type":"mixed","system_prompt":"You are a helpful assistant.","temperature":0.6000000238418579,"tmp_path":"./tmp","topK":20,"topP":0.949999988079071,"use_mmap":true}

User: Ni Hao
[2025-08-08 20:43:37.192] [mirror] [info] [TestMnnLLMSession.cc:14] response: Ni Hao
[2025-08-08 20:43:37.246] [mirror] [info] [TestMnnLLMSession.cc:14] response: !
[2025-08-08 20:43:37.284] [mirror] [info] [TestMnnLLMSession.cc:14] response: Hen Gao Xing
[2025-08-08 20:43:37.322] [mirror] [info] [TestMnnLLMSession.cc:14] response: Neng
[2025-08-08 20:43:37.358] [mirror] [info] [TestMnnLLMSession.cc:14] response: Wei Ni
[2025-08-08 20:43:37.395] [mirror] [info] [TestMnnLLMSession.cc:14] response: Fu Wu
[2025-08-08 20:43:37.429] [mirror] [info] [TestMnnLLMSession.cc:14] response: .

Assistant: Ni Hao !Hen Gao Xing Neng Wei Ni Fu Wu .

User: Ni Shi Shi Yao Da Mo Xing
[2025-08-08 20:43:44.603] [mirror] [info] [TestMnnLLMSession.cc:14] response: Wo Shi
[2025-08-08 20:43:44.636] [mirror] [info] [TestMnnLLMSession.cc:14] response: Lai Zi
[2025-08-08 20:43:44.672] [mirror] [info] [TestMnnLLMSession.cc:14] response: A Li
[2025-08-08 20:43:44.706] [mirror] [info] [TestMnnLLMSession.cc:14] response: Yun
[2025-08-08 20:43:44.740] [mirror] [info] [TestMnnLLMSession.cc:14] response: De
[2025-08-08 20:43:44.774] [mirror] [info] [TestMnnLLMSession.cc:14] response: Tong
[2025-08-08 20:43:44.810] [mirror] [info] [TestMnnLLMSession.cc:14] response: Yi
[2025-08-08 20:43:44.849] [mirror] [info] [TestMnnLLMSession.cc:14] response: Qian
[2025-08-08 20:43:44.887] [mirror] [info] [TestMnnLLMSession.cc:14] response: Wen
[2025-08-08 20:43:44.919] [mirror] [info] [TestMnnLLMSession.cc:14] response: ,
[2025-08-08 20:43:44.953] [mirror] [info] [TestMnnLLMSession.cc:14] response: Shi Yi Ge
[2025-08-08 20:43:44.990] [mirror] [info] [TestMnnLLMSession.cc:14] response: Yu
[2025-08-08 20:43:45.023] [mirror] [info] [TestMnnLLMSession.cc:14] response: Xun Lian
[2025-08-08 20:43:45.058] [mirror] [info] [TestMnnLLMSession.cc:14] response: Yu Yan
[2025-08-08 20:43:45.099] [mirror] [info] [TestMnnLLMSession.cc:14] response: Mo Xing
[2025-08-08 20:43:45.134] [mirror] [info] [TestMnnLLMSession.cc:14] response: ,
[2025-08-08 20:43:45.170] [mirror] [info] [TestMnnLLMSession.cc:14] response: Ke Yi Bang Zhu
[2025-08-08 20:43:45.209] [mirror] [info] [TestMnnLLMSession.cc:14] response: Ti Gong
[2025-08-08 20:43:45.248] [mirror] [info] [TestMnnLLMSession.cc:14] response: Wen Ben
[2025-08-08 20:43:45.285] [mirror] [info] [TestMnnLLMSession.cc:14] response: Sheng Cheng
[2025-08-08 20:43:45.318] [mirror] [info] [TestMnnLLMSession.cc:14] response: ,
[2025-08-08 20:43:45.352] [mirror] [info] [TestMnnLLMSession.cc:14] response: Fan Yi
[2025-08-08 20:43:45.387] [mirror] [info] [TestMnnLLMSession.cc:14] response: ,
[2025-08-08 20:43:45.419] [mirror] [info] [TestMnnLLMSession.cc:14] response: Zhai Yao
[2025-08-08 20:43:45.453] [mirror] [info] [TestMnnLLMSession.cc:14] response: ,
[2025-08-08 20:43:45.485] [mirror] [info] [TestMnnLLMSession.cc:14] response: Wen Da
[2025-08-08 20:43:45.521] [mirror] [info] [TestMnnLLMSession.cc:14] response: Deng
[2025-08-08 20:43:45.556] [mirror] [info] [TestMnnLLMSession.cc:14] response: Fu Wu
[2025-08-08 20:43:45.602] [mirror] [info] [TestMnnLLMSession.cc:14] response: .

Assistant: Wo Shi Lai Zi A Li Yun De Tong Yi Qian Wen ,Shi Yi Ge Yu Xun Lian Yu Yan Mo Xing ,Ke Yi Bang Zhu Ti Gong Wen Ben Sheng Cheng , Fan Yi , Zhai Yao , Wen Da Deng Fu Wu .
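Each streamed chunk of the reply is logged on its own timestamped "response:" line, so the spacing between chunks can be read straight from the log. The sketch below is one way to do that. `response_gaps` is an illustrative helper, and the regular expression assumes the log line format shown above. The sample lines are the first three chunks of the second reply.

```python
import re
from datetime import datetime
from typing import Iterable, List

# Matches the leading "[YYYY-MM-DD HH:MM:SS.mmm]" of lines that log a streamed chunk.
_STAMP = re.compile(r"^\[(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3})\].*\bresponse:")


def response_gaps(log_lines: Iterable[str]) -> List[float]:
    """Return the seconds elapsed between consecutive 'response:' log lines."""
    times = []
    for line in log_lines:
        match = _STAMP.match(line)
        if match:
            times.append(datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f"))
    return [(later - earlier).total_seconds() for earlier, later in zip(times, times[1:])]


sample_log = [
    "[2025-08-08 20:43:44.603] [mirror] [info] [TestMnnLLMSession.cc:14] response: Wo Shi",
    "[2025-08-08 20:43:44.636] [mirror] [info] [TestMnnLLMSession.cc:14] response: Lai Zi",
    "[2025-08-08 20:43:44.672] [mirror] [info] [TestMnnLLMSession.cc:14] response: A Li",
]
gaps = response_gaps(sample_log)
print([round(gap, 3) for gap in gaps])
```

Pass only the lines of a single reply. Otherwise the time the user spent typing the next prompt is counted as a gap.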
  • (d) Test the MNN-A2BS module: first download the model, UniTalker-MNN. Here it is unpacked into the data/a2bs directory under the project root; see the code for details.
>> ./TestMnnA2BSSession
The device supports: i8sdot:0, fp16:0, i8mm: 0, sve2: 0, sme2: 0
[2025-08-11 11:15:56.179] [mirror] [info] [AudioToFrameBlendShape.cc:72] ### audio_to_flame_blend_shape load memory increase : 364.81076
[2025-08-11 11:15:56.179] [mirror] [info] [AudioTo3dgsBlendShape.cc:27] A2BSService ParseInputsFromJson execution time: 0 ms
Load a2bs recources successed.
Audio format: 2, Channels: 1, Sample rate: 44100
[2025-08-11 11:15:56.585] [mirror] [info] [AudioToFrameBlendShape.cc:108] ### audio2verts forward memory increase : 35.944977
[2025-08-11 11:15:56.585] [mirror] [info] [AudioTo3dgsBlendShape.cc:70] Audio2BS timecost: 405.000000 ms, audio_duration: 2799.977295 ms, rtf:(0.144644+0.144644)
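The "Real time factor (RTF)" in the ASR log and the "rtf" in the A2BS log are both processing time divided by audio duration. Values below 1 mean the module runs faster than real time. As a worked check of the logged numbers, `real_time_factor` below is an illustrative helper, not a function from the project:

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent processing / duration of the audio processed."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return processing_seconds / audio_seconds


# Figures copied from the logs above (A2BS values converted from ms to s).
asr_rtf = real_time_factor(0.36, 5.5)
a2bs_rtf = real_time_factor(405.0 / 1000.0, 2799.977295 / 1000.0)

print(f"ASR  RTF: {asr_rtf:.3f}")   # ASR  RTF: 0.065
print(f"A2BS RTF: {a2bs_rtf:.6f}")  # A2BS RTF: 0.144644
```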

Note: it is not yet known whether the A2BS output here is correct. That can only be checked once the NNR module is open-sourced.

3. Next steps

  • Whether the A2BS module's output is correct can only be verified after the NNR module has been ported, but NNR has not been open-sourced, so the work stops here for now.

4. References
