Light Mode

Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
yangjianxin1
Follow
Focusing

Yang JianXin yangjianxin1

Focusing
FireflyDa Mo Xing | Gong Zhong Hao :YeungNLP | Zhi Hu :Hong Yu Piao Po

Block or report yangjianxin1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user's behavior. Learn more about reporting abuse.

Report abuse
yangjianxin1/README.md

Hi there , I'm Yang Jianxin

I'm a NLPer interested in Large Language Model and graduated from SYSU with a master's degree.

In my free time, I like to write technical blogs on [Wechat Official Accounts: YeungNLP] and [Zhihu: Hong Yu Piao Po ]

Experiences:

  • Shopee, responsible for building NLP algorithm ability about Customer Service. (from 2022-04 to now)
  • Tencent, responsible for building NLP algorithm ability about Product Understanding. (from 2021-06 to 2022-04)
  • Alibaba, Internship at Alibaba (from 2020-06 to 2020-09).

Here are some my public projects:

Project Description Code
Firefly One-stop training for LLMs. Some achievements:
1. firefly-llama2-13b ranked 3rd among all 13B models on Open LLM Leaderboard, only 0.5 points less than 1st.
2. firefly-llama-30b ranked 10th among all 30B models on Open LLM Leaderboard trained with single V100.
3. firefly-baichuan-13b achieves over 1.63 million downloads.
4. firefly-qwen1.5-en-7b-dpo improves 7.21 points compared with the official chat model.
5. firefly-gemma-7b improves 9.37 points compared with the official chat model.
GPT2-chitchat Chinese GPT2 for chitchat
Firefly-LLaMA2-Chinese Chinese Llama2 with efficient and effective training method.
LongQLoRA Efficient and Effective method for extending context length of Llama2 to 8192 with single V100. Technical Report
CPM Chinese composition model based on CPM
CLIP-Chinese Chinese CLIP model trained with 1.4 million image-text pairs
ClipCap-Chinese Chinese image caption model based on clip and mengzi
OFA-Chinese Chinese multi-modal unified pre-training model
LLMPruner Prune vocabulary of LLMs to save memory in training.

Here are some my technical blogs:

Pinned Loading

  1. Firefly Firefly Public

    Firefly: Da Mo Xing Xun Lian Gong Ju ,Zhi Chi Xun Lian Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llma2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, BloomDeng Da Mo Xing

    Python 6.6k 588

  2. GPT2-chitchat GPT2-chitchat Public

    GPT2 for Chinese chitchat/Yong Yu Zhong Wen Xian Liao De GPT2Mo Xing (Shi Xian Liao DialoGPTDe MMISi Xiang )

    Python 3k 673

  3. CPM CPM Public

    Easy-to-use CPM for Chinese text generation(Ji Yu CPMDe Zhong Wen Wen Ben Sheng Cheng )

    Python 531 130

  4. Firefly-LLaMA2-Chinese Firefly-LLaMA2-Chinese Public

    FireflyZhong Wen LLaMA-2Da Mo Xing ,Zhi Chi Zeng Liang Yu Xun Lian Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, BloomDeng Da Mo Xing

    Python 416 32

  5. CLIP-Chinese CLIP-Chinese Public

    Zhong Wen CLIPYu Xun Lian Mo Xing

    Python 423 61

  6. LongQLoRA LongQLoRA Public

    LongQLoRA: Extent Context Length of LLMs Efficiently

    Python 168 16