Natural Language Processing Group, Institute of Computing Technology, Chinese Academy of Sciences

university

https://nlp.ict.ac.cn

Recent Activity

poeroz

updated a dataset 26 days ago

ICTNLP/InstructS2S-200K

Viewer • Updated 26 days ago • 200k • 1.23k • 8

Paulmzr

in ICTNLP/SLED-TTS-Streaming-Libriheavy 4 months ago

Is this model multilingual?

#2 opened 4 months ago by MuhammadZaeemNasir

guoshoutao

updated a dataset 5 months ago

ICTNLP/LongSpeech-Eval

Viewer • Updated Jul 22 • 164 • 187 • 2

guoshoutao

updated a model 5 months ago

ICTNLP/FastLongSpeech

8B • Updated Jul 22 • 10 • 2

guoshoutao

published a dataset 5 months ago

ICTNLP/LongSpeech-Eval

Viewer • Updated Jul 22 • 164 • 187 • 2

guoshoutao

published a model 5 months ago

ICTNLP/FastLongSpeech

8B • Updated Jul 22 • 10 • 2

guoshoutao

updated a dataset 6 months ago

ICTNLP/StreamUni

Viewer • Updated Jul 14 • 9.63k • 5.59k • 2

guoshoutao

updated a model 6 months ago

ICTNLP/StreamUni-Phi4

Audio-Text-to-Text • 6B • Updated Jul 14 • 7

zhangshaolei

authored a paper 6 months ago

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16 • 26

Paulmzr

authored a paper 7 months ago

Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space

Paper • 2505.13181 • Published May 19 • 9

poeroz

authored 4 papers 8 months ago

Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation

Paper • 2310.13361 • Published Oct 20, 2023

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

Paper • 2306.10968 • Published Jun 19, 2023 • 7

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation

Paper • 2310.07403 • Published Oct 11, 2023

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Paper • 2411.16300 • Published Nov 25, 2024

zhangshaolei

authored a paper 8 months ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

guoshoutao

authored a paper 8 months ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

poeroz

authored a paper 8 months ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

poeroz

authored a paper 12 months ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 52

zhangshaolei

authored a paper 12 months ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 52

zhangshaolei

authored a paper over 1 year ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 60