IJRTI
International Journal for Research Trends and Innovation
International Peer Reviewed & Refereed Journals, Open Access Journal
ISSN Approved Journal No: 2456-3315 | Impact factor: 8.14 | ESTD Year: 2016
Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.14 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

Call For Paper

For Authors

Forms / Download

Published Issue Details

Editorial Board

Other IMP Links

Facts & Figure

Impact Factor : 8.14

Issue per Year : 12

Volume Published : 11

Issue Published : 117

Article Submitted : 21307

Article Published : 8476

Total Authors : 22301

Total Reviewer : 802

Total Countries : 156

Indexing Partner

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Published Paper Details
Paper Title: Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection
Authors Name: Dr. K.SUBBARAO , GOPU SINDHUJA , BUDDI SRAVANI , BELLAMKONDA SUPRAJA , VANAMA SANDHYA RANI
Download E-Certificate: Download
Author Reg. ID:
IJRTI_205995
Published Paper Id: IJRTI2509013
Published In: Volume 10 Issue 9, September-2025
DOI:
Abstract: Automatic Speech Recognition (ASR) of a target speaker in multi-speaker environments remains a significant challenge. Traditional ASR systems often fail to isolate a specific speaker's voice from overlapping and interfering audio sources. To address this, Target-Speaker ASR (TS-ASR) has emerged as a viable solution by conditioning the recognition process on speaker-specific embeddings. This paper presents a Streaming End-to-End TS-ASR system based on a neural transducer architecture that facilitates low-latency and on-device speech recognition. The proposed model integrates Target-Speaker Activity Detection (TSAD), allowing the system to remain silent when the target speaker is inactive, thereby reducing unnecessary outputs. Experimental evaluations demonstrate that the proposed TS-ASR model achieves superior performance compared to traditional cascade systems, with improvements in word error rate (WER), speaker identification accuracy, and real-time latency. The system is optimized for real-world deployment, offering high accuracy and low computational overhead suitable for mobile and edge applications.
Keywords: Target-Speaker ASR (TS-ASR), Recurrent Neural Network Transducer (RNNT), Speaker Embedding, Voice Activity Detection (VAD), Real-Time Speech Recognition, Speaker Identification, End-to-End ASR.
Cite Article: "Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection ", International Journal for Research Trends and Innovation (www.ijrti.org), ISSN:2455-2631, Vol.10, Issue 9, page no.a113-a117, September-2025, Available :http://www.ijrti.org/papers/IJRTI2509013.pdf
Downloads: 0002590
ISSN: 2456-3315 | IMPACT FACTOR: 8.14 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.14 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator
Publication Details: Published Paper ID: IJRTI2509013
Registration ID:205995
Published In: Volume 10 Issue 9, September-2025
DOI (Digital Object Identifier):
Page No: a113-a117
Country: HYDERABAD, Telangana, India
Research Area: Computer Science & Technology 
Publisher : IJ Publication
Published Paper URL : https://www.ijrti.org/viewpaperforall?paper=IJRTI2509013
Published Paper PDF: https://www.ijrti.org/papers/IJRTI2509013
Share Article:

Click Here to Download This Article

Article Preview
Click Here to Download This Article

Major Indexing from www.ijrti.org
Google Scholar ResearcherID Thomson Reuters Mendeley : reference manager Academia.edu
arXiv.org : cornell university library Research Gate CiteSeerX DOAJ : Directory of Open Access Journals
DRJI Index Copernicus International Scribd DocStoc

ISSN Details

ISSN: 2456-3315
Impact Factor: 8.14 and ISSN APPROVED, Journal Starting Year (ESTD) : 2016

DOI (A digital object identifier)


Providing A digital object identifier by DOI.ONE
How to Get DOI?

Conference

Open Access License Policy

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Creative Commons License This material is Open Knowledge This material is Open Data This material is Open Content

Important Details

Join RMS/Earn 300

IJRTI

WhatsApp
Click Here

Indexing Partner