Tech Science Press - Publisher of Open Access Journals

News & Announcements

30 January 2026
Tech Science Press Shares Integrity Insights on AI-Enabled Paper Mills at Charleston Asia Conference
27 January 2026
SDHM-Recommended: I3CSE 2026 in Guangzhou
26 January 2026
TSP Establishes Strategic Cooperation with Chinese Medical Association Publishing House (CMAPH)
05 January 2026
Prof. Lin Lu Appointed Editor-in-Chief of Energy Engineering
29 December 2025
Two More Tech Science Press Journals Now Indexed in Chemical Abstracts Service (CAS) Databases
24 December 2025
Oncologie Welcomes Dr. Lei Zheng as Editor-in-Chief

Title/Keywords
Author/Affliations
Journal
Article Type
Start Year
End Year

Update Searching Clear

Show export options

Articles
Online

Search Results (4)

Open Access

ARTICLE

RSG-Conformer: ReLU-Based Sparse and Grouped Conformer for Audio-Visual Speech Recognition

Yewei Xiao, Xin Du^*, Wei Zeng

CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.072145 - 12 January 2026

Abstract Audio-visual speech recognition (AVSR), which integrates audio and visual modalities to improve recognition performance and robustness in noisy or adverse acoustic conditions, has attracted significant research interest. However, Conformer-based architectures remain computational expensive due to the quadratic increase in the spatial and temporal complexity of their softmax-based attention mechanisms with sequence length. In addition, Conformer-based architectures may not provide sufficient flexibility for modeling local dependencies at different granularities. To mitigate these limitations, this study introduces a novel AVSR framework based on a ReLU-based Sparse and Grouped Conformer (RSG-Conformer) architecture. Specifically, we propose a Global-enhanced Sparse… More >

View
939

Download
433
Open Access

ARTICLE

Visual Lip-Reading for Quranic Arabic Alphabets and Words Using Deep Learning

Nada Faisal Aljohani^*, Emad Sami Jaha

Computer Systems Science and Engineering, Vol.46, No.3, pp. 3037-3058, 2023, DOI:10.32604/csse.2023.037113 - 03 April 2023

Abstract The continuing advances in deep learning have paved the way for several challenging ideas. One such idea is visual lip-reading, which has recently drawn many research interests. Lip-reading, often referred to as visual speech recognition, is the ability to understand and predict spoken speech based solely on lip movements without using sounds. Due to the lack of research studies on visual speech recognition for the Arabic language in general, and its absence in the Quranic research, this research aims to fill this gap. This paper introduces a new publicly available Arabic lip-reading dataset containing 10490… More >

View
3383

Download
1816
Open Access

ARTICLE

Deep Learning-Based Approach for Arabic Visual Speech Recognition

Nadia H. Alsulami^1,*, Amani T. Jamal¹, Lamiaa A. Elrefaei²

CMC-Computers, Materials & Continua, Vol.71, No.1, pp. 85-108, 2022, DOI:10.32604/cmc.2022.019450 - 03 November 2021

Abstract Lip-reading technologies are rapidly progressing following the breakthrough of deep learning. It plays a vital role in its many applications, such as: human-machine communication practices or security applications. In this paper, we propose to develop an effective lip-reading recognition model for Arabic visual speech recognition by implementing deep learning algorithms. The Arabic visual datasets that have been collected contains 2400 records of Arabic digits and 960 records of Arabic phrases from 24 native speakers. The primary purpose is to provide a high-performance model in terms of enhancing the preprocessing phase. Firstly, we extract keyframes from… More >

View
4278

Download
2691
Open Access

ARTICLE

HLR-Net: A Hybrid Lip-Reading Model Based on Deep Convolutional Neural Networks

Amany M. Sarhan¹, Nada M. Elshennawy¹, Dina M. Ibrahim^1,2,*

CMC-Computers, Materials & Continua, Vol.68, No.2, pp. 1531-1549, 2021, DOI:10.32604/cmc.2021.016509 - 13 April 2021

Abstract
Lip reading is typically regarded as visually interpreting the speaker’s lip movements during the speaking. This is a task of decoding the text from the speaker’s mouth movement. This paper proposes a lip-reading model that helps deaf people and persons with hearing problems to understand a speaker by capturing a video of the speaker and inputting it into the proposed model to obtain the corresponding subtitles. Using deep learning technologies makes it easier for users to extract a large number of different features, which can then be converted to probabilities of letters to obtain accurate results.
… More >

View
4743

Download
3302

Cited by
2

Displaying 1-10 on page 1 of 4. Per Page

RSG-Conformer: ReLU-Based Sparse and Grouped Conformer for Audio-Visual Speech Recognition

View

939

Download

433

Visual Lip-Reading for Quranic Arabic Alphabets and Words Using Deep Learning

View

3383

Download

1816

Deep Learning-Based Approach for Arabic Visual Speech Recognition

View

4278

Download

2691

HLR-Net: A Hybrid Lip-Reading Model Based on Deep Convolutional Neural Networks

View

4743

Download

3302

Cited by

2

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: