Author: Abigail Kelly

scholar

Heterogeneous Space Fusion and Dual-Dimension Attention: A New Paradigm for Speech Enhancement

By Abigail KellyAugust 16, 20240

Authors: Tao Zheng、Liejun Wang、Yinfeng Yu Paper: https://arxiv.org/abs/2408.06911 Introduction Speech communication is a fundamental mode of human interaction, but environmental noise often degrades the quality and clarity of speech data. Speech enhancement (SE) technology aims to mitigate the impact of noise while preserving the integrity of the original signal. This paper introduces a novel speech enhancement framework, HFSDA, which integrates heterogeneous spatial features and incorporates a dual-dimension attention mechanism to significantly enhance speech clarity and quality in noisy environments. Related Work Self-Supervised Learning Models Self-supervised learning (SSL) models have shown significant progress in speech tasks. Early methods like Contrastive Predictive Coding…

scholar

What's Hot

AAAI.2024 – Humans and AI

How Diffusion Models Learn to Factorize and Compose

Temporal Fairness in Decision Making Problems

Author: Abigail Kelly

Heterogeneous Space Fusion and Dual-Dimension Attention: A New Paradigm for Speech Enhancement

AAAI.2024 – Humans and AI

How Diffusion Models Learn to Factorize and Compose

Temporal Fairness in Decision Making Problems

NeCo: Improving DINOv2’s spatial representations in 19 GPU hours with Patch Neighbor Consistency

Our Picks

AAAI.2024 – Humans and AI

How Diffusion Models Learn to Factorize and Compose

Temporal Fairness in Decision Making Problems

Subscribe to Updates

What's Hot

Author: Abigail Kelly