Paper Digest: ICASSP 2021 Highlights
Download ICASSP-2021-Paper-Digests.pdf– highlights of all ICASSP-2021 papers. Readers can also choose to read this highlight article on our console, which allows users to filter out papers using keywords. The IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) is one of the top signal processing conferences in the world. In 2021, it is to be held online.
To help the community quickly catch up on the work presented in this conference, Paper Digest Team processed all accepted papers, and generated one highlight sentence (typically the main topic) for each paper. Readers are encouraged to read these machine generated highlights / summaries to quickly get the main idea of each paper.
If you do not want to miss any interesting academic paper, you are welcome to sign up our free daily paper digest service to get updates on new papers published in your area every day. You are also welcome to follow us on Twitter and Linkedin to get updated with new conference digests.
Paper Digest Team
team@paperdigest.org
TABLE 1: Paper Digest: ICASSP 2021 Highlights
Paper | Author(s) | |
---|---|---|
1 | Rethinking The Separation Layers In Speech Separation Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we empirically examine those questions by designing models with varying configurations in the SIMO and SISO modules. |
Y. Luo; Z. Chen; C. Han; C. Li; T. Zhou; N. Mesgarani; |
2 | On Permutation Invariant Training For Speech Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We study permutation invariant training (PIT), which targets at the permutation ambiguity problem for speaker independent source separation models. |
X. Liu; J. Pons; |
3 | Count And Separate: Incorporating Speaker Counting For Continuous Speaker Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study leverages frame-wise speaker counting to switch between speech enhancement and speaker separation for continuous speaker separation. |
Z. -Q. Wang; D. Wang; |
4 | Ultra-Lightweight Speech Separation Via Group Communication Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we provide a simple model design paradigm that explicitly designs ultra-lightweight models without sacrificing the performance. |
Y. Luo; C. Han; N. Mesgarani; |
5 | Attention Is All You Need In Speech Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Transformers are emerging as a natural alternative to standard RNNs, replacing recurrent computations with a multi-head attention mechanism.In this paper, we propose the SepFormer, a novel RNN-free Transformer-based neural network for speech separation. |
C. Subakan; M. Ravanelli; S. Cornell; M. Bronzi; J. Zhong; |
6 | Multichannel Overlapping Speaker Segmentation Using Multiple Hypothesis Tracking Of Acoustic And Spatial Features Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we explore the use of a new multimodal approach for overlapping speaker segmentation that tracks both the fundamental frequency (F0) of the speaker and the speaker?s direction of arrival (DOA) simultaneously. |
A. O. T. Hogg; C. Evers; P. A. Naylor; |
7 | Semi-Supervised Singing Voice Separation With Noisy Self-Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Given a limited set of labeled data, we present a method to leverage a large volume of unlabeled data to improve the model?s performance. |
Z. Wang; R. Giri; U. Isik; J. -M. Valin; A. Krishnaswamy; |
8 | Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel informed music source separation paradigm, which can be referred to as neuro-steered music source separation. |
G. Cantisani; S. Essid; G. Richard; |
9 | Complex Ratio Masking For Singing Voice Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a complex ratio masking method for voice and accompaniment separation. |
Y. Zhang; Y. Liu; D. Wang; |
10 | Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we use musical scores, which are comparatively easy to obtain, as a weak label for training a source separation system. |
Y. -N. Hung; G. Wichern; J. Le Roux; |
11 | All For One And One For All: Improving Music Separation By Bridging Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes several improvements for music separation with deep neural networks (DNNs), namely a multi-domain loss (MDL) and two combination schemes. |
R. Sawata; S. Uhlich; S. Takahashi; Y. Mitsufuji; |
12 | An Hrnet-Blstm Model With Two-Stage Training For Singing Melody Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome this problem, we propose to use a pitch refinement method to refine the semitone-level pitch sequences decoded from massive melody MIDI files to generate a large number of fundamental frequency (F0) values for model training. |
Y. Gao; X. Du; B. Zhu; X. Sun; W. Li; Z. Ma; |
13 | DeepF0: End-To-End Fundamental Frequency Estimation for Music and Speech Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel pitch estimation technique called DeepF0, which leverages the available annotated data to directly learns from the raw audio in a data-driven manner. |
S. Singh; R. Wang; Y. Qiu; |
14 | Differentiable Signal Processing With Black-Box Audio Effects Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a data-driven approach to automate audio signal processing by incorporating stateful third-party, audio effects as layers within a deep neural network. |
M. A. Mart�nez Ram�rez; O. Wang; P. Smaragdis; N. J. Bryan; |
15 | Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these challenges, we propose a domain-inspired model with a strong inductive bias for the mixing task. |
C. J. Steinmetz; J. Pons; S. Pascual; J. Serr�; |
16 | Sequence-To-Sequence Singing Voice Synthesis With Perceptual Entropy Loss Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a Perceptual Entropy (PE) loss derived from a psycho-acoustic hearing model to regularize the network. |
J. Shi; S. Guo; N. Huo; Y. Zhang; Q. Jin; |
17 | Reverb Conversion Of Mixed Vocal Tracks Using An End-To-End Convolutional Deep Neural Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In response, we propose an end-to-end system capable of switching the musical reverb factor of two different mixed vocal tracks. |
J. Koo; S. Paik; K. Lee; |
18 | Extending Music Based On Emotion And Tonality Via Generative Adversarial Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a generative model for music extension in this paper. |
B. -W. Tseng; Y. -L. Shen; T. -S. Chi; |
19 | Improving The Robustness Of Right Whale Detection In Noisy Conditions Using Denoising Autoencoders And Augmented Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The aim of this paper is to examine denoising autoencoders (DAEs) for improving the detection of right whales recorded in harsh marine environments. |
W. Vickers; B. Milner; R. Lee; |
20 | Self-Supervised VQ-VAE for One-Shot Music Style Transfer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we are specifically interested in the problem of one-shot timbre transfer. |
O. C�fka; A. Ozerov; U. Simsekli; G. Richard; |
21 | Capturing Temporal Dependencies Through Future Prediction for CNN-Based Audio Classifiers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To capture audio temporal dependencies using CNNs, we take a different approach from the purely architecture-induced method and explicitly encode temporal dependencies into the CNN-based audio classifiers. |
H. Song; J. Han; S. Deng; Z. Du; |
22 | Segmental Dtw: A Parallelizable Alternative to Dynamic Time Warping Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we explore parallelizable alternatives to DTW for globally aligning two feature sequences. |
T. Tsai; |
23 | Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On Vae-Based Metric Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes a representation learning method for disentangling an arbitrary musical instrument sound into latent pitch and timbre representations. |
K. Tanaka; R. Nishikimi; Y. Bando; K. Yoshii; S. Morishima; |
24 | Asynchronous Acoustic Echo Cancellation Over Wireless Channels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a novel acoustic echo cancellation framework for systems where the loudspeaker and the microphone array are not synchronized. |
R. Ayrapetian; P. Hilmes; M. Mansour; T. Kristjansson; C. Murgia; |
25 | Combining Adaptive Filtering And Complex-Valued Deep Postfiltering For Acoustic Echo Cancellation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this contribution, we introduce a novel approach to noise-robust acoustic echo cancellation employing a complex-valued Deep Neural Network (DNN) for postfiltering. |
M. M. Halimeh; T. Haubner; A. Briegleb; A. Schmidt; W. Kellermann; |
26 | Deep Residual Echo Suppression With A Tunable Tradeoff Between Signal Distortion And Echo Suppression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a residual echo suppression method using a UNet neural network that directly maps the outputs of a linear acoustic echo canceler to the desired signal in the spectral domain. |
A. Ivry; I. Cohen; B. Berdugo; |
27 | Robust STFT Domain Multi-Channel Acoustic Echo Cancellation with Adaptive Decorrelation of The Reference Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an algorithm for multi-channel acoustic echo cancellation for a high-fidelity audio reproduction system equipped with a microphone array for voice control. |
S. Bagheri; D. Giacobello; |
28 | A Method for Determining Periodically Time-Varying Bias and Its Applications in Acoustic Feedback Cancellation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we make use of that knowledge and propose a method to detect different acoustic situations, based on the level of residual bias. |
M. Guo; |
29 | Weighted Recursive Least Square Filter and Neural Network Based Residual ECHO Suppression for The AEC-Challenge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a real-time Acoustic Echo Cancellation (AEC) algorithm submitted to the AEC-Challenge. |
Z. Wang; Y. Na; Z. Liu; B. Tian; Q. Fu; |
30 | ICASSP 2021 Acoustic Echo Cancellation Challenge: Integrated Adaptive Echo Cancellation with Time Alignment and Deep Learning-Based Residual Echo Plus Noise Suppression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes a three-stage acoustic echo cancellation (AEC) and suppression framework for the ICASSP 2021 AEC Challenge. |
R. Peng; L. Cheng; C. Zheng; X. Li; |
31 | ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios. |
K. Sridhar; et al. |
32 | AEC in A Netshell: on Target and Topology Choices for FCRN Acoustic Echo Cancellation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we will heal this issue and significantly improve the near-end speech component quality over existing approaches. |
J. Franzen; E. Seidel; T. Fingscheidt; |
33 | Kernel-Interpolation-Based Filtered-X Least Mean Square for Spatial Active Noise Control In Time Domain Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Time-domain spatial active noise control (ANC) algorithms based on kernel interpolation of a sound field are proposed. |
J. Brunnstr�m; S. Koyama; |
34 | Wave-Domain Optimization of Secondary Source Placement Free From Information of Error Sensor Positions Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, a method free from the information of specific error sensors positions is proposed. |
J. Xu; K. Chen; Y. Li; |
35 | Lasaft: Latent Source Attentive Frequency Transformation For Conditioned Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The goal of this paper is to extend the FT block to fit the multi-source task. |
W. Choi; M. Kim; J. Chung; S. Jung; |
36 | Surrogate Source Model Learning for Determined Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose to learn surrogate functions of universal speech priors for determined blind speech separation. |
R. Scheibler; M. Togami; |
37 | Auditory Filterbanks Benefit Universal Sound Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We proposed parameterized Gammatone and Gammachirp filterbanks, which improved performance with fewer parameters and better interpretability. |
H. Li; K. Chen; B. U. Seeber; |
38 | What�s All The Fuss About Free Universal Sound Separation Data? Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce the Free Universal Sound Separation (FUSS) dataset, a new corpus for experiments in separating mixtures of an unknown number of sounds from an open domain of sound types. |
S. Wisdom; et al. |
39 | SepNet: A Deep Separation Matrix Prediction Network for Multichannel Audio Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose SepNet, a deep neural network (DNN) designed to predict separation matrices from multichannel observations. |
S. Inoue; H. Kameoka; L. Li; S. Makino; |
40 | CDPAM: Contrastive Learning for Perceptual Audio Similarity Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces CDPAM ?a metric that builds on and advances DPAM. |
P. Manocha; Z. Jin; R. Zhang; A. Finkelstein; |
41 | Linear Multichannel Blind Source Separation Based on Time-Frequency Mask Obtained By Harmonic/Percussive Sound Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Building up on this framework, in this paper, we propose a unification of determined BSS and harmonic/percussive sound separation (HPSS). |
S. Oyabu; D. Kitamura; K. Yatabe; |
42 | Multichannel-based Learning for Audio Object Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we propose a novel deep learning approach to object extraction that learns from the multichannel renders of object-based productions, instead of directly learning from the audio objects themselves. |
D. Arteaga; J. Pons; |
43 | DBnet: Doa-Driven Beamforming Network for End-to-end Reverberant Sound Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose a direction-of-arrival-driven beamforming network (DBnet) consisting of direction-of-arrival (DOA) estimation and beamforming layers for end-to-end source separation. |
A. Aroudi; S. Braun; |
44 | Joint Dereverberation and Separation With Iterative Source Steering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new algorithm for joint dereverberation and blind source separation (DR-BSS). |
T. Nakashima; R. Scheibler; M. Togami; N. Ono; |
45 | Exploiting Non-Negative Matrix Factorization for Binaural Sound Localization in The Presence of Directional Interference Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study presents a novel solution to the problem of binaural localization of a speaker in the presence of interfering directional noise and reverberation. |
I. �rnolfsson; T. Dau; N. Ma; T. May; |
46 | Blind Extraction of Moving Audio Source in A Challenging Environment Supported By Speaker Identification Via X-Vectors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel approach for semi-supervised extraction of a moving audio source of interest (SOI) applicable in reverberant and noisy environments. |
J. Malek; J. Jansky; T. Kounovsky; Z. Koldovsky; J. Zdansky; |
47 | Mind The Beat: Detecting Audio Onsets from EEG Recordings of Music Listening Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a deep learning approach to predicting audio event onsets in electroencephalogram (EEG) recorded from users as they listen to music. |
A. Vinay; A. Lerch; G. Leslie; |
48 | Don�t Look Back: An Online Beat Tracking Method Using RNN and Enhanced Particle Filtering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose Don’t Look back! (DLB), a novel approach optimized for efficiency when performing OBT. |
M. Heydari; Z. Duan; |
49 | Singing Melody Extraction from Polyphonic Music Based on Spectral Correlation Modeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore the idea of modeling spectral correlation explicitly for melody extraction. |
X. Du; B. Zhu; Q. Kong; Z. Ma; |
50 | Improving Automatic Drum Transcription Using Large-Scale Audio-to-Midi Aligned Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle this issue, we propose a semi-automatic way of compiling a labeled dataset using the audio-to-MIDI alignment technique. |
I. -C. Wei; C. -W. Wu; L. Su; |
51 | Frequency-Temporal Attention Network for Singing Melody Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by these intrinsic characteristics, a frequency-temporal attention network is proposed to mimic human auditory for singing melody extraction. |
S. Yu; X. Sun; Y. Yu; W. Li; |
52 | Statistical Correction of Transcribed Melody Notes Based on Probabilistic Integration of A Music Language Model and A Transcription Error Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes a statistical post-processing method for automatic singing transcription that corrects pitch and rhythm errors included in a transcribed note sequence. |
Y. Hiramatsu; G. Shibata; R. Nishikimi; E. Nakamura; K. Yoshii; |
53 | Reliability Assessment of Singing Voice F0-Estimates Using Multiple Algorithms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we consider an approach to automatically assess the reliability of F0-trajectories estimated from monophonic singing voice recordings. |
S. Rosenzweig; F. Scherbaum; M. M�ller; |
54 | End-to-End Lyrics Recognition with Voice to Singing Style Transfer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a data augmentation method that converts natural speech to singing voice based on vocoder based speech synthesizer. |
S. Basak; S. Agarwal; S. Ganapathy; N. Takahashi; |
55 | Singing Language Identification Using A Deep Phonotactic Approach Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents a modernized phonotactic system for SLID on polyphonic music: phoneme recognition is performed with a Connectionist Temporal Classification (CTC)-based acoustic model trained with multilingual data, before language classification with a recurrent model based on the phonemes estimation. |
L. Renault; A. Vaglio; R. Hennequin; |
56 | On The Preparation and Validation of A Large-Scale Dataset of Singing Transcription Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a large-scale dataset for singing transcription, along with some methods for fine-tuning and validating its contents. |
J. -Y. Wang; J. -S. R. Jang; |
57 | Joint Multi-Pitch Detection and Score Transcription for Polyphonic Piano Music Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a method for joint multi-pitch detection and score transcription for polyphonic piano music. |
L. Liu; V. Morfi; E. Benetos; |
58 | Karaoke Key Recommendation Via Personalized Competence-Based Rating Prediction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address a novel task of recommending a suitable key for a user to sing a given song to meet his or her vocal competence, by proposing the Personalized Competence-based Rating Prediction (PCRP) model. |
Y. Wang; S. Tanaka; K. Yokoyama; H. -T. Wu; Y. Fang; |
59 | A Closed-Loop Gain-Control Feedback Model for The Medial Efferent System of The Descending Auditory Pathway Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We have implemented a dynamic, closed-loop gain-control system into an existing auditory model to simulate parts of the efferent system. |
A. Farhadi; S. G. Jennings; E. A. Strickland; L. H. Carney; |
60 | DHASP: Differentiable Hearing Aid Speech Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore an alternative approach to finding the optimal fitting by introducing a hearing aid speech processing framework, in which the fitting is optimised in an automated way using an intelligibility objective function based on the HASPI physiological auditory model. |
Z. Tu; N. Ma; J. Barker; |
61 | Computationally Efficient DNN-Based Approximation of An Auditory Model for Applications in Speech Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Hence, in this work we propose and evaluate DNN-based approximations of a state-of-the-art auditory model. |
A. Nagathil; F. G�bel; A. Nelus; I. C. Bruce; |
62 | Cascaded All-Pass Filters with Randomized Center Frequencies and Phase Polarity for Acoustic and Speech Measurement and Data Augmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a new member of TSP (Time Stretched Pulse) for acoustic and speech measurement infrastructure, based on a simple all-pass filter and systematic randomization. |
H. Kawahara; K. Yatabe; |
63 | Probing Acoustic Representations for Phonetic Properties Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We compare features from two conventional and four pre-trained systems in some simple frame-level phonetic classification tasks, with classifiers trained on features from one version of the TIMIT dataset and tested on features from another. |
D. Ma; N. Ryant; M. Liberman; |
64 | An End-To-End Non-Intrusive Model for Subjective and Objective Real-World Speech Assessment Using A Multi-Task Framework Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel multi-task non-intrusive approach that is capable of simultaneously estimating both subjective and objective scores of real-world speech, to help facilitate learning. |
Z. Zhang; P. Vyas; X. Dong; D. S. Williamson; |
65 | Few-Shot Continual Learning for Audio Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce a few-shot continual learning framework for audio classification, where we can continuously expand a trained base classifier to recognize novel classes based on only few labeled data at inference time. |
Y. Wang; N. J. Bryan; M. Cartwright; J. Pablo Bello; J. Salamon; |
66 | Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study zero-shot learning in audio classification through factored linear and nonlinear acoustic-semantic projections between audio instances and sound classes. |
H. Xie; O. R�s�nen; T. Virtanen; |
67 | Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we study unsupervised and semi-supervised learning approaches for few-shot AEC. |
H. -P. Huang; K. C. Puvvada; M. Sun; C. Wang; |
68 | Flow-Based Self-Supervised Density Estimation for Anomalous Sound Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To develop a machine sound monitoring system, a method for detecting anomalous sound is proposed. |
K. Dohi; T. Endo; H. Purohit; R. Tanabe; Y. Kawaguchi; |
69 | Self-Training for Sound Event Detection in Audio Mixtures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to address limitations in availability of training data, this work proposes a self-training technique to leverage unlabeled datasets in supervised learning using pseudo label estimation. |
S. Park; A. Bellur; D. K. Han; M. Elhilali; |
70 | Prototypical Networks for Domain Adaptation in Acoustic Scene Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the search for an optimal solution to the said problem, we explore a metric learning approach called prototypical networks using the TUT Urban Acoustic Scenes dataset, which consists of 10 different acoustic scenes recorded across 10 cities. |
S. Singh; H. L. Bear; E. Benetos; |
71 | A Global-Local Attention Framework for Weakly Labelled Audio Tagging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose a novel two-stream framework for audio tagging by exploiting the global and local information of sound events. |
H. Wang; Y. Zou; W. Wang; |
72 | An Improved Mean Teacher Based Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an improved mean teacher (MT) based method for large-scale weakly labeled semi-supervised sound event detection (SED), by focusing on learning a better student model. |
X. Zheng; Y. Song; I. McLoughlin; L. Liu; L. -R. Dai; |
73 | Comparison of Deep Co-Training and Mean-Teacher Approaches for Semi-Supervised Audio Tagging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we adapted the Deep-Co-Training algorithm (DCT) to perform AT, and compared it to another SSL approach called Mean Teacher (MT), that has been used by the winning participants of the DCASE competitions these last two years. |
L. Cances; T. Pellegrini; |
74 | The Benefit of Temporally-Strong Labels in Audio Event Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To reveal the importance of temporal precision in ground truth audio event labels, we collected precise (~0.1 sec resolution) strong labels for a portion of the AudioSet dataset. |
S. Hershey; et al. |
75 | Unsupervised Contrastive Learning of Sound Event Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. |
E. Fonseca; D. Ortego; K. McGuinness; N. E. O�Connor; X. Serra; |
76 | Sound Event Detection By Consistency Training and Pseudo-Labeling With Feature-Pyramid Convolutional Recurrent Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To exploit large amount of unlabeled in-domain data efficiently, we applied three semi-supervised learning strategies: interpolation consistency training (ICT), shift consistency training (SCT), and weakly pseudo-labeling. |
C. -Y. Koh; Y. -S. Chen; Y. -W. Liu; M. R. Bai; |
77 | SESQA: Semi-Supervised Learning for Speech Quality Assessment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we tackle these problems with a semi-supervised learning approach, combining available annotations with programmatically generated data, and using 3 different optimization criteria together with 5 complementary auxiliary tasks. |
J. Serr�; J. Pons; S. Pascual; |
78 | Detecting Signal Corruptions in Voice Recordings For Speech Therapy Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this article we design an experimental setup to detect disturbances in voice recordings, such as additive noise, clipping, infrasound and random muting. |
H. Nyl�n; S. Chatterjee; S. Ternstr�m; |
79 | MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose MBNet, a MOS predictor with a mean subnet and a bias subnet to better utilize every judge score in MOS datasets, where the mean subnet is used to predict the mean score of each utterance similar to that in previous works, and the bias subnet to predict the bias score (the difference between the mean score and each individual judge score) and capture the personal preference of individual judges. |
Y. Leng; X. Tan; S. Zhao; F. Soong; X. -Y. Li; T. Qin; |
80 | Non-Intrusive Binaural Prediction of Speech Intelligibility Based on Phoneme Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we explore an approach for modeling speech intelligibility in spatial acoustic scenes. |
J. Ro�bach; S. R�ttges; C. F. Hauth; T. Brand; B. T. Meyer; |
81 | Warp-Q: Quality Prediction for Generative Neural Speech Codecs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present WARP-Q, a full-reference objective speech quality metric that uses dynamic time warping cost for MFCC speech representations. |
W. A. Jassim; J. Skoglund; M. Chinen; A. Hines; |
82 | Crowdsourcing Approach for Subjective Evaluation of Echo Impairment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We then introduce an open-source crowdsourcing approach for subjective evaluation of echo impairment which can be used to evaluate the performance of AECs. |
R. Cutler; B. Nadari; M. Loide; S. Sootla; A. Saabas; |
83 | Amplitude Matching: Majorization�Minimization Algorithm for Sound Field Control Only with Amplitude Constraint Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A sound field control method for synthesizing a desired amplitude distribution inside a target region, amplitude matching, is proposed. |
S. Koyama; T. Amakasu; N. Ueno; H. Saruwatari; |
84 | 3D Multizone Soundfield Reproduction in A Reverberant Environment Using Intensity Matching Method Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We address this challenge and propose a multizone reproduction method for 3D soundfield in a reverberant room based on intensity matching. |
H. Zuo; T. D. Abhayapala; P. N. Samarasinghe; |
85 | The Far-Field Equatorial Array for Binaural Rendering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a method for obtaining a spherical harmonic representation of a sound field based on a microphone array along the equator of a rigid spherical scatterer. |
J. Ahrens; H. Helmholz; D. L. Alon; S. V. A. Gar�; |
86 | Spherical Harmonic Representation for Dynamic Sound-Field Measurements Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a new physical interpretation of the dynamic sampling problem. |
F. Katzberg; M. Maass; A. Mertins; |
87 | Direction Preserving Wind Noise Reduction Of B-Format Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, methods to reduce wind noise while limiting the spatial distortions of the original signal are proposed based on recent works of the present authors. |
A. Herzog; D. Mirabilii; E. A. P. Habets; |
88 | Refinement of Direction of Arrival Estimators By Majorization-Minimization Optimization on The Array Manifold Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Unlike most conventional methods that rely exclusively on grid search, we introduce a continuous optimization algorithm to refine DOA estimates beyond the resolution of the initial grid. |
R. Scheibler; M. Togami; |
89 | On The Predictability of Hrtfs from Ear Shapes Using Deep Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Using 3D ear shapes as inputs, we explore the bounds of HRTF predictability using deep neural networks. |
Y. Zhou; H. Jiang; V. K. Ithapu; |
90 | Applied Methods for Sparse Sampling of Head-Related Transfer Functions Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the application of two methods for ear-aligned HRTF interpolation by sparse sampling: Orthogonal Matching Pursuit and Principal Component Analysis. |
L. Arbel; Z. Ben-Hur; D. L. Alon; B. Rafaely; |
91 | Personalized HRTF Modeling Using DNN-Augmented BEM Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new deep learning method that combines measurements and numerical simulations to take the best of three worlds. |
M. Zhang; J. -H. Wang; D. L. James; |
92 | Efficient Training Data Generation for Phase-Based DOA Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a low complexity online data generation method to train DL models with a phase-based feature input. |
F. H�bner; W. Mack; E. A. P. Habets; |
93 | Acoustic Reflectors Localization from Stereo Recordings Using Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a fully convolutional network (FCN) that localizes reflective surfaces under the relaxed assumptions that (i) a compact array of only two microphones is available, (ii) emitter and receivers are not synchronized, and (iii) both the excitation signals and the impulse responses of the enclosures are unknown. |
G. Bologni; R. Heusdens; J. Martinez; |
94 | Detecting Acoustic Reflectors Using A Robot�s Ego-Noise Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a method to estimate the proximity of an acoustic reflector, e.g., a wall, using ego-noise, i.e., the noise produced by the moving parts of a listening robot. |
U. Saqib; A. Deleforge; J. R. Jensen; |
95 | Prediction of Object Geometry from Acoustic Scattering Using Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The present work proposes a method to infer object geometry from scattering features by training convolutional neural networks. |
Z. Fan; V. Vineet; C. Lu; T. W. Wu; K. McMullen; |
96 | Blind Amplitude Estimation of Early Room Reflections Using Alternating Least Squares Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents a preliminary attempt to blindly estimate reflection amplitudes. |
T. Shlomo; B. Rafaely; |
97 | Acoustic Analysis and Dataset of Transitions Between Coupled Rooms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents the measurement and analysis of a dataset of spatial room impulse responses for the transition between four coupled room pairs. |
T. McKenzie; S. J. Schlecht; V. Pulkki; |
98 | On Loss Functions for Deep-Learning Based T60 Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a composite classification- and regression-based cost function for training a deep neural network that predicts T60 for a variety of reverberant signals. |
Y. Li; Y. Liu; D. S. Williamson; |
99 | Towards Listening to 10 People Simultaneously: An Efficient Permutation Invariant Training of Audio Source Separation Using Sinkhorn�s Algorithm Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome this limitation, this paper proposes a SinkPIT, a novel variant of the PIT losses, which is much more efficient than the ordinary PIT loss when N is large. |
H. Tachibana; |
100 | Accelerating Auxiliary Function-Based Independent Vector Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we investigate techniques which accelerate the convergence of the AuxIVA update rules without extra computational cost. |
A. Brendel; W. Kellermann; |
101 | One-Shot Conditional Audio Filtering of Arbitrary Sounds Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider the problem of separating a particular sound source from a single-channel mixture, based on only a short sample of the target source (from the same recording). |
B. Gfeller; D. Roblek; M. Tagliasacchi; |
102 | Low Latency Online Blind Source Separation Based on Joint Optimization with Blind Dereverberation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a method to solve this problem by integrating BSS with Weighted Prediction Error (WPE) based dereverberation. |
T. Ueda; T. Nakatani; R. Ikeshita; K. Kinoshita; S. Araki; S. Makino; |
103 | Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes a joint blind source separation and dereverberation method that works adaptively and efficiently in a reverberant noisy environment. |
K. Sekiguchi; Y. Bando; A. A. Nugraha; M. Fontaine; K. Yoshii; |
104 | Phase Recovery with Bregman Divergences for Audio Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to reformulate phase recovery in audio source separation as a minimization problem involving Bregman divergences. |
P. Magron; P. -H. Vial; T. Oberlin; C. F�votte; |
105 | Adversarial Attacks on Audio Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we reformulate various adversarial attack methods for the audio source separation problem and intensively investigate them under different attack conditions and target models. |
N. Takahashi; S. Inoue; Y. Mitsufuji; |
106 | Maximum A Posteriori Estimator for Convolutive Sound Source Separation with Sub-Source Based NTF Model and The Localization Probabilistic Prior on The Mixing Matrix Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we present a method for the separation of sound source signals recorded using multiple microphones in a reverberant room. |
M. Fras; K. Kowalczyk; |
107 | Unified Gradient Reweighting for Model Biasing with Applications to Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a simple, unified gradient reweighting scheme, with a lightweight modification to bias the learning process of a model and steer it towards a certain distribution of results. |
E. Tzinis; D. Bralios; P. Smaragdis; |
108 | Melon Playlist Dataset: A Public Dataset for Audio-Based Playlist Generation and Music Tagging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present Melon Playlist Dataset, a public dataset of mel-spectrograms for 649,091 tracks and 148,826 associated playlists annotated by 30,652 different tags. |
A. Ferraro; et al. |
109 | Investigating The Efficacy of Music Version Retrieval Systems for Setlist Identification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an end-to-end workflow that identifies relevant metadata and timestamps of live music performances using a version identification system. |
F. Yesiler; E. Molina; J. Serr�; E. G�mez; |
110 | Instrument Classification of Solo Sheet Music Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we train AWD-LSTM, GPT-2, and RoBERTa models on solo sheet music images from IMSLP for eight different instruments. |
K. Ji; D. Yang; T. Tsai; |
111 | Bytecover: Cover Song Identification Via Multi-Loss Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present in this paper ByteCover, which is a new feature learning method for cover song identification (CSI). |
X. Du; Z. Yu; B. Zhu; X. Chen; Z. Ma; |
112 | Multi-Task Self-Supervised Pre-Training for Music Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we apply self-supervised and multi-task learning methods for pre-training music encoders, and explore various design choices including encoder architectures, weighting mechanisms to combine losses from multiple tasks, and worker selections of pretext tasks. |
H. -H. Wu; et al. |
113 | Towards Explaining Expressive Qualities in Piano Recordings: Transfer of Explanatory Features Via Acoustic Domain Adaptation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we show that by utilising unsupervised domain adaptation together with receptive-field regularised deep neural networks, it is possible to significantly improve generalisation to this domain. |
S. Chowdhury; G. Widmer; |
114 | Supervised Chorus Detection for Popular Music Using Convolutional Neural Network and Multi-Task Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a novel supervised approach to detecting the chorus segments in popular music. |
J. -C. Wang; J. B. L. Smith; J. Chen; X. Song; Y. Wang; |
115 | Structure-Aware Audio-to-Score Alignment Using Progressively Dilated Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel method to detect such differences between the score and performance for a given piece of music using progressively dilated convolutional neural networks. |
R. Agrawal; D. Wolff; S. Dixon; |
116 | Language-Sensitive Music Emotion Recognition Models: Are We Really There Yet? Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents additional investigation on our approach, which reveals that: (1) performing pretraining with speech in a mixture of languages yields similar results than for specific languages – the pretraining phase appears not to exploit particular language features, (2) the music in Mandarin dataset consistently results in poor classification performance – we found low agreement in annotations, and (3) novel methodologies for representation learning (Contrastive Predictive Coding) may exploit features from both languages (i.e., pretraining on a mixture of languages) and improve classification of music emotions in both languages. |
J. S. G�mez-Ca��n; E. Cano; A. G. Pandrea; P. Herrera; E. G�mez; |
117 | Leveraging The Structure of Musical Preference in Content-Aware Music Recommendation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose instead to leverage a model of musical preference which originates from the field of music psychology. |
P. Magron; C. F�votte; |
118 | Low Resource Audio-To-Lyrics Alignment from Polyphonic Music Recordings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we present a novel method that performs audio-to-lyrics alignment with a low memory consumption footprint regardless of the duration of the music recording. |
E. Demirel; S. Ahlb�ck; S. Dixon; |
119 | Multimodal Metric Learning for Tag-Based Music Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate three ideas to successfully introduce multimodal metric learning for tag-based music retrieval: elaborate triplet sampling, acoustic and cultural music information, and domain-specific word embeddings. |
M. Won; S. Oramas; O. Nieto; F. Gouyon; X. Serra; |
120 | Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we propose a method for learning audio representations using an audio autoencoder (AAE), a general word embed-dings model (WEM), and a multi-head self-attention (MHA) mechanism. |
X. Favory; K. Drossos; T. Virtanen; X. Serra; |
121 | Efficient End-to-End Audio Embeddings Generation for Audio Classification on Target Applications Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe a general-purpose end-to-end audio embeddings generator that can be easily adapted to various acoustic scene and event classification applications. |
P. Lopez-Meyer; J. A. del Hoyo Ontiveros; H. Lu; G. Stemmer; |
122 | Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on such, we propose the text-to-audio grounding (TAG) task, which interactively considers the relationship be-tween audio processing and language understanding. |
X. Xu; H. Dinkel; M. Wu; K. Yu; |
123 | Multi-View Audio And Music Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose in this work a multi-view learning approach for audio and music classification. |
H. Phan; et al. |
124 | Audio-Visual Event Recognition Through The Lens of Adversary Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work aims to study several key questions related to multimodal learning through the lens of adversarial noises: 1) The trade-off between early/middle/late fusion affecting its robustness and accuracy 2) How does different frequency/time domain features contribute to the robustness? |
J. B. Li; K. Ma; S. Qu; P. -Y. Huang; F. Metze; |
125 | DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose three architectures of deep neural networks that are integrated to simultaneously perform acoustic scene classification, audio tagging, and sound event detection. |
J. -w. Jung; H. -j. Shim; J. -h. Kim; H. -J. Yu; |
126 | A Curated Dataset of Urban Scenes for Audio-Visual Scene Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a curated dataset of urban scenes for audio-visual scene analysis which consists of carefully selected and recorded material. |
S. Wang; A. Mesaros; T. Heittola; T. Virtanen; |
127 | Improving Sound Event Detection Metrics: Insights from DCASE 2020 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper compares conventional event-based and segment-based criteria against the Polyphonic Sound Detection Score (PSDS)’s intersection-based criterion, over a selection of systems from DCASE 2020 Challenge Task 4. |
G. Ferroni; et al. |
128 | Artificially Synthesising Data for Audio Classification and Segmentation to Improve Speech and Music Detection in Radio Broadcast Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we present a novel procedure that artificially synthesises data that resembles radio signals. |
S. Venkatesh; et al. |
129 | LSSED: A Large-Scale Dataset and Benchmark for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present LSSED, a challenging large-scale english speech emotion dataset, which has data collected from 820 subjects to simulate real- world distribution. |
W. Fan; X. Xu; X. Xing; W. Chen; D. Huang; |
130 | Enhancing Audio Augmentation Methods with Consistency Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the use of training objectives that explicitly impose this consistency constraint, and how it can impact downstream audio classification tasks. |
T. Iqbal; K. Helwani; A. Krishnaswamy; W. Wang; |
131 | Fast Threshold Optimization for Multi-Label Audio Tagging Using Surrogate Gradient Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we consider having at disposal a trained classifier and we seek to automatically optimize the decision thresholds according to a performance metric of interest, in our case F-measure (micro-F1). |
T. Pellegrini; T. Masquelier; |
132 | Towards Efficient Models for Real-Time Deep Noise Suppression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we investigate reasonably small recurrent and convolutional-recurrent network architectures for speech enhancement, trained on a large dataset considering also reverberation. |
S. Braun; H. Gamper; C. K. A. Reddy; I. Tashev; |
133 | Teacher-Student Learning for Low-Latency Online Speech Enhancement Using Wave-U-Net Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a low-latency online extension of wave-U-net for single-channel speech enhancement, which utilizes teacher-student learning to reduce the system latency while keeping the enhancement performance high. |
S. Nakaoka; L. Li; S. Inoue; S. Makino; |
134 | Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address such mismatch, we propose to learn noise-agnostic feature representations by disentanglement learning, which removes the unspecified noise factor, while keeping the specified factors of variation associated with the clean speech. |
N. Hou; C. Xu; E. S. Chng; H. Li; |
135 | Speech Enhancement Autoencoder with Hierarchical Latent Structure Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A new hierarchical convolutional neural network-based autoencoder architecture called SEHAE (Speech Enhancement Hierarchical AutoEncoder) is introduced, in which the latent representation is decomposed into several parts that correspond to different scales. |
K. Oostermeijer; J. Du; Q. Wang; C. -H. Lee; |
136 | Variational Autoencoder for Speech Enhancement with A Noise-Aware Encoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To increase the robustness of the VAE, we propose to include noise information in the training phase by using a noise-aware encoder trained on noisy-clean speech pairs. |
H. Fang; G. Carbajal; S. Wermter; T. Gerkmann; |
137 | Guided Variational Autoencoder for Speech Enhancement with A Supervised Classifier Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to guide the variational autoencoder with a supervised classifier separately trained on noisy speech. |
G. Carbajal; J. Richter; T. Gerkmann; |
138 | An Extension of Sparse Audio Declipper to Multiple Measurement Vectors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes formulating declipping as a constrained multiple measurement vector (MMV) optimization problem that has a ${\ell _{2,0}}$ group norm as its cost function for further improving the state-of-the-art declipping method SParse Audio DEclipper (SPADE). |
S. Emura; N. Harada; |
139 | Real-Time Speech Frequency Bandwidth Extension Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose a lightweight model for frequency bandwidth extension of speech signals, increasing the sampling frequency from 8kHz to 16kHz while restoring the high frequency content to a level almost indistinguishable from the 16kHz ground truth. |
Y. Li; M. Tagliasacchi; O. Rybakov; V. Ungureanu; D. Roblek; |
140 | Bandwidth Extension Is All You Need Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a new bandwidth extension (BWE) method that expands 8-16kHz speech signals to 48kHz. |
J. Su; Y. Wang; A. Finkelstein; Z. Jin; |
141 | Audio Dequantization Using (Co)Sparse (Non)Convex Methods Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: It reviews the state-of-the-art sparsity-based approaches and proposes several new methods. |
P. Z�vi�ka; P. Rajmic; O. Mokr�; |
142 | Source-Aware Neural Speech Coding for Noisy Speech Compression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a novel neural network-based speech coding system that can process noisy speech effectively. |
H. Yang; K. Zhen; S. Beack; M. Kim; |
143 | Enhancing Into The Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on VQ-VAE autoencoders with WaveRNN decoders, we develop compressor-enhancer encoders and accompanying decoders, and show that they operate well in noisy conditions. |
J. Casebeer; V. Vale; U. Isik; J. -M. Valin; R. Giri; A. Krishnaswamy; |
144 | Speech Enhancement with Mixture of Deep Experts with Clean Clustering Pre-Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study we present a mixture of deep experts (MoDE) neural-network architecture for single microphone speech enhancement. |
S. E. Chazan; J. Goldberger; S. Gannot; |
145 | A Novel NMF-HMM Speech Enhancement Algorithm Based on Poisson Mixture Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM) based speech enhancement algorithm, which employs a Poisson mixture model (PMM). |
Y. Xiang; L. Shi; J. L. H�jvang; M. H�jfeldt Rasmussen; M. G. Christensen; |
146 | Phoneme-Based Distribution Regularization for Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we aim to bridge this gap by extracting phoneme identities to help speech enhancement. |
Y. Liu; X. Peng; Z. Xiong; Y. Lu; |
147 | Compressed Representation of Cepstral Coefficients Via Recurrent Neural Networks for Informed Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate a hybrid strategy made of signal processing and RNN (Recurrent Neural Networks) to calculate and compress cepstral coefficients: these are descriptors of the speech signal, which can be embedded in the signal itself and used at the receiver?s end to perform an Informed Speech Enhancement. |
C. Chermaz; D. Leuchtmann; S. Tanner; R. Wattenhofer; |
148 | Optimizing Short-Time Fourier Transform Parameters Via Gradient Descent Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we show an approach that allows us to obtain a gradient for STFT parameters with respect to arbitrary cost functions, and thus enable the ability to employ gradient descent optimization of quantities like the STFT window length, or the STFT hop size. |
A. Zhao; K. Subramani; P. Smaragdis; |
149 | Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we present an approach to geometry calibration in wireless acoustic sensor networks, whose nodes are assumed to be equipped with a compact microphone array. |
T. Gburrek; J. Schmalenstroeer; R. Haeb-Umbach; |
150 | On The Design of Square Differential Microphone Arrays with A Multistage Structure Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: It presents a multistage approach, which first divides an SDMA composed of M2 microphones into (M – 1)2 subarrays with each subarray being a 2 ? 2 square array formed by four adjacent microphones. |
X. Zhao; G. Huang; J. Benesty; J. Chen; I. Cohen; |
151 | Arrays of First-Order Steerable Differential Microphones Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider arbitrarily shaped planar arrays of DMA units. |
F. Borra; A. Bernardini; I. Bertuletti; F. Antonacci; A. Sarti; |
152 | Planar Array Geometry Optimization for Region Sound Acquisition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper studies the problem of geometry optimization for planar arrays and it develops a genetic optimization algorithm that can optimize the positions of the sensors, thereby maximizing the directivity factor (DF) with a constrained level of white noise gain (WNG) given the number of microphones, the region in which they should be placed, and the interested range of steering. |
X. Chen; C. Pan; J. Chen; J. Benesty; |
153 | Estimation of Microphone Clusters in Acoustic Sensor Networks Using Unsupervised Federated Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we present a privacy-aware method for estimating source-dominated microphone clusters in the context of acoustic sensor networks (ASNs). |
A. Nelus; R. Glitza; R. Martin; |
154 | Misalignment Recognition in Acoustic Sensor Networks Using A Semi-Supervised Source Estimation Method and Markov Random Fields Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider the problem of acoustic source localization by acoustic sensor networks (ASNs) using a promising, learning-based technique that adapts to the acoustic environment. |
G. F. Miller; A. Brendel; W. Kellermann; S. Gannot; |
155 | Rotation-Robust Beamforming Based on Sound Field Interpolation with Regularly Circular Microphone Array Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel framework of beamforming robust for a microphone array rotation. |
Y. Wakabayashi; K. Yamaoka; N. Ono; |
156 | Sparse Recovery Beamforming and Upscaling in The Ray Space Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we explore a method to upscale an array beyond the limits imposed by the inter-microphone distances associated with the array and the concomitant spatial aliasing. |
S. Yu; C. Jin; F. Antonacci; A. Sarti; |
157 | Combined Differential Beamforming With Uniform Linear Microphone Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: It presents a method for the design of differential beamformers with uniform linear arrays. |
G. Huang; Y. Wang; J. Benesty; I. Cohen; J. Chen; |
158 | Polynomial Matrix Eigenvalue Decomposition of Spherical Harmonics for Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a PEVD algorithm that uses only the lower dimension eigenbeams for speech enhancement at a significantly lower computation cost. |
V. W. Neo; C. Evers; P. A. Naylor; |
159 | A Parametric Unconstrained Binaural Beamformer Based Noise Reduction and Spatial Cue Preservation for Hearing-Assistive Devices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a parametric unconstrained binaural (PUB) beamformer, which can achieve a trade-off between noise reduction and binaural cue preservation. |
J. Zhang; |
160 | A Simplified Wiener Beamformer Based on Covariance Matrix Modelling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To deal with this problem, we propose a general method by parametric modeling the covariance matrices of speech and noise, which leads to a simplified Wiener beamformer. |
F. Zhang; C. Pan; J. Benesty; J. Chen; |
161 | Control Architecture of The Double-Cross-Correlation Processor for Sampling-Rate-Offset Estimation in Acoustic Sensor Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper converts the mechanism of offline multi-stage processing into a continuous feedback-control loop comprising a controlled ASRC unit followed by an online implementation of DXCP-based SRO estimation. |
A. Chinaev; S. Wienand; G. Enzner; |
162 | Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed a new algorithmic extension of RCSCME. |
Y. Kondo; Y. Kubo; N. Takamune; D. Kitamura; H. Saruwatari; |
163 | Reducing Modal Error Propagation Through Correcting Mismatched Microphone Gains Using Rapid Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A method for reducing the error propagation in modes by correcting the mismatched microphone gains is proposed, where RAndom PerturbatIons for Diffuse-field (RAPID) is used to design filters for correcting the mismatch. |
N. Akbar; G. Dickins; M. R. P. Thomas; P. Samarasinghe; T. Abhayapala; |
164 | Evaluation and Comparison of Three Source Direction-of-Arrival Estimators Using Relative Harmonic Coefficients Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a compact evaluation and comparison between two existing RHC based DOA estimators: (i) a method using a full grid search over the two-dimensional (2-D) directional space, (ii) a decoupled estimator which uses one-dimensional (1-D) search to separately localize the source’s elevation and azimuth. |
Y. Hu; P. N. Samarasinghe; S. Gannot; T. D. Abhayapala; |
165 | Network-Aware Optimal Microphone Channel Selection in Wireless Acoustic Sensor Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address the vital problem of selecting the most useful microphones in wireless acoustic sensor networks, this paper proposes a novel, general-purpose approach that accounts for both acoustic and network aspects and remains application-agnostic for broad applicability. |
M. Gunther; H. Afifi; A. Brendel; H. Karl; W. Kellermann; |
166 | Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a supervised DP-RTF learning method with deep neural networks for robust binaural sound source localization. |
B. Yang; X. Li; H. Liu; |
167 | Cross-Modal Spectrum Transformation Network for Acoustic Scene Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we introduce an acoustic spectrum transformation network where traditional log-mel spectrums are transformed into imagined visual features (IVF). |
Y. Liu; A. Neophytou; S. Sengupta; E. Sommerlade; |
168 | Domestic Activities Clustering From Audio Recordings Using Convolutional Capsule Autoencoder Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we propose a method for domestic activities clustering using a convolutional capsule autoencoder network (CCAN). |
Z. Lin; et al. |
169 | Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a benchmark of state-of-the-art sound event detection systems (SED). |
N. Turpault; et al. |
170 | A Two-Stage Approach to Device-Robust Acoustic Scene Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To improve device robustness, a highly desirable key feature of a competitive data-driven acoustic scene classification (ASC) system, a novel two-stage system based on fully convolutional neural networks (CNNs) is proposed. |
H. Hu; et al. |
171 | Subspectral Normalization for Neural Audio Data Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce SubSpectral Normalization (SSN), which splits the input frequency dimension into several groups (sub-bands) and performs a different normalization for each group. |
S. Chang; H. Park; J. Cho; H. Park; S. Yun; K. Hwang; |
172 | Slow-Fast Auditory Streams for Audio Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a two-stream convolutional network for audio recognition, that operates on time-frequency spectrogram inputs. |
E. Kazakos; A. Nagrani; A. Zisserman; D. Damen; |
173 | Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the impact of sound duration and inactive frames on SED performance by introducing four loss functions, such as simple reweighting loss, inverse frequency loss, asymmetric focal loss, and focal batch Tversky loss. |
K. Imoto; S. Mishima; Y. Arai; R. Kondo; |
174 | A New DCASE 2017 Rare Sound Event Detection Benchmark Under Equal Training Data: CRNN With Multi-Width Kernels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a new CRNN model for rare SED. |
J. Baumann; P. Meyer; T. Lohrenz; A. Roy; M. Papendieck; T. Fingscheidt; |
175 | Room Adaptive Conditioning Method for Sound Event Classification in Reverberant Environments Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To alleviate this problem, we propose a conditioning method that provides room impulse response (RIR) information to help the network become less sensitive to environmental information and focus on classifying the desired sound. |
J. Lee; D. Lee; H. -S. Choi; K. Lee; |
176 | Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To utilize the curriculum learning, we propose a new objective function for SED, wherein the events are trained from easy-to difficult-to-train events. |
N. Tonami; K. Imoto; Y. Okamoto; T. Fukumori; Y. Yamashita; |
177 | Sound Event Detection in Urban Audio with Single and Multi-Rate Pcen Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this article, we experiment using PCEN spectrograms as an alternative method for SED in urban audio using the UrbanSED dataset, demonstrating per-class improvements based on parameter configuration. |
C. Ick; B. McFee; |
178 | An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Two open problems are addressed in this paper. Firstly, to detect overlapping sound events of the same type but with different DoAs, we propose to use a trackwise output format and solve the accompanying track permutation problem with permutation-invariant training. Multi-head self-attention is further used to separate tracks. Secondly, a previous finding is that, by using hard parameter-sharing, SELD suffers from a performance loss compared with learning the subtasks separately. |
Y. Cao; T. Iqbal; Q. Kong; F. An; W. Wang; M. D. Plumbley; |
179 | Lightweight and Interpretable Neural Modeling of An Audio Distortion Effect Using Hyperconditioned Differentiable Biquads Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose using differentiable cascaded biquads to model an audio distortion effect. |
S. Nercessian; A. Sarroff; K. J. Werner; |
180 | Attacking and Defending Behind A Psychoacoustics-Based Captcha Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel audio CAPTCHA system that requires a user to respond immediately after hearing a short and easy-to-remember cue in its mixture with background music. |
C. -H. Huang; P. -H. Wu; Y. -W. Liu; S. -H. Wu; |
181 | Double-DCCCAE: Estimation of Body Gestures From Speech Waveform Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an approach for body-motion estimation from audio-speech waveform, where context information in both input and output streams is taken in to account without using recurrent models. |
J. Lu; T. Liu; S. Xu; H. Shimodaira; |
182 | Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper first proposes a topic model for audio descriptions, comprehensively analyzing the hierarchical audio topics that are commonly covered. We then explore a transfer learning scheme to access local and global information. |
X. Xu; H. Dinkel; M. Wu; Z. Xie; K. Yu; |
183 | Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose Memory-Self-Attention (MSA), which adds history information into the Restricted-Self-Attention unit. |
J. Luo; J. Wang; N. Cheng; J. Xiao; |
184 | Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these problems, we propose an activity-coupled Cartesian DOA (ACCDOA) representation, which assigns a sound event activity to the length of a corresponding Cartesian DOA vector. |
K. Shimada; Y. Koyama; N. Takahashi; S. Takahashi; Y. Mitsufuji; |
185 | Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel framework based on variational auto-encoding Wasserstein generative adversarial network (VAW-GAN), which makes use of a pre-trained speech emotion recognition (SER) model to transfer emotional style during training and at run-time inference. |
K. Zhou; B. Sisman; R. Liu; H. Li; |
186 | U-Convolution Based Residual Echo Suppression with Multiple Encoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an efficient end-to-end neural network that can estimate near-end speech using a U-convolution block by exploiting various signals to achieve residual echo suppression (RES). |
E. Kim; J. -J. Jeon; H. Seo; |
187 | A Multi-Channel Temporal Attention Convolutional Neural Network Model for Environmental Sound Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an effective convolutional neural network structure with a multichannel temporal attention (MCTA) block, which applies a temporal attention mechanism within each channel of the embedded features to extract channel-wise relevant temporal information. |
Y. Wang; C. Feng; D. V. Anderson; |
188 | A General Network Architecture for Sound Event Localization and Detection Using Transfer Learning and Recurrent Neural Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a general network architecture for SELD in which the SELD network comprises sub-networks that are pre-trained to solve SED and DOA estimation independently, and a recurrent layer that combines the SED and DOA estimation outputs into SELD outputs. |
T. N. T. Nguyen; et al. |
189 | Robust Recursive Least M-Estimate Adaptive Filter for The Identification of Low-Rank Acoustic Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To identify acoustic systems (which are low-rank in nature) in non-Gaussian and Gaussian noise, a robust recursive least M-estimate adaptive filtering algorithm is developed in this paper by applying the nearest Kronecker product to decompose the acoustic impulse response. |
H. He; J. Chen; J. Benesty; Y. Yu; |
190 | Noise-Robust Adaptation Control for Supervised Acoustic System Identification Exploiting A Noise Dictionary Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a noise-robust adaptation control strategy for block-online supervised acoustic system identification by exploiting a noise dictionary. |
T. Haubner; A. Brendel; M. Elminshawi; W. Kellermann; |
191 | Interpolation of Irregularly Sampled Frequency Response Functions Using Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose to use Convolutional Autoencoders (CA) for Frequency Response Function (FRF) interpolation from grids with different subsampling schemes. |
M. Acerbi; R. Malvermi; M. Pezzoli; F. Antonacci; A. Sarti; R. Corradi; |
192 | Effective Rank-Based Estimation of The Coherent-to-Diffuse Power Ratio Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A CDR estimator whose design is based on this premise is devised in this contribution. |
H. W. L�llmann; A. Brendel; W. Kellermann; |
193 | Room Impulse Response Interpolation from A Sparse Set of Measurements Using A Modal Architecture Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel method for 2D interpolation of room modes from a sparse set of RIR measurements that are non-uniformly sampled within a space. |
O. Das; P. Calamia; S. V. Amengual Gari; |
194 | Processing Pipelines for Efficient, Physically-Accurate Simulation of Microphone Array Signals in Dynamic Sound Scenes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A new approach, in which the filter kernels are obtained using principal component analysis from time-aligned impulse responses, is proposed. |
A. H. Moore; R. R. Vos; P. A. Naylor; M. Brookes; |
195 | A Classifier for Improving Cause and Effect in SSVEP-based BCIs for Individuals with Complex Communication Disorders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present CCACUSUM, a classifier for steady-state visual evoked potential (SSVEP)-based brain-computer interfaces (BCIs) that determines whether a user is attending to a flickering stimulus or is at rest. |
H. Habibzadeh; O. Zhou; J. J. S. Norton; T. M. Vaughan; D. -S. Zois; |
196 | Saga: Sparse Adversarial Attack on EEG-Based Brain Computer Interface Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we conduct the first in-depth study on the robustness of EEG analytics under sparse perturbations and propose the first Sparse Adversarial eeG Attack, SAGA, to identify weakness of EEG analytics. |
B. Feng; Y. Wang; Y. Ding; |
197 | Riemannian Geometry on Connectivity for Clinical BCI Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To increase the accuracy of BCI systems, we propose an approach grounded on Riemannian geometry that extends this framework to functional connectivity measures. |
M. -C. Corsi; F. Yger; S. Chevallier; C. No�s; |
198 | Decoding Music Attention from �EEG Headphones�: A User-Friendly Auditory Brain-Computer Interface Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel BCI system using music stimuli that relies on brain signals collected via Smartfones, an EEG recording device integrated into a pair of headphones. |
W. W. An; et al. |
199 | Mitigating Inter-Subject Brain Signal Variability FOR EEG-Based Driver Fatigue State Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a subject- independent EEG-based driver fatigue state (i.e., awake, tired, and drowsy) classification model that mitigates a performance gap between subjects. |
S. Hwang; S. Park; D. Kim; J. Lee; H. Byun; |
200 | A Deep Spatio-Temporal Model for EEG-Based Imagined Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Consequently, in this work, we propose an imagined speech Brain-Computer-Interface (BCI) using Electroencephalogram (EEG) signals. |
P. Kumar; E. Scheme; |
201 | Incorporating Uncertainty In Data Labeling Into Detection of Brain Interictal Epileptiform Discharges From EEG Using Weighted Optimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we incorporate this probability in an IED detection system which combines spatial component analysis (SCA) with the IED probabilities referred to as SCA-IEDP-based method. |
B. Abdi-Sargezeh; A. Valentin; G. Alarcon; S. Sanei; |
202 | Multi-Level Reversible Encryption for ECG Signals Using Compressive Sensing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a compressive sensing based multi-level encryption to ECG signals to mask possible heartbeat anomalies from semi-authorized users, while preserving the beat structure for heart rate monitoring. |
M. Impi�; M. Yama�; J. Raitoharju; |
203 | Validating The Inspired Sinewave Technique to Measure Lung Heterogeneity Compared to Atelectasis & Over-Distended Volume in Computed Tomography Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Six anaesthetised pigs were studied after surfactant depletion by saline-lavage. |
M. C. Tran; et al. |
204 | A Patient-Invariant Model for Freezing of Gait Detection Aided By Wavelet Decomposition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a method for online detection of FoG using a wearable motion sensor. |
N. Ahmed; S. Singhal; V. Sharma; S. Bhattacharya; A. Sinha; A. Ghose; |
205 | Identification of Uterine Contractions By An Ensemble of Gaussian Processes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study contraction identification by processing noisy signals due to uterine activities. |
L. Yang; C. Heiselman; J. Gerald Quirk; P. M. Djuri; |
206 | Arrhythmia Classification with Heartbeat-Aware Transformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed a novel neural network model which treats typical heartbeat classification task as ?Translation? problem. |
B. Wang; C. Liu; C. Hu; X. Liu; J. Cao; |
207 | Multi-Level Group Testing with Application to One-Shot Pooled COVID-19 Tests Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we study pooling-based COVID-19 tests. |
A. Cohen; N. Shlezinger; A. Solomon; Y. C. Eldar; M. M�dard; |
208 | Detection of Covid-19 Through The Analysis of Vocal Fold Oscillations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our goal is to validate this hypothesis, and to quantitatively characterize the changes observed to enable the detection of COVID-19 from voice. |
M. Al Ismail; S. Deshmukh; R. Singh; |
209 | Ct-Caps: Feature Extraction-Based Automated Framework for Covid-19 Disease Identification From Chest Ct Scans Using Capsule Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a Capsule network framework, referred to as the CT-CAPS, is presented to automatically extract distinctive features of chest CT scans. |
S. Heidarian; et al. |
210 | Few-Shot Learning for Ct Scan Based Covid-19 Diagnosis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to tackle the above issues, we propose a supervised domain adaption based COVID-19 CT diagnostic method which can perform effectively when only a small samples of labeled CT scans are available. |
Y. Jiang; H. Chen; H. Ko; D. K. Han; |
211 | Graph-Based Pyramid Global Context Reasoning With A Saliency- Aware Projection for Covid-19 Lung Infections Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle these issues, we propose a Graph-based Pyramid Global Context Reasoning (Graph-PGCR) module, which is capable of modeling long-range dependencies among disjoint infections as well as adapt size variation. |
H. Huang; et al. |
212 | Interpreting Glottal Flow Dynamics for Detecting Covid-19 From Voice Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a method that analyzes the differential dynamics of the glottal flow waveform (GFW) during voice production to identify features in them that are most significant for the detection of COVID-19 from voice. |
S. Deshmukh; M. Al Ismail; R. Singh; |
213 | Cycle Generative Adversarial Network Approaches to Produce Novel Portable Chest X-Rays Images for Covid-19 Diagnosis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, given the low availability of images of this recent disease, we present new approaches to artificially increase the dimensionality of portable chest X-ray datasets for COVID-19 diagnosis. |
D. I. Mor�s; J. de Moura; J. Novo; M. Ortega; |
214 | EEG-Based Emotion Classification Using Graph Signal Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we bring to bear graph signal processing (GSP) techniques to tackle the problem of automatic emotion recognition using brain signals. |
S. S. Saboksayr; G. Mateos; M. Cetin; |
215 | Granger Causality Based Directional Phase-Amplitude Coupling Measure Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a Granger causality (GC) based approach to estimate the direction of PAC. |
T. T. K. Munia; S. Aviyente; |
216 | REPAC: Reliable Estimation of Phase-Amplitude Coupling in Brain Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This contribution presents REPAC, a reliable and robust algorithm for modeling and detecting PAC events in EEG signals. |
G. Cisotto; |
217 | Subspace Oddity – Optimization on Product of Stiefel Manifolds for EEG Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel similarity-based classification method that relies on dimensionality reduction of EEG covariance matrices. |
M. Sayu Yamamoto; F. Yger; S. Chevallier; |
218 | Decentralized Motion Inference and Registration of Neuropixel Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a new registration method to partially correct for this motion. |
E. Varol; et al. |
219 | Dynamic Graph Learning Based on Graph Laplacian Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The purpose of this paper is to infer a global (collective) model of time-varying responses of a set of nodes as a dynamic graph, where the individual time series are respectively observed at each of the nodes. |
B. Jiang; Y. Yu; H. Krim; S. L. Smith; |
220 | Mutual Information Flows in A Bivariate Point Process Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we address that question using mutual information flows and establish a connection with Granger causality. |
S. Ahmed Pasha; V. Solo; |
221 | Uncertainty-Based Biological Age Estimation of Brain MRI Scans Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this initial study, we propose a new framework for organ-specific BA estimation utilizing 3D magnetic resonance image (MRI) scans. |
K. Armanious; S. Abdulatif; W. Shi; T. Hepp; S. Gatidis; B. Yang; |
222 | Sparse Representation of Complex-Valued FMRI Data Based on Hard Thresholding of Spatial Source Phase Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study proposes a sparse representation method using SSP hard thresholding to achieve the sparsity of spatial components, enabling the use of initially complex-valued fMRI data and retaining the brain information embedded in noisy voxels and weak BOLD-related voxels with small phase values. |
J. -Y. Song; M. -Y. Qi; D. -P. Lv; C. -Y. Zhang; Q. -H. Lin; V. D. Calhoun; |
223 | Tucker Decomposition for Extracting Shared and Individual Spatial Maps from Multi-Subject Resting-State FMRI Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study proposes to decompose multi-subject fMRI data in a natural three-way of voxel ? time ? subject via TKD. |
Y. Han; Q. -H. Lin; L. -D. Kuang; X. -F. Gong; F. Cong; V. D. Calhoun; |
224 | Riemannian Geometry-Based Decoding of The Directional Focus of Auditory Attention Using EEG Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we propose Riemannian geometry-based classification (RGC) as an alternative for this CSP approach, in which the covariance matrix of a new EEG segment is directly classified while taking its Riemannian structure into account. |
S. Geirnaert; T. Francart; A. Bertrand; |
225 | DFDM: A Deep Feature Decoupling Module for Lung Nodule Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel feature decoupling method to tackle two critical problems in the lung nodule segmentation task: (i) ambiguity of nodule boundary leads to the imprecise segmentation boundary and (ii) the high false positive rate of segmentation result. |
W. Chen; Q. Wang; S. Huang; X. Zhang; Y. Li; C. Liu; |
226 | Pyramid U-Net for Retinal Vessel Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose pyramid U-Net for accurate retinal vessel segmentation. |
J. Zhang; Y. Zhang; X. Xu; |
227 | A Probabilistic Model for Segmentation of Ambiguous 3D Lung Nodule Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end we propose a probabilistic generative segmentation model consisting of a V-Net and a conditional variational autoencoder. |
X. Long; et al. |
228 | Semi-Supervised Skin Lesion Segmentation with Learning Model Confidence Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, to solve this issue, we propose a novel confidence aware semi-supervised learning method based on a mean teacher scheme. |
Z. Xie; E. Tu; H. Zheng; Y. Gu; J. Yang; |
229 | A Hybrid Feature Enhancement Method for Gl And Segmentation In Histopathology Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a hybrid feature enhancement network (HFE-Net) for glandular segmentation is proposed, which includes a multi-scale local feature extraction block (MSLFEB) and a global feature enhancement block (GFEB). |
X. Wu; X. Li; K. Hu; Z. Chen; X. Gao; |
230 | Automated Multi-Organ Segmentation in Pet Images Using Cascaded Training of A 3d U-Net and Convolutional Autoencoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: As this transfer of information from CT/MRI to the PET domain is not always feasible, e.g. when the corresponding CT or MRI images are unavailable or corrupted by artifacts, we propose a novel approach to perform organ segmentation on the PET images directly. |
A. Liebgott; C. Lorenz; S. Gatidis; V. C. Vu; K. Nikolaou; B. Yang; |
231 | Improved Supervised Training of Physics-Guided Deep Learning Image Reconstruction with Multi-Masking Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we propose to improve the performance and robustness of supervised training by utilizing randomness by retrospectively selecting only a subset of all the available measurements for data consistency units. |
B. Yaman; S. A. H. Hosseini; S. Moeller; M. Ak�akaya; |
232 | Fine-Grained Mri Reconstruction Using Attentive Selection Generative Adversarial Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by the state-of-the-art methods in image generation, we propose a novel attention-based deep learning framework to provide high-quality MRI reconstruction. |
J. Liu; M. Yaghoobi; |
233 | Ensure: Ensemble Stein�s Unbiased Risk Estimator for Unsupervised Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an ENsemble SURE (ENSURE) approach to train a deep network only from undersampled measurements. |
H. K. Aggarwal; A. Pramanik; M. Jacob; |
234 | Ultrasound Elasticity Imaging Using Physics-Based Models and Learning-Based Plug-and-Play Priors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Integrating learning-based priors with physical forward models for ultrasound elasticity imaging, we present a joint reconstruction framework which guarantees that learning driven reconstructions are consistent with the underlying physics. |
N. Mohammadi; M. M. Doyley; M. Cetin; |
235 | A Periodic Frame Learning Approach for Accurate Landmark Localization in M-Mode Echocardiography Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel two-stage frame-level detection and heatmap regression model for accurate landmark localization in m-mode echocardiography, which promotes better integration between global context information and local appearance. |
Y. Tian; S. Xu; L. Guo; F. Cong; |
236 | A Bias-Reducing Loss Function for CT Image Denoising Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel approach to designing a loss function that penalizes variance and bias differently. |
M. Nagare; R. Melnyk; O. Rahman; K. D. Sauer; C. A. Bouman; |
237 | Learning Binary Semantic Embedding for Breast Histology Image Classification and Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issues, we propose a novel method for Learning Binary Semantic Embedding (LBSE). |
X. Kang; X. Liu; X. Nie; Y. Yin; |
238 | Channel Attention Residual U-Net for Retinal Vessel Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a new deep learning model, namely Channel Attention Residual U-Net (CAR-UNet), to accurately segment retinal vascular and non-vascular pixels. |
C. Guo; M. Szemenyei; Y. Hu; W. Wang; W. Zhou; Y. Yi; |
239 | CMIM: Cross-Modal Information Maximization For Medical Imaging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an innovative framework that makes the most of available data by learning good representations of a multi-modal input that are resilient to modality dropping at test-time, using recent advances in mutual information maximization. |
T. Sylvain; et al. |
240 | Structure-Enhanced Attentive Learning For Spine Segmentation From Ultrasound Volume Projection Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel framework to improve the segmentation accuracy on spine images via structure-enhanced attentive learning. |
R. Zhao; et al. |
241 | Foveal Avascular Zone Segmentation of Octa Images Using Deep Learning Approach with Unsupervised Vessel Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To simultaneously implement vessel and accurate FAZ segmentation, an end-to-end trained network is proposed to achieve unsupervised vessel segmentation and supervised FAZ segmentation. |
Z. Liang; J. Zhang; C. An; |
242 | Acute Lymphoblastic Leukemia Detection Based on Adaptive Unsharpening and Deep Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, in this paper we propose the first machine learning-based approach able to enhance blood sample images by an adaptive unsharpening method. |
A. Genovese; M. S. Hosseini; V. Piuri; K. N. Plataniotis; F. Scotti; |
243 | Meta Ordinal Weighting Net For Improving Lung Nodule Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a Meta Ordinal Weighting Network (MOW-Net) to explicitly align each training sample with a meta ordinal set (MOS) containing a few samples from all classes. |
Y. Lei; H. Shan; J. Zhang; |
244 | Deepnodule: Multi-Task Learning of Segmentation Bootstrap for Pulmonary Nodule Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome those barriers, we present a novel multi-task 3D convolutional network (DeepNodule) for simultaneous nodule detection and segmentation in a shared-and-fined manner. |
J. Li; K. Wang; D. Yang; X. Zhang; C. Liu; |
245 | Dense Attention Module for Accurate Pulmonary Nodule Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel pulmonary nodule detection framework and a novel 3D dense attention module (DAM) which can efficiently exploit the abundant 3D spatial features. |
J. Liu; J. Li; F. Xue; C. Wu; |
246 | Unsupervised Multimodal Image Registration with Adaptative Gradient Guidance Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel multimodal registration framework, which leverages the deformation fields estimated from both: (i) the original to-be-registered image pair, (ii) their corresponding gradient intensity maps, and adaptively fuses them with the proposed gated fusion module. |
Z. Xu; J. Yan; J. Luo; X. Li; J. Jagadeesan; |
247 | Improving Intraoperative Liver Registration in Image-Guided Surgery with Learning-Based Reconstruction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome the problems caused by noisy, partial, and sparse intraoperative sampling, we propose a novel occupancy-learning-based mesh to point cloud registration and apply it to align the preoperative liver image to intraoperative samples. |
M. Jia; M. Kyan; |
248 | A New Framework Based on Transfer Learning for Cross-Database Pneumonia Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed a new framework based on transfer learning for cross-database pneumonia detection. |
X. Shan; Y. Wen; |
249 | Hierarchical Attention-Based Temporal Convolutional Networks for Eeg-Based Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to tackle these problems, a hierarchical attention-based temporal convolutional networks (HATCN) for efficient EEG-based emotion recognition is proposed. |
C. Li; B. Chen; Z. Zhao; N. Cummins; B. W. Schuller; |
250 | Deep Multiway Canonical Correlation Analysis For Multi-Subject Eeg Normalization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a deep learning framework to improve the correlation of electroencephalography (EEG) data recorded from multiple subjects engaged in an audio listening task. |
J. R. Katthi; S. Ganapathy; |
251 | Dynamic Graph Modeling Of Simultaneous EEG And Eye-Tracking Data For Reading Task Identification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a new approach, that we call AdaGTCN, for identifying human reader intent from Electroencephalogram (EEG) and Eye movement (EM) data in order to help differentiate between normal reading and task-oriented reading. |
P. Mathur; T. Mittal; D. Manocha; |
252 | Learning From Heterogeneous Eeg Signals with Differentiable Channel Reordering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose CHARM, a method for training a single neural network across inconsistent input channels. |
A. Saeed; D. Grangier; O. Pietquin; N. Zeghidour; |
253 | Enhancing Multi-Channel Eeg Classification with Gramian Temporal Generative Adversarial Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel method to synthesize multi-channel EEG in the form of Gramian Angular Field (GAF) images with a Gramian Temporal Generative Adversarial Network (GT-GAN). |
C. N. Enoch Kan; R. J. Povinelli; D. H. Ye; |
254 | A Novel Convolutional Neural Network Model to Remove Muscle Artifacts from EEG Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we introduce a novel convolutional neural network (CNN) with gradually ascending feature dimensions and downsampling in time series for removing muscle artifacts in EEG data. |
H. Zhang; C. Wei; M. Zhao; Q. Liu; H. Wu; |
255 | Multilabel 12-Lead Electrocardiogram Classification Using Beat to Sequence Autoencoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the multi-label, multi-class classification of ECG records into one or more of 27 possible medical diagnoses. |
A. W. Wong; A. Salimi; A. Hindle; S. V. Kalmady; P. Kaul; |
256 | Contrastive Embeddind Learning Method for Respiratory Sound Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address the problems, we propose a contrastive embedding learning method, where the input is a contrastive tuple. |
W. Song; J. Han; H. Song; |
257 | Decoding Neural Representations of Rhythmic Sounds From Magnetoencephalography Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we investigate how to extract rhythmic information embedded in the brain responses and to decode the original audio waveforms from the extracted information. |
P. -C. Chang; et al. |
258 | Low-Dimensional Denoising Embedding Transformer for ECG Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new method for ECG classification, called low-dimensional denoising embedding transformer (LDTF), which contains two components, i.e., low-dimensional denoising embedding (LDE) and transformer learning. |
J. Guan; W. Wang; P. Feng; X. Wang; W. Wang; |
259 | Self-Supervised Learning for Sleep Stage Classification with Predictive and Discriminative Contrastive Coding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The purpose of this paper is to learn efficient representations from raw electroencephalogram (EEG) signals for sleep stage classification via self-supervised learning (SSL). |
Q. Xiao; et al. |
260 | Length No Longer Matters: A Real Length Adaptive Arrhythmia Classification Model with Multi-Scale Convolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these problems, we propose a length adaptive arrhythmia classification model that can take advantage of raw ECG records of variable length. |
C. Han; F. Yu; P. Wang; R. Huang; X. Huang; L. Cui; |
261 | Few-Shot Learning for Decoding Surface Electromyography for Hand Gesture Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, in this work, we develop a novel hand gesture recognition framework based on the formulation of FewShot Learning (FSL) to infer the required output given only one or a few numbers of training examples. |
E. Rahimian; S. Zabihi; A. Asif; S. F. Atashzar; A. Mohammadi; |
262 | Deeplung Auscultation Using Acoustic Biomarkers for Abnormal Respiratory Sound Event Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to use two sets of diversified acoustic biomarkers extracted using Discrete Wavelet Transform (DWT) and deep encoded features from the intermediate layer of a pre-trained Audio Event Detection (AED) model trained using sounds from daily activities. |
U. Tiwari; S. Bhosale; R. Chakraborty; S. K. Kopparapu; |
263 | Speaker-Independent Brain Enhanced Speech Denoising Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel deep learning method referred to as the Brain Enhanced Speech Denoiser (BESD), that takes advantage of the attended auditory information present in the brain activity of the listener to denoise a multi-talker speech. |
M. Hosseini; L. Celotti; �. Plourde; |
264 | Shapelet Based Visual Assessment of Cluster Tendency in Analyzing Complex Upper Limb Motion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an unsupervised method for shapelet extraction using maximin shape sampling and shape-based distance computation for selecting key shapelets representing characteristic motion patterns. |
S. Datta; C. Karmakar; P. Rathore; M. Palaniswami; |
265 | Human-Centered Favorite Music Classification Using EEG-Based Individual Music Preference Via Deep Time-Series CCA Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A method to classify a user?s like or dislike musical pieces based on the extraction of his or her music preference is proposed in this paper. |
R. Sawata; T. Ogawa; M. Haseyama; |
266 | Multi-Scale and Multi-Region Facial Discriminative Representation for Automatic Depression Level Prediction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For these reasons, we propose a multi-scale and multi-region fa-cial dynamic representation method to improve the prediction performance. |
M. Niu; J. Tao; B. Liu; |
267 | ECG Heart-Beat Classification Using Multimodal Image Fusion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel Image Fusion Model (IFM) for ECG heart-beat classification to overcome the weaknesses of existing machine learning techniques that rely either on manual feature extraction or direct utilization of 1D raw ECG signal. |
Z. Ahmad; A. Tabassum; L. Guan; N. Khan; |
268 | Estimation of Visual Features of Viewed Image From Individual and Shared Brain Information Based on FMRI Data Using Probabilistic Generative Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a method for estimation of visual features based on brain responses measured when subjects view images. |
T. Higashi; K. Maeda; T. Ogawa; M. Haseyama; |
269 | Hierarchical Pose Classification for Infant Action Analysis and Mental Development Assessment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a hierarchical pose classifier, given a baby image frame that com-bines the benefits of 3D human pose estimation and scene context information. |
J. Zhou; Z. Jiang; J. -H. Yoo; J. -N. Hwang; |
270 | On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures and Breathing Parameters Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates whether there is a systematic relationship between the different objective measures used for training and evaluating the neural network models and the end-goal, i.e. estimation of breathing parameters such as, breathing rate and tidal volume. |
Z. Mostaani; V. Srikanth Nallanthighal; A. H�rm�; H. Strik; M. Magimai-Doss; |
271 | Prediction of Egfr Mutation Status in Lung Adenocarcinoma Using Multi-Source Feature Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we propose a hybrid framework, namely HC-DLR, to noninvasively predict EGFR mutation status by fusing multi-source features including low-level handcrafted radiomics (HCR) features, high-level deep learning-based radiomics (DLR) features, and demographics features. |
J. Cheng; J. Liu; M. Jiang; H. Yue; L. Wu; J. Wang; |
272 | Training Neural Networks with Domain Pattern-Aware Auxiliary Task for Sleep Staging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Accordingly, we present an auxiliary classification task for sleep staging to enable NNs to exploit clinically significant EEG patterns in data. |
T. Lee; J. Hwang; H. Lee; |
273 | Classification of Expert-Novice Level Using Eye Tracking And Motion Data Via Conditional Multimodal Variational Autoencoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a semi-supervised anomaly detection approach that requires only sensor data of experts for training and identifies those of novices as anomalies. |
Y. Akamatsu; K. Maeda; T. Ogawa; M. Haseyama; |
274 | Gate Trimming: One-Shot Channel Pruning for Efficient Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a one-shot global pruning approach called Gate Trimming (GT), which is more efficient to compress the CNNs. |
F. Yu; C. Han; P. Wang; X. Huang; L. Cui; |
275 | Deep S3PR: Simultaneous Source Separation and Phase Retrieval Using Deep Generative Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we demonstrate that by restricting the solutions to lie in the range of a deep generative model, we can constrain the search space sufficiently to solve S3PR.Code associated with this work is available at https://github.com/computational-imaging/DeepS3PR. |
C. A. Metzler; G. Wetzstein; |
276 | Adversarial Attacks on Object Detectors with Limited Perturbations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel attack framework named DTTACK to fool both one-stage and two-stage object detectors with limited perturbations. |
Z. Shi; et al. |
277 | A Consensus Equilibrium Solution For Deep Image Prior Powered By Red Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we formulate DeepRED as a consensus equilibrium problem and set up a fixed-point algorithm for solving the equilibrium equations. |
R. Hyder; H. Mansour; Y. Ma; P. T. Boufounos; P. Wang; |
278 | Suremap: Predicting Uncertainty in Cnn-Based Image Reconstructions Using Stein�s Unbiased Risk Estimate Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we use Stein?s unbiased risk estimate (SURE) to develop per-pixel confidence intervals, in the form of heatmaps, for compressive sensing reconstruction using the approximate message passing (AMP) framework with CNN-based denoisers. |
R. Kitichotkul; C. A. Metzler; F. Ong; G. Wetzstein; |
279 | Multi-Initialization Meta-Learning with Domain Adaptation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To improve the performance on multi-modal tasks, we propose multi-initialization meta-learning with domain adaptation (MIML-DA) to tackle such domain shift. |
Z. Chen; D. Wang; |
280 | Stochastic Deep Unfolding for Imaging Inverse Problems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose SCRED-Net as a novel methodology that introduces a stochastic approximation to the unfolded regularization by denoising (RED) algorithm. |
J. Liu; Y. Sun; W. Gan; X. Xu; B. Wohlberg; U. S. Kamilov; |
281 | Fusion-Based Digital Image Correlation Framework for Strain Measurement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome this issue, we propose an end-to-end DIC framework incorporating the image fusion principle to achieve full-field strain measurement over the curved surface. |
L. Shi; D. Liu; M. Umeda; N. Hana; |
282 | Learning Sparsifying Transforms for Image Reconstruction in Electrical Impedance Tomography Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a blind compressed sensing algorithm, dubbed TL-EIT, which simultaneously optimizes the sparsifying transform and updates the reconstructed image. |
K. Yang; N. Borijindargoon; B. P. Ng; S. Ravishankar; B. Wen; |
283 | D-VDAMP: Denoising-Based Approximate Message Passing for Compressive MRI Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a CNN architecture for removing colored Gaussian noise and combine it with the recently proposed VDAMP algorithm, whose effective noise follows a predictable colored Gaussian distribution. |
C. A. Metzler; G. Wetzstein; |
284 | Empirically Accelerating Scaled Gradient Projection Using Deep Neural Network for Inverse Problems in Image Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we present a novel DNN-based convergent iterative algorithm that accelerates conventional optimization algorithms. |
B. H. Lee; S. Y. Chun; |
285 | Synthetic Aperture Acoustic Imaging with Deep Generative Model Based Source Distribution Prior Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to image large acoustic sources with a combination of synthetic aperture and their geometric structures modeled by a conditional generative adversarial network (cGAN). |
B. Fan; S. Das; |
286 | Non-Local Single Image DE-Raining Without Decomposition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: On top of a new insight in single image de-raining, a nonlocal de-raining algorithm is proposed in this paper to remove the rain streaks from the rainy image. |
C. Zheng; Z. Li; Y. Li; S. Wu; |
287 | Frame-Rate-Aware Aggregation for Efficient Video Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In contrast to the previous works that perform explicit motion estimation and compensation, we propose a novel deep neural network which performs implicit motion estimation with frame-rate-based temporal aggregation. |
T. Isobe; F. Zhu; S. Wang; |
288 | Measurement Coding Framework with Adjacent Pixels Based Measurement Matrix for Compressively Sensed Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To further compress measurements, the output of block-based compressed sensing, this work presents a measurement coding framework using measurement-domain intra prediction. |
R. Wan; J. Zhou; B. Huang; H. Zeng; Y. Fan; |
289 | Multiview Sensing with Unknown Permutations: An Optimal Transport Approach Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we take a fresh look at this problem through the lens of optimal transport (OT). |
Y. Ma; P. T. Boufounos; H. Mansour; S. Aeron; |
290 | A High-Frame-Rate Eye-Tracking Framework for Mobile Devices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we tackle the tracking efficiency challenge and introduce GazeHFR, a biologic-inspired eye-tracking model specialized for mobile devices, offering both high accuracy and efficiency. |
Y. Chang; C. He; Y. Zhao; T. Lu; N. Gu; |
291 | Catiloc: Camera Image Transformer for Indoor Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper the problem of single image indoor camera localization has been addressed. |
A. Ghofrani; R. M. Toroghi; S. Mojtaba Tabatabaie; |
292 | Sar Image Autofocusing Using Wirtinger Calculus and Cauchy Regularization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, an optimization model using Cauchy regularization is proposed for simultaneous SAR image reconstruction and autofocusing. |
Z. -Y. Zhang; O. Pappas; A. Achim; |
293 | A Homogeneity-Based Multiscale Hyperspectral Image Representation for Sparse Spectral Unmixing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a computationally efficient multiscale representation method for hyperspectral data adapted to the unmixing problem. |
L. C. Ayres; S. J. M. de Almeida; J. C. M. Bermudez; R. A. Borsoi; |
294 | Learning to Estimate Kernel Scale and Orientation of Defocus Blur with Asymmetric Coded Aperture Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle this problem, we propose a deep-learning-based framework estimating the kernel scale and orientation of the defocus blur to ad-just lens focus rapidly. |
J. Li; Q. Dai; J. Wen; |
295 | Transmittance Regularizer for Binary Coded Aperture Design in A Computational Imaging End-to-end Approach Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, this work proposes two transmittance regularizers that jointly induce binary en-tries and adjust the transmittance level to be incorporated in an E2E approach. |
J. Bacca; T. Gelvez; H. Arguello; |
296 | Fourier Transformation Autoencoders for Anomaly Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces Fourier Trans-forms into AutoEncoders to demonstrate how the inclusion of a frequency domain presents less noisy features for a deep learning network to detect anomalies. |
D. Lappas; V. Argyriou; D. Makris; |
297 | Zero-Gradient Constraints for Destriping of Remote-Sensing Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an effective and efficient destriping method for remote-sensing data. |
K. Naganuma; S. Takeyama; S. Ono; |
298 | Selection Based on Statistical Characteristics for Object Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a multi-scale sample selection based on statistical characteristics for object detection. |
Z. Li; Y. Yuan; D. Ma; |
299 | CSPN: Multi-Scale Cascade Spatial Pyramid Network for Object Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem and obtain better detection performance, we propose a novel net-work named Multi-Scale Cascade Spatial Pyramid Network (MS-CSPN) to strengthen Feature Pyramid. |
T. Wang; C. Ma; H. Su; W. Wang; |
300 | Dual-Stream Network Based On Global Guidance for Salient Object Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To remedy the problems, we propose a dual-stream network based on global guidance with two plug-ins, global attention based multi-scale high-level feature extraction module (GAMS) to mine global guidance and scale adaptive global guidance module (SAGG) to seamlessly integrate the global guidance into each decoding layer. |
S. Gao; Q. Guo; W. Zhang; W. Zhang; Z. Ji; |
301 | SSFENet: Spatial and Semantic Feature Enhancement Network for Object Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel network to address this dilemma, denoted as Spatial and Semantic Feature Enhancement Network (SSFENet). |
T. Wang; C. Ma; H. Su; W. Wang; |
302 | Saliency-Driven Versatile Video Coding for Neural Object Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we pro-pose such a saliency-driven coding framework for the video coding for machines task using the latest video coding standard Versatile Video Coding (VVC). |
K. Fischer; F. Fleckenstein; C. Herglotz; A. Kaup; |
303 | Object-Oriented Relational Distillation for Object Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a novel Object-Oriented Relational Distillation (OORD) method that drives small detection models to have an effective performance like large detection models with constant efficiency. |
S. Miao; R. Feng; |
304 | Ensembling Object Detectors for Image and Video Data Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a method for ensembling the outputs of multiple object detectors for improving detection performance and precision of bounding boxes on image data. |
K. Chumachenko; J. Raitoharju; A. Iosifidis; M. Gabbouj; |
305 | Training Real-Time Panoramic Object Detectors with Virtual Dataset Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a panoramic virtual dataset for training object detectors on 360? images. |
Q. -Y. Shen; T. -G. Huang; P. -X. Ding; J. He; |
306 | Fast: Feature Aggregation for Detecting Salient Object in Real-Time Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a method named FAST for real-time salient object detection with an extremely efficient CNN architecture. |
L. Tang; B. Li; Y. Wu; B. Xiao; S. Ding; |
307 | Exploiting The Dual-Tree Complex Wavelet Transform for Ship Wake Detection in SAR Imagery Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we analyse synthetic aperture radar (SAR) images of the sea surface using an inverse problem formulation whereby Radon domain information is enhanced in order to accurately detect ship wakes. |
W. Ma; A. Achim; O. Karakus; |
308 | Task-Related Self-Supervised Learning For Remote Sensing Image Change Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, an unsupervised change detection method based on Task-related Self-supervised Learning Change Detection network with smooth mechanism(TSLCD) is proposed to eliminate it. |
Z. Cai; Z. Jiang; Y. Yuan; |
309 | Unsupervised Common Particular Object Discovery and Localization By Analyzing A Match Graph Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes an unsupervised method that more accurately discovers and localizes common particular objects within a set of images. |
M. Okuda; S. Satoh; Y. Sato; Y. Kidawara; |
310 | Predictive Coding for Lossless Dataset Compression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We prove the equivalence of dataset compression to compressing a permutation-invariant structure of the data and implement such a scheme via predictive coding. |
M. Barowsky; A. Mariona; F. P. Calmon; |
311 | Adaptive Dual Tree Structure For Screen Content Coding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, adaptive dual tree structure is proposed in this paper wherein the coding structure of each coding tree unit is switched between separate and joint coding structure to adapt the textures adaptively. |
W. Zhu; J. Xu; L. Zhang; Y. Wang; |
312 | SNR-Adaptive Deep Joint Source-Channel Coding for Wireless Image Transmission Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Considering the problem of joint source-channel coding (JSCC) for multi-user transmission of images over noisy channels, an autoencoder-based novel deep joint source-channel coding scheme is proposed in this paper. |
M. Ding; J. Li; M. Ma; X. Fan; |
313 | Relying on A Rate Constraint to Reduce Motion Estimation Complexity Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a rate-based candidate elimination strategy for Motion Estimation, which is considered one of the main sources of encoder complexity. |
G. B. Sant�Anna; L. Henrique Cancellier; I. Seidel; M. Grellert; J. L. G�ntzel; |
314 | A Novel Viewport-Adaptive Motion Compensation Technique for Fisheye Video Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel viewport-adaptive motion compensation technique that applies the motion vectors in different perspective viewports in order to realize these motion planes. |
A. Regensky; C. Herglotz; A. Kaup; |
315 | Rate-Distortion Optimized Motion Estimation for On-the-Sphere Compression of 360 Videos Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the on-the-sphere compression [1] for omnidirectional still images is extended to videos. |
A. Marie; N. Mahmoudian Bidgoli; T. Maugey; A. Roumy; |
316 | Adaptive GOP Size Decision for Multi-Pass Video Coding Based on Hidden Markov Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a novel method to determine the size of each group of picture (GOP) using the multi-pass information is presented. |
B. Li; J. Han; Y. Xu; |
317 | Improved Intra Mode Coding Beyond Av1 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, two methods are proposed to further reduce the signaling cost of delta angles: cross-component delta angle coding, and context-adaptive delta angle coding, whereby the cross-component and spatial correlation of the delta angles are explored, respectively. |
Y. Jin; L. Zhao; X. Zhao; S. Liu; A. C. Bovik; |
318 | Decision Tree Based Inter Partition Termination For Av1 Encoding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To deal with this problem, in this paper, we propose a decision tree based algorithm to early terminate the inter prediction process by predicting splitting decisions at each depth. |
X. Chen; Y. Zhang; Y. Li; J. Wen; |
319 | Image Coding For Machines: An End-To-End Learned Approach Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an image codec for machines which is neural network (NN) based and end-to-end learned. |
N. Le; H. Zhang; F. Cricri; R. Ghaznavi-Youvalari; E. Rahtu; |
320 | Sparse Flow Adversarial Model For Robust Image Compression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel compression method called sparse flow adversarial model (SFAM). |
S. Zhao; S. Yang; Z. Liu; Z. Feng; X. Liu; |
321 | HVS-Based Perceptual Color Compression of Image Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel perceptual image coding technique, named Perceptual Color Compression (PCC). |
L. Prangnell; V. Sanchez; |
322 | HOCA: Higher-Order Channel Attention for Single Image Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose a higher-order channel attention (HOCA) module to enhance the representation ability of CNNs. |
Y. Lv; T. Dai; B. Chen; J. Lu; S. -T. Xia; J. Cao; |
323 | Image Super-Resolution Using Multi-Resolution Attention Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these issues, we propose a multi-resolution attention network (MRAN), which progressively reconstructs images at large scale factors by aggregating features from multiple resolutions. |
A. Liu; S. Li; Y. Chang; |
324 | Real Image Super-Resolution Using Token Based Contextual Attention Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To alleviate these issues, we propose a new token based attention module with innovative contextual encoding to enable SR models to be robust to image patch sizes at testing. |
Z. Pan; B. Li; |
325 | Feature Redundancy Mining: Deep Light-Weight Image Super-Resolution Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, by considering the correlation and redundancy of feature maps, we propose a feature information mining network to efficiently investigate the features, for the SISR problem. |
J. Xiao; W. Jia; K. -M. Lam; |
326 | Lightweight Non-Local Network for Image Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these issues, we propose a lightweight non-local network (LNLN) for image super resolution in this paper. |
R. Wang; T. Lei; W. Zhou; Q. Wang; H. Meng; A. K. Nandi; |
327 | Lightweight and Accurate Single Image Super-Resolution with Channel Segregation Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose an efficient channel segregation block containing multiple branches with different depths, enabling the model to preserve basic content, and focusing on optimizing the detail content with fewer parameters. |
Z. -H. Niu; X. -P. Lin; A. -N. Yu; Y. -H. Zhou; Y. -B. Yang; |
328 | Deep Learning Architectural Designs for Super-Resolution Of Noisy Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to jointly perform denoising and super-resolution. |
A. Villar-Corrales; F. Schirrmacher; C. Riess; |
329 | Joint Coupled Transform Learning Framework for Multimodal Image Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we model the cross-modal dependencies between different modalities for Multimodal Image Super-Resolution (MISR), i.e., enhance the Low Resolution (LR) image of target modality with the guidance of a High Resolution (HR) image from another modality. |
A. Gigie; A. A. Kumar; A. Majumdar; K. Kumar; M. G. Chandra; |
330 | Hyperspectral Image Super-Resolution Via Adjacent Spectral Fusion Strategy Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we explore a new structure for hyperspectral image SR via adjacent spectral fusion strategy. |
Q. Li; Q. Wang; X. Li; |
331 | Raw Data Processing for Practical Time-of-Flight Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we note that while attempting to address the last two issues, e. g., via burst mode, the lateral resolution can be effectively increased. |
M. H. Conde; |
332 | Edge-Aware Multi-Scale Progressive Colorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these problems, we propose a novel edge-aware multi-scale progressive network (EMSPN). |
J. Xia; G. Tan; Y. Xiao; F. Xu; C. -S. Leung; |
333 | Learning Representation of Multi-Scale Object for Fine-Grained Image Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In our work, to extract more local features, we propose a method that could proposes multiple discriminative regions on different scales, which could provide more refined local and multi-sacle representation for fine-grained image retrieval. |
K. Sun; J. Zhu; |
334 | Super-Resolution and Infection Edge Detection Co-Guided Learning for Covid-19 Ct Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel super-resolution and infection edge detection co-guided learning network for COVID-19 CT segmentation (CogSeg). |
Y. Sang; J. Sun; S. Wang; H. Qi; K. Li; |
335 | Gating Feature Dense Network for Single Anisotropic Mr Image Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a gating feature dense network to reconstruct HR MR images from low resolution acquisitions, where we use local residual dense block (LRDB) as the backbone. |
W. He; Y. Hu; L. Wang; Z. He; J. Du; |
336 | Adaptable Ensemble Distillation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose Adaptable Ensemble Distillation (AED) that inherits the merits of existing OKD methods while overcoming their major drawbacks. |
Y. Wang; D. Yang; W. Zhang; Z. Jiang; W. Zhang; |
337 | A Scale Invariant Measure of Flatness for Deep Network Minima Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we show that for deep networks with positively homogenous activations, these rescalings constitute equivalence relations, and that these equivalence relations induce a quotient manifold structure in the parameter space. |
A. Rangamani; N. H. Nguyen; A. Kumar; D. Phan; S. P. Chin; T. D. Tran; |
338 | Multi-Order Adversarial Representation Learning for Composed Query Image Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: So this paper proposes a new Multi-order Adversarial Network (MAN) which uses multilevel representations and simultaneously explores their low-order and high-order interactions, obtaining low-order and high-order features. |
Z. Fu; X. Chen; J. Dong; S. Ji; |
339 | Deep Neural Networks with Flexible Complexity While Training Based on Neural Ordinary Differential Equations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we experimentally investigate the effectiveness of using neural ordinary differential equations (NODEs) as a component to provide further depth to relatively shallower networks rather than stacked layers (depth) which achieved improvement with fewer parameters. |
Z. Luo; S. -i. Kamata; Z. Sun; W. Zhou; |
340 | Improving Memory Banks for Unsupervised Learning with Large Mini-Batch, Consistency and Hard Negative Mining Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce 3 improvements to the vanilla memory bank-based formulation which brings massive accuracy gains: (a) Large mini-batch: we pull multiple augmentations for each sample within the same batch and show that this leads to better models and enhanced memory bank updates. |
A. Bulat; E. S�nchez-Lozano; G. Tzimiropoulos; |
341 | Robust Binary Loss for Multi-Category Classification with Label Noise Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address the problem, we propose to train deep models with robust binary loss functions. |
D. Liu; G. Yang; J. Wu; J. Zhao; F. Lv; |
342 | A Plug and Play Fast Intersection Over Union Loss for Boundary Box Regression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we design a Fast Intersection over Union (FIoU) loss, which can not only keep the advantages but also solve the weakness of IoU-based losses. |
Z. Kuang; X. Fang; R. Zhang; X. Shao; H. Wang; |
343 | Attribute Decomposition for Flow-Based Domain Mapping Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To handle the mixed features for better generation, this paper presents an attribute decomposition based on the sequence data and carries out the flow-based image domain mapping. |
S. -J. Huang; J. -T. Chien; |
344 | Ada-Sise: Adaptive Semantic Input Sampling for Efficient Explanation of Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we combine both approaches as a hybrid visual explanation algorithm and propose an efficient interpretation method for convolutional neural networks. |
M. Sudhakar; S. Sattarzadeh; K. N. Plataniotis; J. Jang; Y. Jeong; H. Kim; |
345 | Network Pruning Using Linear Dependency Analysis on Feature Maps Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we regard a channel ?redundant? if its output is linearly dependent with respect to those of other channels. |
H. Pan; Z. Chao; J. Qian; B. Zhuang; S. Wang; J. Xiao; |
346 | Multiple-Input Multiple-Output Fusion Network for Generalized Zero-Shot Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose a Multiple-Input Multiple-Output Fusion Network to GZSL. |
F. Zhong; G. Wang; Z. Chen; X. Yuan; F. Xia; |
347 | Representative Local Feature Mining for Few-Shot Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Given this, we propose a novel method that chooses representative local features to facilitate few-shot learning. |
K. Yan; L. Liu; J. Hou; P. Wang; |
348 | KAN: Knowledge-Augmented Networks for Few-Shot Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, considering that semantic information can enhance understanding when visual information is limited, we propose Knowledge-Augmented Networks (KAN), which combines the visual features with the semantic information extracted from knowledge graph to represent the features of each class. |
Z. Zhu; X. Lin; |
349 | Few-Shot Image Classification with Multi-Facet Prototypes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In particular, we propose an adaptive similarity measure, relying on predicted facet importance weights for a given set of categories. |
K. Yan; Z. Bouraoui; P. Wang; S. Jameel; S. Schockaert; |
350 | Self-Supervised Learning for Few-Shot Image Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed to train a more generalized embedding network with self-supervised learning (SSL) which can provide robust representation for downstream tasks by learning from the data itself. |
D. Chen; Y. Chen; Y. Li; F. Mao; Y. He; H. Xue; |
351 | Domain Adaptation for Learning Generator From Paired Few-Shot Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a Paired Few-shot GAN (PFS-GAN) model for learning generators with sufficient source data and a few target data. |
C. -C. Teng; P. -Y. Chen; W. -C. Chiu; |
352 | Deep Semi-Supervised Metric Learning Via Identification of Manifold Memberships Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a method which allows the use of class-representative anchors (proxies), and avoids the computational costs associated with triplet sampling. |
F. Zhuang; P. Moulin; |
353 | A Ranked Similarity Loss Function with Pair Weighting for Deep Metric Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these problems, we propose to build a ranked similarity loss function with pair weighting (dubbed RMS loss). |
J. Wang; Z. Zhang; D. Huang; W. Song; Q. Wei; X. Li; |
354 | Statistical Distance Metric Learning for Image Set Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we obviate the need of feature aggregation and propose a novel Statistical Distance Metric Learning (SDML) framework, which represents each image set as a probability distribution in embedding feature space and compares two image sets by statistical distance between their distributions. |
T. -Y. Hu; A. G. Hauptmann; |
355 | Distribution-Aware Hierarchical Weighting Method for Deep Metric Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose distribution-aware hierarchical weighting (DHW) method for deep metric learning. |
Y. Zhu; Y. Feng; M. Zhou; B. Qiang; L. Hou U; J. Zhu; |
356 | Integrated Grad-Cam: Sensitivity-Aware Visual Explanation of Deep Convolutional Networks Via Integrated Gradient-Based Scoring Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Addressing this problem, we introduce a solution to tackle this issue by computing the path integral of the gradient-based terms in Grad-CAM. |
S. Sattarzadeh; M. Sudhakar; K. N. Plataniotis; J. Jang; Y. Jeong; H. Kim; |
357 | Visualizing Association in Exemplar-Based Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem, we propose a novel method of explainable classification; this method uses images representing each image class, which we call exemplars. |
T. Kashima; R. Hataya; H. Nakayama; |
358 | HFGCNET: High-Frequency Graph Reasoning for Finer Semantic Image Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents a high-frequency graph convolution operation to solve the above problems. |
Z. Sun; R. Wang; Z. Luo; W. Chen; |
359 | Unsupervised Image Segmentation with Spatial Triplet Markov Trees Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this article we study an extension of HMTs called Spatial Triplet Markov Trees (STMTs) which is designed to greatly increase the correlations of the random variables while keeping the possibility of direct and exact inference procedures. |
H. Gangloff; J. -B. Courbot; E. Monfrini; C. Collet; |
360 | Cross Scene Video Foreground Segmentation Via Co-Occurrence Probability Oriented Supervised and Unsupervised Model Interaction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a cross scene video foreground segmentation framework to extend the generalization capability of those supervised model depending on scene-specific training. |
D. Liang; B. Kang; X. Liu; H. Sun; L. Zhang; N. Liu; |
361 | Instance Segmentation with The Number of Clusters Incorporated in Embedding Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to embed prior clustering information into an embedding learning framework FCRNet, stimulating the one-stage instance segmentation. |
J. Cao; H. Yan; |
362 | Decouple The High-Frequency and Low-Frequency Information of Images for Semantic Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: At present, the semantic segmentation methods are all based on CNN and ignore the advantages of traditional image processing technology. We combine the two and make them promote each other. |
L. Shan; X. Li; W. Wang; |
363 | MPDNet: A 3D Missing Part Detection Network Based on Point Cloud Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle the issue, in this paper, we propose a novel model named MPDNet, which exploits 3D point cloud pairs as input for missing part detection. |
Z. Fan; H. Liu; J. He; M. Zhang; X. Du; |
364 | SM+: Refined Scale Match for Tiny Person Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the scale alignment between pre-training and target datasets, and propose a new refined Scale Match method (termed SM+) for tiny person detection. |
N. Jiang; X. Yu; X. Peng; Y. Gong; Z. Han; |
365 | Sub-Band Grouping Spectral Feature-Attention Block for Hyperspectral Image Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed a novel sub-band grouping recurrent neural network (RNN) model with gated recurrent units (GRUs) to find the intrinsic feature in spectral information. |
W. Zhou; S. -i. Kamata; Z. Luo; |
366 | Unsupervised Stacked Capsule Autoencoder for Hyperspectral Image Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on HSI data’s aforementioned structural characteristics, combined with the stacked capsule autoencoder, we propose our model to achieve an unsupervised HSI classification. |
E. Pan; Y. Ma; X. Mei; F. Fan; J. Ma; |
367 | Robust Graph Autoencoder for Hyperspectral Anomaly Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to tackle these problems, we propose a robust graph autoencoder (RGAE) for hyperspectral anomaly detection. |
G. Fan; Y. Ma; J. Huang; X. Mei; J. Ma; |
368 | Reflectance-Oriented Probabilistic Equalization for Image Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem, we propose a novel 2D histogram equalization approach. |
X. Wu; Y. Sun; A. Kimura; K. Kashino; |
369 | PD-GAN: Perceptual-Details GAN for Extremely Noisy Low Light Image Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve the problems, we pro-pose perceptual-details GAN (PD-GAN) utilizing Zero-DCE to initially recover illumination and combine residual dense-block Encoder-Decoder structure to suppress noise while finely adjusting the illumination. |
Y. Liu; Z. Wang; Y. Zeng; H. Zeng; D. Zhao; |
370 | Heterogeneous Two-Stream Network with Hierarchical Feature Prefusion for Multispectral Pan-Sharpening Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a heterogeneous two-stream network (HTSNet) with hierarchical feature prefusion for MS pan-sharpening. |
D. Wang; Y. Bai; B. Bai; C. Wu; Y. Li; |
371 | Synergic Feature Attention for Image Restoration Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome these problems, in this paper, we propose a novel Synergic Attention Network (SAT-Net) for image restoration as an inventive attempt to combine local and non-local attention mechanisms to restore complex textures and highly repetitive details distinguishingly. |
C. Mou; J. Zhang; |
372 | Efficient Multi-Objective GANs for Image Restoration Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we propose an efficient formulation of multiple loss components for training GANs. |
J. Su; H. Yin; |
373 | Self-Convolution: A Highly-Efficient Operator for Non-Local Image Restoration Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel Self-Convolution operator to exploit image non-local similarity in a self-supervised way. |
L. Guo; Z. Zha; S. Ravishankar; B. Wen; |
374 | NMF-SAE: An Interpretable Sparse Autoencoder for Hyperspectral Unmixing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we combine the advantages of both model-based and learning-based methods and propose a nonnegative matrix factorization (NMF) inspired sparse autoencoder (NMF-SAE) for hyperspectral unmixing. |
F. Xiong; J. Zhou; M. Ye; J. Lu; Y. Qian; |
375 | An ADMM Based Network for Hyperspectral Unmixing Tasks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we use algorithm unrolling approaches in order to design a new neural network structure applicable to hyperspectral unmixing challenges. |
C. Zhou; M. R. D. Rodrigues; |
376 | Variational Autoencoders for Hyperspectral Unmixing with Endmember Variability Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a variational autoencoder (VAE) framework for hyperspectral unmixing accounting for the endmember variability. |
S. Shi; M. Zhao; L. Zhang; J. Chen; |
377 | Augmented Gaussian Linear Mixture Model for Spectral Variability in Hyperspectral Unmixing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel hyperspectral unmixing through the perturbed linear mixture model to take into account the spectral variability offset of the linear mixture model. |
Y. E. Salehani; E. Arabnejad; S. Gazor; |
378 | UTDN: An Unsupervised Two-Stream Dirichlet-Net for Hyperspectral Unmixing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel two-stream Dirichlet-net, termed as uTDN, to address the above problems. |
Q. Jin; Y. Ma; X. Mei; H. Li; J. Ma; |
379 | Laplacian Regularized Tensor Low-Rank Minimization for Hyperspectral Snapshot Compressive Imaging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a tensor-based low-rank reconstruction algorithm with hyper-Laplacian constraint for hyperspectral SCI systems. |
Y. Yang; F. Jiang; H. Lu; |
380 | Compressing Local Descriptor Models for Mobile Applications Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider these practical aspects and improve the state-of-the-art HardNet model through the use of depthwise separable layers and an efficient tensor decomposition. |
R. Miles; K. Mikolajczyk; |
381 | VK-Net: Category-Level Point Cloud Registration with Unsupervised Rotation Invariant Keypoints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose VK-Net, a neural network that learns to discover a set of category-specific keypoints from a single point cloud in an unsupervised manner. |
Z. Chen; W. Yang; Z. Xu; Z. Shi; L. Huang; |
382 | Matching As Color Images: Thermal Image Local Feature Detection and Description Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by this challenge, we propose a triplet based Siamese CNN for feature detection and extraction for any given thermal image. |
B. Deshpande; S. Hanamsheth; Y. Lu; G. Lu; |
383 | Frame Rate Up-Conversion Using Key Point Agnostic Frequency-Selective Mesh-to-Grid Resampling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We use the model-based key point agnostic frequency-selective mesh-to-grid resampling (AFSMR) for this task and show that AFSMR works best for applications that contain irregular meshes with varying densities. |
V. Heimann; A. Spruck; A. Kaup; |
384 | Efficient Real-Time Video Stabilization with A Novel Least Squares Formulation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel video stabilization algorithm (LSstab) that removes unwanted motions in real-time. |
J. Ke; A. J. Watras; J. -J. Kim; H. Liu; H. Jiang; Y. H. Hu; |
385 | Decomposing Textures Using Exponential Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a new approach using a recent 2-dimensional exponential analysis technique. |
Y. Hou; A. Cuyt; W. -s. Lee; D. Bhowmik; |
386 | G-Arrays: Geometric Arrays for Efficient Point Cloud Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel data structure for representing point clouds with a reduced memory requirement and a faster lookup than the state-of-the-art formats. |
H. Roodaki; M. Dehyadegari; M. N. Bojnordi; |
387 | QoE-Driven and Tile-Based Adaptive Streaming for Point Clouds Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, we propose a QoE-driven and tile-based adaptive streaming approach for point clouds, to reduce transmission redundancy and maximize user?s QoE. |
L. Wang; C. Li; W. Dai; J. Zou; H. Xiong; |
388 | Dynamic Point Cloud Compression Using A Cuboid Oriented Discrete Cosine Based Motion Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: An improved commonality modeling technique is proposed that employs discrete cosine basis oriented motion models and the domains of such models are approximated by homogeneous regions called cuboids. |
A. Ahmmed; M. Paul; M. Murshed; D. Taubman; |
389 | An Adaptive Pyramid Single-View Depth Lookup Table Coding Method Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, an adaptive pyramid single-view depth lookup table coding method is proposed, with the purpose of designing a clean syntax structure in the sequence header with reasonably good performance. |
Y. Cai; R. Wang; S. Gu; J. Zhang; W. Gao; |
390 | Patch Decoder-Side Depth Estimation In Mpeg Immersive Video Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a new approach for achieving bitrate and pixel rate reduction in the MPEG immersive video coding setting. |
M. Milovanovic; F. Henry; M. Cagnazzo; J. Jung; |
391 | Geometry Consistency Of Augmented Reality Based On Semantics Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In augmented reality, for achieving geometric consistency in the perspective projection virtual-real, we propose a semantic consistency method to achieve the fusion between virtual and real objects with selected segmented objects in the real scene as references. |
H. Quan; M. Yao; X. Qian; |
392 | What And Where To Focus In Person Search Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For these two findings, we first propose multilevel semantic aggregation algorithm for more discriminative feature descriptors. Then, a pose-assisted attention module is designed to highlight fine-grained area of the target and simultaneously capture valuable clues for identification. |
T. Zhou; K. Tian; |
393 | Stable and Effective One-Step Method for Person Search Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an end-to-end model containing the feature extractor, the region proposal network, and the multi-task learning module. |
N. Lv; X. Xiang; X. Wang; J. Yang; R. Abdeen; A. El Saddik; |
394 | An Adaptive Part-Based Model For Person Re-Identification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address the part-misalignment problem and learn a more discriminative embedding for person Re-ID, we propose a novel Adaptive Part-based Model (APM), which adaptively partition the extracted feature maps by a Partition-Aware module to learn an embedding. |
X. -P. Lin; Y. -B. Yang; |
395 | Crowd Counting Via Multi-Level Regression With Latent Gaussian Maps Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a novel end-to-end crowd counting framework via multi-level regression with latent Gaussian maps is proposed, which is consisted of GaussianNet, EstimateNet and Discriminator. |
Y. Gao; H. Yang; |
396 | Lightweight Dual-Task Networks For Crowd Counting In Aerial Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, this paper proposes a lightweight dual-task network (LDNet) for crowd counting, which only uses bifurcated structure to overcome these new challenges in aerial images without complicated pipelines. |
Y. Tian; C. Duan; R. Zhang; Z. Wei; H. Wang; |
397 | SANet++: Enhanced Scale Aggregation with Densely Connected Feature Fusion for Crowd Counting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present SANet++ with a novel architecture to generate high-quality density maps and further perform accurate counting. |
S. Pan; Y. Zhao; F. Su; Z. Zhao; |
398 | Attentive Semantic Exploring for Manipulated Face Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, we propose a novel manipulated face detection method based on Multilevel Facial Semantic Segmentation and Cascade Attention Mechanism. |
Z. Chen; H. Yang; |
399 | Efficient Face Manipulation Via Deep Feature Disentanglement And Reintegration Net Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose a novel Feature Disentanglement and Reintegraion network (FDRNet), which employs ground-truth images as informative supervision and dynamically adapts the fusion of informative features of the ground-truth images effectively and efficiently. |
B. Cheng; T. Dai; B. Chen; S. Xia; X. Li; |
400 | Continuous Face Aging Generative Adversarial Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose the continuous face aging generative adversarial networks (CFA-GAN). |
S. Jeon; P. Lee; K. Hong; H. Byun; |
401 | Fast Inverse Mapping of Face GANs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We train a ResNet architecture to map given faces to latent vectors that can be used to generate faces nearly identical to the target. |
N. Bayat; V. R. Khazaie; Y. Mohsenzadeh; |
402 | Multi-Level Adaptive Region of Interest and Graph Learning for Facial Action Unit Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel multi-level adaptive ROI and graph learning (MARGL) framework to tackle this problem. |
J. Yan; B. Jiang; J. Wang; Q. Li; C. Wang; S. Pu; |
403 | Bridging Unpaired Facial Photos and Sketches By Line-Drawings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel method to learn face sketch synthesis models by using unpaired data. |
M. Shang; F. Gao; X. Li; J. Zhu; L. Dai; |
404 | Temporal Rain Decomposition with Spatial Structure Guidance for Video Deraining Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a multi-frame deraining network with temporal rain decomposition and spatial structure guidance to more effectively accomplish video deraining. |
X. Xue; Y. Ding; L. Ma; Y. Wang; R. Liu; X. Fan; |
405 | GTA-Net: Gradual Temporal Aggregation Network for Fast Video Deraining Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, to effectively exploit temporal information, we develop a simple but effective network, Gradual Temporal Aggregation Network (GTA-Net for short). |
X. Xue; X. Meng; L. Ma; R. Liu; X. Fan; |
406 | Dense Feature Pyramid Grids Network for Single Image Deraining Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel densely connected network with Dense Feature Pyramid Grids Modules, called DFPGN, to solve the rain removal task. |
Z. Wang; C. Wang; Z. Su; J. Chen; |
407 | A Fast and Efficient Network for Single Image Deraining Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle this problem, we propose a novel Adaptive Dilated Network (ADN) to remove rain streaks from a single image while using less parameters and running faster than previous methods. |
Y. Yang; H. Lu; |
408 | DNANet: Dense Nested Attention Network for Single Image Dehazing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an innovative approach, called Dense Nested Attention Network (DNANet), to directly restore a clear image from a hazy image with a new topology of connection paths. |
D. Ren; J. Li; M. Han; M. Shu; |
409 | FWB-Net: Front White Balance Network for Color Shift Correction in Single Image Dehazing Via Atmospheric Light Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Bearing this in mind, in this study, first, a new non-homogeneous atmospheric scattering model (NH-ASM) is proposed for improving image modeling of hazy images taken under complex illumination conditions. Second, a new U-Net based front white balance module (FWB-Module) is dedicatedly designed to correct color shift before generating dehazing result via atmospheric light estimation. |
C. Wang; Y. Huang; Y. Zou; Y. Xu; |
410 | Learning Integrodifferential Models for Image Denoising Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce an integrodifferential extension of the edge-enhancing anisotropic diffusion model for image denoising. |
T. Alt; J. Weickert; |
411 | Unrolling of Deep Graph Total Variation for Image Denoising Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we combine classical graph signal filtering with deep feature learning into a competitive hybrid design?one that utilizes interpretable analytical low-pass graph filters and employs 80% fewer network parameters than state-of-the-art DL denoising scheme DnCNN. |
H. Vu; G. Cheung; Y. C. Eldar; |
412 | Learning Model-Blind Temporal Denoisers Without Ground Truths Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a general framework for temporal denoising that successfully addresses these challenges. |
Y. Li; B. Guo; J. Wen; Z. Xia; S. Liu; Y. Han; |
413 | Image Denoising Based on Correlation Adaptive Sparse Modeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper aims to fully exploit local and non-local correlation of image contents separately so that near-optimal sparse representations are achieved and thus the uncertainty of signals is minimized. |
H. Liu; J. Zhang; C. Mou; |
414 | NASA: A Noise-Adaptive and Structure-Aware Learning Framework for Image Deblurring Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To settle this issue, we develop a Noise-Adaptive Structure-Aware learning framework (NASA) to achieve fully intelligent manufacturing. |
X. Liu; L. Ma; R. Liu; W. Zhong; X. Fan; Z. Luo; |
415 | Multiple Auxiliary Networks for Single Blind Image Deblurring Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose Multiple Auxiliary Networks (MANet) for single blind image deblurring to assist norm L1-loss function and enhance the quality of the deblurring image. |
C. Li; Q. Wang; S. Liu; X. Li; |
416 | Joint Learning of Image Aesthetic Quality Assessment and Semantic Recognition Based on Feature Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore the relationships between aesthetic quality assessment and semantic recognition task, and employ a multi-task convolutional neural network with feature enhancement mechanism to effectively integrate these two tasks. |
X. Liu; X. Nie; Z. Shen; Y. Yin; |
417 | Nested Error Map Generation Network for No-Reference Image Quality Assessment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a multi-task learning neural network for No-Reference image quality assessment (NR-IQA). |
J. Chen; H. Wang; G. Li; S. Liu; |
418 | Regression or Classification? New Methods to Evaluate No-reference Picture and Video Quality Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To make the problem more tractable, we propose two new methods – binary, and ordinal classification – as alternatives to evaluate and compare no-reference quality models at coarser levels. |
Z. Tu; et al. |
419 | Blind Image Quality Evaluator with Scale Robustness Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a more generalized blind image quality evaluator with scale robustness (BIQESR) to assess image quality by locating the robust feature points in a multi-scale space. |
C. Wang; M. Li; |
420 | Multi-Scale Feature-Guided Stereoscopic Video Quality Assessment Based on 3d Convolutional Neural Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed a multi-scale feature-guided 3D convolutional neural network for SVQA which not only use 3D convolution to capture spatio-temporal features but also aggregate multi-scale information by a new multi-scale unit. |
Y. Feng; S. Li; Y. Chang; |
421 | No-Reference Stereoscopic Image Quality Assessment Based on The Human Visual System Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Recently witnessed the significant progress of biotechnology and motivated by the deeper research on the HVS, we take a step to bridge the gap between HVS and SIQA by generalizing the optic chiasm algorithm and introducing biological vision fusion mechanism in our work. |
F. Meng; S. Li; Y. Chang; |
422 | Stereo Rectification Based on Epipolar Constrained Neural Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel deep neural network-based method for stereo image rectification. |
Y. Wang; Y. Lu; G. Lu; |
423 | Multi-Scale Cascade Disparity Refinement Stereo Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, this paper presents MCDRNet, which combines traditional methods with neural networks to achieve real-time and accurate stereo matching results. |
X. Jia; et al. |
424 | Hierarchical Context Guided Aggregation Network for Stereo Matching Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a simple yet efficient network named Hierarchical Context Guided Aggregation Network (HCGANet). |
J. Peng; W. Xie; Z. Huang; W. Chen; Y. Zhao; |
425 | Cost Affinity Learning Network for Stereo Matching Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel cost affinity learning network(CAL-Net) whose Affinity Enhanced Module(AEM) extracts the affinity of the elements in the cost feature and reconstructs a more discriminative feature. |
S. Chen; B. Li; W. Wang; H. Zhang; H. Li; Z. Wang; |
426 | Video Quality Prediction Using Voxel-Wise FMRI Models of The Visual Cortex Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we address the problem of full-reference video quality prediction. |
N. S. Mahankali; S. S. Channappayya; |
427 | Tensor Decomposition Via Core Tensor Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an efficient TD algorithm that aims to learn a global mapping from input tensors to latent core tensors, under the assumption that the mappings of multiple tensors might be shared or highly correlated. |
J. ZHANG; Z. TAO; L. ZHANG; Q. ZHAO; |
428 | Sign Language Segmentation with Temporal Convolutional Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The objective of this work is to determine the location of temporal boundaries between signs in continuous sign language videos. |
K. Renz; N. C. Stache; S. Albanie; G. Varol; |
429 | An Adaptive Discriminant and Sparsity Feature Descriptor for Finger Vein Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an adaptive discriminant and sparsity feature descriptor (DSFD) for FV feature extraction and recognition. |
S. Li; B. Zhang; |
430 | Routinggan: Routing Age Progression and Regression with Disentangled Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these deficiencies and have the best of both worlds, this paper introduces a dropout-like method based on GAN (RoutingGAN) to route different effects in a high-level semantic feature space. |
Z. Huang; J. Zhang; H. Shan; |
431 | Semantic-Aware Unpaired Image-to-Image Translation for Urban Scene Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle this problem, in this paper, we reasonably modify the previous problem setting and present a novel semantic-aware method. |
Z. Li; R. Togo; T. Ogawa; M. Haseyama; |
432 | Fontnet: On-Device Font Understanding and Prediction Pipeline Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose two engines: Font Detection Engine, which identifies the font style, color and size attributes of text in an image and a Font Prediction Engine, which predicts similar fonts for a query font. |
R. S; R. Khurana; V. Agarwal; J. R. Vachhani; G. Bhanodai; |
433 | Agent-Environment Network for Temporal Action Proposal Generation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on the action definition that a human, known as an agent, interacts with the environment and performs an action that affects the environment, we propose a contextual Agent-Environment Network. |
V. -K. Vo-Ho; N. Le; K. Kamazaki; A. Sugimoto; M. -T. Tran; |
434 | Adaptive Multi-Domain Learning for Outdoor 3d Human Pose and Shape Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we first point out this problem and then address it via a novel cascade multi-domain learning module (CMDL), where multiple adapters are employed to extract more discriminative features for different domains. |
Z. Gui; S. Zhang; K. Wang; J. Yang; P. C. Yuen; |
435 | Lightweight Human Pose Estimation Under Resource-Limited Scenes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the problem of lightweight human pose estimation under resource-limited scenes.We first redesign a lightweight bottleneck block with two concepts: depthwise convolution and attention mechanism. |
Z. Zhang; J. Tang; G. Wu; |
436 | Absolute 3d Pose Estimation and Length Measurement of Severely Deformed Fish from Monocular Videos in Longline Fishing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Unlike related works, which either require expensive 3D ground-truth data and/or multiple-view images to provide depth information, or are limited to rigid objects, we propose a novel frame-based method to estimate the absolute 3D fish pose and fish length from a single-view 2D segmentation mask. |
J. Mei; J. -N. Hwang; S. Romain; C. Rose; B. Moore; K. Magrane; |
437 | Camera Calibration with Pose Guidance Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To resolve above issues, we propose a calibration system called Calibration with Pose Guidance to improve calibration accuracy, reduce calibration variance among different users or different trials of the same person. |
Y. Ren; F. Hu; |
438 | Real Versus Fake 4k – Authentic Resolution Assessment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work aims at authentic resolution assessment (ARA). |
R. R. Shah; V. Anirudh Akundy; Z. Wang; |
439 | Perceptual Quality Assessment for Recognizing True and Pseudo 4k Content Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To meet the imperative demand for monitoring the quality of Ultra High-Definition (UHD) content in multimedia industries, we propose an efficient no-reference (NR) image quality assessment (IQA) metric to distinguish original and pseudo 4K contents and measure the quality of their quality in this paper. |
W. Zhu; G. Zhai; X. Min; X. Yang; X. -P. Zhang; |
440 | A New Tubular Structure Tracking Algorithm Based On Curvature-Penalized Perceptual Grouping Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new minimal path-based framework for minimally interactive tubular structure tracking in conjunction with a perceptual grouping scheme. |
L. Liu; D. Chen; M. Shu; H. Shu; L. D. Cohen; |
441 | Multiple Human Tracking in Non-Specific Coverage with Wearable Cameras Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this problem, in this paper we propose a Markov Decision Process with jump state (JMDP) to model the target?s lifetime in tracking, and use optical flow of the camera motion and the statistical information of the targets to model the camera state transition. |
S. Wang; R. Han; W. Feng; S. Wang; |
442 | Fine-Grained Pose Temporal Memory Module for Video Pose Estimation and Tracking Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To better solve these problems and utilize the temporal information efficiently and effectively, we present a novel structure, called pose temporal memory module, which is flexible to be transferred into top-down pose estimation frameworks. |
C. WANG; et al. |
443 | Drawing Order Recovery from Trajectory Components Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on the idea that drawing trajectories are able to be recovered by connecting their trajectory components in correct orders, this work proposes a novel DOR method from static images. |
M. Yang; X. Zhou; Y. Sun; J. Chen; B. Qiang; |
444 | Deep Hashing for Motion Capture Data Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an efficient retrieval method for human motion capture (MoCap) data based on supervised deep hash code learning. |
N. Lv; Y. Wang; Z. Feng; J. Peng; |
445 | Hierarchical Attention Fusion for Geo-Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we cast the geo-localization as a 2D image retrieval task. |
L. Yan; Y. Cui; Y. Chen; D. Liu; |
446 | AttentionLite: Towards Efficient Self-Attention Models for Vision Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel framework for producing a class of parameter and compute efficient models called AttentionLite suitable for resource constrained applications. |
S. Kundu; S. Sundaresan; |
447 | Attention-Guided Second-Order Pooling Convolutional Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To handle above limitations, this paper proposes a novel attention-guided second-order pooling convolutional network (ASP-Net). |
S. Chen; Q. Sun; C. Li; J. Zhang; Q. Zhang; |
448 | SA-Net: Shuffle Attention for Deep Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an efficient Shuffle Attention (SA) module to address this issue, which adopts Shuffle Units to combine two types of attention mechanisms effectively. |
Q. -L. Zhang; Y. -B. Yang; |
449 | An Attention Based Wavelet Convolutional Model for Visual Saliency Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, an attention based Wavelet Convolutional Neural Network (WCNN) is implemented that efficiently extracts the spatial, spectral and semantic features of the image in multiple resolution and it turns out to be suitable for locating the visually salient regions. |
R. S. Bhooshan; S. K; |
450 | Cascade Attention Fusion for Fine-Grained Image Captioning Based on Multi-Layer LSTM Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a visual and semantic fusion network with a margin-based training guidance mechanism to generate fine image descriptions that depict more objects, attributes and other distinguishing aspects of images. |
S. Wang; et al. |
451 | Webly Supervised Deep Attentive Quantization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem, we propose a novel method termed Webly Supervised Deep Attentive Quantization (WSDAQ), where deep quantization is trained on web images associated with some userprovided weak tags, without consulting any ground-truth labels. |
J. Wang; B. Chen; T. Dai; S. -T. Xia; |
452 | Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this problem, we propose the first multimodal unsupervised transfer learning approach that detects real-world, high-stakes deception in videos with-out using high-stakes labels. |
L. MATHUR; M. J. MATARIc; |
453 | Violence Detection in Videos Based on Fusing Visual and Audio Information Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We proposed a neural network containing three modules for fusing multimodal information: 1) attention module for utilizing weighted features to generate effective features based on the mutual guidance between visual and audio information; 2) fusion module for integrating features by fusing visual and audio information based on the bilinear pooling mechanism; and 3) mutual Learning module for enabling the model to learn visual information from another neural network with a different architecture. |
W. -F. Pang; Q. -H. He; Y. -j. Hu; Y. -X. Li; |
454 | QUERYD: A Video Dataset with High-Quality Text and Audio Narrations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce QuerYD, a new large-scale dataset for retrieval and event localisation in video. |
A. -M. Oncescu; J. F. Henriques; Y. Liu; A. Zisserman; S. Albanie; |
455 | Generating Natural Questions from Images for Multimodal Assistants Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an approach for generating diverse and meaningful questions that consider image content and metadata of image (e.g., location, associated keyword). |
A. Patel; A. Bindal; H. Kotek; C. Klein; J. Williams; |
456 | An Adaptive Multi-Scale and Multi-Level Features Fusion Network with Perceptual Loss for Change Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel adaptive multi-scale and multi-level features fusion network for change detection in very-high-resolution bi-temporal remote sensing images. |
J. Xu; Y. Luo; X. Chen; C. Luo; |
457 | SeeHear: Signer Diarisation and A New Dataset Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a framework to collect a large-scale, diverse sign language dataset that can be used to train automatic sign language recognition models.The first contribution of this work is SDTrack, a generic method for signer tracking and diarisation in the wild. |
S. Albanie; et al. |
458 | Semantic Image Synthesis from Inaccurate and Coarse Masks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose a smoothing method, which we call local label smoothing (LLS), that incorporates label smoothing per small patch of an input mask to learn mapping from masks to images even when semantic masks are inaccurate. |
K. Katsumata; H. Nakayama; |
459 | Range Guided Depth Refinement and Uncertainty-Aware Aggregation for View Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a framework of view synthesis, including range guided depth refinement and uncertainty-aware aggregation based novel view synthesis. |
Y. Chang; Y. Chen; G. Wang; |
460 | DP-VTON: Toward Detail-Preserving Image-Based Virtual Try-on Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To resolve this issue, we present a novel virtual try-on network, DP-VTON. |
Y. Chang; et al. |
461 | Light Field Style Transfer with Local Angular Consistency Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel optimization-based method for light field style transfer which iteratively propagates the style from the centre view towards the outer views while enforcing local angular consistency. |
D. Egan; M. Alain; A. Smolic; |
462 | Skip Attention GAN for Remote Sensing Image Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We establish Skip Attention Mechanism to deal with this problem, which learns dependencies between local points on low-resolution feature maps, and then upsample the attention map and combine it with high-resolution feature maps. |
K. Deng; K. Zhang; P. Yao; S. Cheng; P. He; |
463 | Image Generation Based on Texture Guided VAE-AGAN for Regions of Interest Detection in Remote Sensing Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To cope with this issue, we propose a novel method based on texture guided variational autoencoder-attention wise generative adversarial network (VAE-AGAN) to augment the training data for ROI detection. |
L. Zhang; Y. Liu; |
464 | EADNet: Efficient Asymmetric Dilated Network For Semantic Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an efficient asymmetric dilated semantic segmentation network, named EADNet, which consists of multiple developed asymmetric convolution branches with different dilation rates to capture the variable shapes and scales information of an image. |
Q. Yang; T. Chen; J. Fan; Y. Lu; C. Zuo; Q. Chi; |
465 | Ltaf-Net: Learning Task-Aware Adaptive Features and Refining Mask for Few-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose a novel model named LTAF-Net for few-shot segmentation. |
B. Mao; L. Wang; S. Xiang; C. Pan; |
466 | Cgan-Net: Class-Guided Asymmetric Non-Local Network for Real-Time Semantic Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a Class-Guided Asymmetric Non-local Network (CGAN-Net) to enhance the class-discriminability in learned feature map, while maintaining real-time efficiency. |
H. Chen; Q. Hu; J. Yang; J. Wu; Y. Guo; |
467 | Aggregation Architecture and All-to-one Network for Real-Time Semantic Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we make the following contributions: (i) First, unlike the previous three architectures, we propose a new aggregation architecture as the network back-bone. (ii) Second, a multi-level auxiliary loss design model is used for the training phase, which can improve the model segmentation effect. (iii) According to this aggregation structure, an all-to-one network (ATONet) for real-time semantic segmentation is proposed, which achieves a good trade-off between speed and accuracy by assembling the features of all blocks. |
K. CAO; X. HUANG; J. SHAO; |
468 | Nlkd: Using Coarse Annotations For Semantic Segmentation Based on Knowledge Distillation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a noise learning framework based on knowledge distillation NLKD, to improve segmentation performance on unclean data. |
D. Liang; Y. Du; H. Sun; L. Zhang; N. Liu; M. Wei; |
469 | Knowledge Reasoning for Semantic Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome the limitation of the traditional method, we propose a Knowledge Reasoning Net (KRNet) that consists of two crucial modules: (1) a prior knowledge mapping module that incorporates external knowledge by graph convolutional network to guide learning semantic representations and (2) a knowledge reasoning module that correlates these representations with a graph built on the external knowledge and explores their interactions via the knowledge reasoning. |
S. Chen; Z. Li; X. Yang; |
470 | Non-Convex Sparse Deviation Modeling Via Generative Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the generative model is used to introduce the structural properties of the signal to replace the common sparse hypothesis, and a non-convex compressed sensing sparse deviation model based on the generative model (lq-Gen) is proposed. |
Y. Yang; H. Wang; H. Qiu; J. Wang; Y. Wang; |
471 | Imrnet: An Iterative Motion Compensation and Residual Reconstruction Network for Video Compressed Sensing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an iterative motion compensation and residual reconstruction network for VCS, called ImrNet. |
X. Yang; C. Yang; |
472 | Deep Color Constancy Using Temporal Gradient Under Ac Light Sources Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: While most of conventional methods focus on only spatial information of a single image, we propose a deep spatio-temporal color constancy method. |
J. -W. HA; J. -S. YOO; J. -O. KIM; |
473 | End-to-End Learning of Variational Models and Solvers for The Resolution of Interpolation Problems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider an application to inverse problems with incomplete datasets (image inpainting and multivariate time series interpolation). |
R. Fablet; L. Drumetz; F. Rousseau; |
474 | Multi-Models Fusion for Light Field Angular Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, therefore, we propose a multi-models fusion for LF SR in angular domain. |
F. Cao; P. An; X. Huang; C. Yang; Q. Wu; |
475 | Hide Chopin in The Music: Efficient Information Steganography Via Random Shuffling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore the room introduced by the low-rank property of natural signals (i.e., images, audios), and propose a training-free model for efficient information steganography, which provides a capacity of hiding full-size images into carriers of the same spatial resolution. |
Z. Sun; C. Li; Q. Zhao; |
476 | Pointer Networks for Arbitrary-Shaped Text Spotting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a highly efficient one-stage method named PointerNet for arbitrary-shaped text spotting. |
Y. Zhang; W. Yang; Z. Xu; Y. Li; Z. Chen; L. Huang; |
477 | Rotation Invariance Analysis of Local Convolutional Features in Image Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, our objective is to enhance the robustness of LC features against image rotation. |
L. Zhao; Y. Wang; J. Kato; |
478 | Signature Feature Marking Enhanced IRM Framework for Drone Image Analysis in Precision Agriculture Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we are proposing enhancement to Invariant Risk Minimization (IRM) framework which is Signature Feature Marking (SFM) enhanced IRM for object classification. |
A. Kadethankar; N. Sinha; V. Hegde; A. Burman; |
479 | Vehicle 3d Localization in Road Scenes VIA A Monocular Moving Camera Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an effective vehicle 3D localization method, that utilizes 2D key-points predicted from a trained CNN to model the vehicles? structure, from which the ground points are further inferred. |
Y. Zhang; A. Zheng; K. Han; Y. Wang; J. -N. Hwang; |
480 | Gps-Denied Navigation Using Sar Images And Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a method for determining the navigation errors present at the beginning of a GPS-denied period utilizing data from a synthetic aperture radar (SAR) system. |
T. White; J. Wheeler; C. Lindstrom; R. Christensen; K. R. Moon; |
481 | Attention-Embedded Decomposed Network with Unpaired CT Images Prior for Metal Artifact Reduction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these issues, we propose an attention-embedded decomposed network to reducing metal artifacts in both image space and sinogram space with unpaired images. |
B. Zhao; Q. Ren; J. Li; Y. Zhao; |
482 | Partial Feature Aggregation Network for Real-Time Object Counting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an efficient and accurate light-weight network for object counting, called Partial Feature Aggregation Network (PFANet). |
H. Yu; L. Zhang; |
483 | A Bayesian Inference Approach for Location-Based Micro Motions Using Radio Frequency Sensing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The target tracking objective is formulated as an inference problem, by which we show how the Bayesian framework can be exploited to infer the parameters of interest for a given physics model. |
D. A. Maluf; A. Elnakeeb; M. Silverman; |
484 | Robust Spatial-Temporal Correlation Model for Background Initialization in Severe Scene Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we develop a novel method called co-occurrence pixel-block (CPB) model via spatial-temporal correlation for robust back-ground initialization. |
Y. Deng; W. Zhou; B. Peng; D. Liang; S. Kaneko; |
485 | 2D-FRFT Based Frequency Shift-Invariant Digital Image Encryption Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study the property of frequency shift in two-dimensional Fractional Fourier Transform (2D-FRFT) domain. |
L. Gao; L. Qi; L. Guan; |
486 | Capturing Banding in Images: Database Construction and Objective Assessment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work targets at capturing and quantifying banding artifacts in images. |
A. Kapoor; J. Sapra; Z. Wang; |
487 | On The Camera Position Dithering In Visual 3d Reconstruction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we adopt statistical analysis for camera dithering in visual 3D reconstruction and demonstrate the improvement of reconstruction accuracy brought by camera dithering under some conditions. |
Q. An; Y. Shen; |
488 | Long-Short Temporal Modeling for Efficient Action Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new two-stream action recognition network, termed as MENet, consisting of a Motion Enhancement (ME) module and a Video-level Aggregation (VLA) module to achieve long-short temporal modeling. |
L. Wu; Y. Zou; C. Zhang; |
489 | Multi-Directional Convolution Networks with Spatial-Temporal Feature Pyramid Module for Action Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel and effective Multi-Directional Convolution (MDConv), which extracts features along different spatial-temporal orientations. |
B. Yang; Z. Wang; W. Ran; H. Lu; Y. -P. P. Chen; |
490 | Unsupervised Motion Representation Enhanced Network for Action Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To fill the gap, we propose UF-TSN, a novel end-to-end action recognition approach enhanced with an embedded lightweight unsupervised optical flow estimator. |
X. Yang; L. Kong; J. Yang; |
491 | An Improved Deep Relation Network for Action Recognition in Still Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose an efficient relation module that combines Human-Object and Scene-Object relations for action recognition. |
W. Wu; J. Yu; |
492 | Human-Aware Coarse-to-Fine Online Action Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a two-stage framework to efficiently and effectively detect actions on-the-fly. |
Z. Yang; D. Huang; J. Qin; Y. Wang; |
493 | SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we explore to remove the requirement of pre-defined anchors for TAD methods. |
R. Ning; C. Zhang; Y. Zou; |
494 | Semantic-Aware Context Aggregation for Image Inpainting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To handle this problem, we propose a novel semantic-aware context aggregation module (SACA) that aggregates distant contextual information from a semantic perspective by exploiting the internal semantic similarity of the input feature map. |
Z. Huang; C. Qin; R. Liu; Z. Weng; Y. Zhu; |
495 | Bishift-Net for Image Inpainting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by this, we propose a new inpainting model, which is called BiShift-Net. |
X. Zhou; T. Dai; Y. Jiang; S. -T. Xia; |
496 | OAS-Net: Occlusion Aware Sampling Network for Accurate Optical Flow Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To deal with these challenges, we propose a lightweight yet efficient optical flow network, named OAS-Net (occlusion aware sampling network) for accurate optical flow. |
L. Kong; X. Yang; J. Yang; |
497 | Mask4D: 4D Convolution Network for Light Field Occlusion Removal Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a simple yet effective LF occlusion removal method name Mask4D, which is a 4D convolution-based encoder-decoder network. |
Y. Li; et al. |
498 | Self-Supervised Depth Estimation Via Implicit Cues from Videos Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we improve the self-supervised learning framework for depth estimation using consecutive frames from monocular and stereo videos. |
J. Wang; G. Zhang; Z. Wu; X. Li; L. Liu; |
499 | Scene Completeness-Aware Lidar Depth Completion for Driving Scenario Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces Scene Completeness-Aware Depth Completion (SCADC) to complete raw lidar scans into dense depth maps with fine and complete scene structures. |
C. -Y. Wu; U. Neumann; |
500 | Semi-Supervised Feature Embedding for Data Sanitization in Real-World Events Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We address the issue of establishing which images represent an event of interest through a semi-supervised learning technique. |
B. Lavi; J. Nascimento; A. Rocha; |
501 | Exposing GAN-Generated Faces Using Inconsistent Corneal Specular Highlights Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we show that GAN synthesized faces can be exposed with the inconsistent corneal specular highlights between two eyes. |
S. Hu; Y. Li; S. Lyu; |
502 | A Features Decoupling Method for Multiple Manipulations Identification in Image Operation Chains Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on identifying the manipulations in an image operation chain composed of multiple manipulations in a certain order. |
J. Chen; X. Liao; W. Wang; Z. Qin; |
503 | Subjective and Objective Evaluation of Deepfake Videos Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, this paper, presents a subjective study, which, using 60 naïve subjects, evaluates how hard it is for humans to see if a video is a deepfake or not. |
P. Korshunov; S. Marcel; |
504 | Forensicability of Deep Neural Network Inference Pipelines Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose methods to infer properties of the execution environment of machine learning pipelines by tracing characteristic numerical deviations in observable outputs. |
A. Schl�gl; T. Kupek; R. B�hme; |
505 | SERN: Stance Extraction and Reasoning Network for Fake News Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem, a novel Stance Extraction and Reasoning Network (SERN) is proposed to extract the stances implied in post-reply pairs implicitly and integrate the stance representations for fake news detection without manually labeling stances, which saves much time and effort. |
J. Xie; S. Liu; R. Liu; Y. Zhang; Y. Zhu; |
506 | An Efficient Paper Anti-Counterfeiting Method Based on Microstructure Orientation Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the efficient microstructure orientation estimation of paper surface for authentication. |
Y. Sun; X. Liao; J. Liu; |
507 | Learning Double-Compression Video Fingerprints Left From Social-Media Platforms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Most of the work performed so far on social media provenance has concentrated on images; in this paper, we propose a CNN architecture that analyzes video content to trace videos back to their social network of origin. |
I. Amerini; A. Anagnostopoulos; L. Maiano; L. R. Celsi; |
508 | Checking PRNU Usability on Modern Devices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we first highlight that wrong source attribution can happen on smartphones belonging to the same brand when images are acquired both in default and in bokeh mode. |
C. Albisani; M. Iuliani; A. Piva; |
509 | Handwritten Digits Reconstruction from Unlabelled Embeddings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate template reconstruction attack of touchscreen biometrics, based on handwritten digits writer verification. |
T. Thebaud; G. Le Lan; A. Larcher; |
510 | Effect of Video Pixel-Binning on Source Attribution of Mixed Media Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the attribution of visual media in the context of matching a video query object to an image fingerprint or vice versa. |
S. Taspinar; M. Mohanty; N. Memon; |
511 | Combining Dynamic Image and Prediction Ensemble for Cross-Domain Face Anti-Spoofing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Hence, we propose a source data-free domain adaptative face anti-spoofing framework to optimize the network in the target domain without using labeled source data via modeling it into a problem of learning with noisy labels. |
L. Lv; et al. |
512 | Label-Guided Dictionary Pair Learning for ECG Biometric Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome this limitation, our method, namely label-guided dictionary pair learning, aims to learn a projective dictionary and reconstructed dictionary jointly, which achieves signal representation and reconstruction simultaneously. |
M. Ma; G. Yang; K. Wang; Y. Huang; Y. Yin; |
513 | Backdoor Attack Against Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we demonstrate that it is possible to inject the hidden backdoor for infecting speaker verification models by poisoning the training data. |
T. Zhai; Y. Li; Z. Zhang; B. Wu; Y. Jiang; S. -T. Xia; |
514 | Class-Conditional Defense GAN Against End-To-End Speech Attacks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose a novel defense approach against end-to-end adversarial attacks developed to fool advanced speech-to-text systems such as DeepSpeech and Lingvo. |
M. Esmaeilpour; P. Cardinal; A. L. Koerich; |
515 | Selfgait: A Spatiotemporal Representation Learning Method for Self-Supervised Gait Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a self-supervised gait recognition method, termed SelfGait, which takes advantage of the massive, diverse, unlabeled gait data as a pre-training process to improve the representation abilities of spatiotemporal backbones. |
Y. Liu; Y. Zeng; J. Pu; H. Shan; P. He; J. Zhang; |
516 | Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A two-step algorithm is proposed to optimize the universal adversarial perturbation to be text-independent and has little effect on the authentication text recognition. |
W. Zhang; et al. |
517 | Highly Efficient Protection of Biometric Face Samples with Selective JPEG2000 Encryption Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we will show that selective encryption of face biometric samples is secure. |
H. Hofbauer; Y. Mart�nez-D�az; S. Kirchgasser; H. M�ndez-V�zquez; A. Uhl; |
518 | Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a deep-learning-based approach for secure finger vein recognition. |
H. O. Shahreza; S. Marcel; |
519 | Topic Sequence Embedding for User Identity Linkage from Heterogeneous Behavior Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a novel Topic Sequence Embedding (TSeqE) method is proposed to embed contextual information of topics to represent users? intrinsic characteristics for identity linkage. |
J. Yang; W. Zhou; W. Qian; J. Han; S. Hu; |
520 | Looking Through Walls: Inferring Scenes from Video-Surveillance Encrypted Traffic Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we show that it is possible to infer visual data by intercepting the encrypted video stream of a surveillance system, and how this may be leveraged to track the movements of a person inside the secured area. |
D. Mari; et al. |
521 | Optimal Attacking Strategy Against Online Reputation Systems with Consideration of The Message-Based Persuasion Phenomenon Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper aims to analyze the optimal attacking strategy, especially when considering the message-based persuasion phenomenon where users? ratings tend to be influenced by earlier ones. |
Z. Chen; H. V. Zhao; |
522 | STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel structure for the multi-generator generative adversarial network (GAN) to address the challenges of detecting adversarial attacks. |
M. Adiban; A. Safari; G. Salvi; |
523 | Application-Layer DDOS Attacks with Multiple Emulation Dictionaries Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider the problem of identifying the members of a botnet under an application-layer (L7) DDoS attack, where a target site is flooded with a large number of requests that emulate legitimate users? patterns. |
M. Cirillo; M. D. Mauro; V. Matta; M. Tambasco; |
524 | Secret Key Generation Over Wireless Channels Using Short Blocklength Multilevel Source Polar Coding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by the state-of-the-art performance provided by polar codes in the short blocklength regime for channel coding, we propose an explicit protocol based on polar codes for generating the secret keys. |
H. Hentil�; Y. Y. Shkel; V. Koivunen; |
525 | Efficient Network Protection Games Against Multiple Types Of Strategic Attackers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper considers network protection games against different types of attackers for a heterogeneous network system with N units. |
Z. Xu; M. Baykal-G�rsoy; |
526 | Detection Of Malicious DNS and Web Servers Using Graph-Based Approaches Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose both unsupervised and semi-supervised defenses based on the available knowledge of the defender. |
J. Jia; Z. Dong; J. Li; J. W. Stokes; |
527 | Low Complexity Secure P-Tensor Product Compressed Sensing Reconstruction Outsourcing and Identity Authentication in Cloud Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Specifically, we propose a low complexity and secure PTP-CSR outsourcing model to protect the signal privacy, and further introduce user authentication and data verification services. |
M. Wang; D. Xiao; J. Liang; |
528 | Privacy-Preserving Near Neighbor Search Via Sparse Coding with Ambiguation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a framework for privacy-preserving approximate near neighbor search via stochastic sparsifying encoding. |
B. Razeghi; S. Ferdowsi; D. Kostadinov; F. P. Calmon; S. Voloshynovskiy; |
529 | Privacy-Preserving Optimal Insulin Dosing Decision Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a privacy-preserving optimal insulin dosing decision in the IoMT system (PIDM) to assist doctors in their decision-making with the patients privacy. |
Z. Ying; et al. |
530 | Privacy-Accuracy Trade-Off of Inference As Service Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a general framework to provide a desirable trade-off between inference accuracy and privacy protection in the inference as service scenario. |
Y. Jin; L. Lai; |
531 | Federated Learning with Local Differential Privacy: Trade-Offs Between Privacy, Utility, and Communication Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider Gaussian mechanisms to preserve local differential privacy (LDP) of user data in the FL model with SGD. |
M. Kim; O. G�nl�; R. F. Schaefer; |
532 | Scalable Privacy-Preserving Distributed Extremely Randomized Trees for Structured Data With Multiple Colluding Parties Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we extend the distributed Extremely Randomized Trees (ERT) approach w.r.t. privacy and scalability. |
A. Aminifar; F. Rabbi; Y. Lamo; |
533 | Active Privacy-Utility Trade-Off Against A Hypothesis Testing Adversary Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We formulate both problems as a Markov decision process (MDP), and numerically solve them by advantage actor-critic (A2C) deep reinforcement learning (RL). |
E. Erdemir; P. L. Dragotti; D. G�nd�z; |
534 | Baitradar: A Multi-Model Clickbait Detection Algorithm Using Deep Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This issue is addressed in this study by proposing an algorithm called BaitRadar, which uses a deep learning technique where six inference models are jointly consulted to make the final classification decision. |
B. Gamage; A. Labib; A. Joomun; C. H. Lim; K. Wong; |
535 | Enabling Efficient and Expressive Spatial Keyword Queries On Encrypted Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a Secure Spatial Keyword Queries (SSKQ) construction supporting expressive query types. |
X. Wang; J. Ma; X. Liu; |
536 | Privacy-Preserving Cloud-Based DNN Inference Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel privacy preserving cloud-based DNN inference framework (PROUD), which greatly improves the computational efficiency. |
S. Xie; B. Liu; Y. Hong; |
537 | Crypto-Oriented Neural Architecture Design Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Differently, we propose to optimize the design of crypto-oriented neural architectures, introducing a novel Partial Activation layer. |
A. Shafran; G. Segev; S. Peleg; Y. Hoshen; |
538 | Integrating Deep Learning with First-Order Logic Programmed Constraints for Zero-Day Phishing Attack Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce the integration method of deep learning and logic programmed domain knowledge to inject the real-world constraints. |
S. -J. Bu; S. -B. Cho; |
539 | Improved Probabilistic Context-Free Grammars for Passwords Using Word Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a word extraction approach for passwords, and further present an improved PCFG model, called WordPCFG. |
H. Cheng; W. Li; P. Wang; K. Liang; |
540 | Enhancing Image Steganography Via Stego Generation And Selection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Unlike most existing steganography methods which are mainly focused on designing embedding cost, in this paper, we propose a new method to enhance existing steganographic methods via stego generation and selection. |
T. Song; M. Liu; W. Luo; P. Zheng; |
541 | Synchronous Multi-Bit Audio Watermarking Based on Phase Shifting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We considered the use of the distribution of the phase spectrum and propose an effective multi-bit audio watermarking method based on phase shifting. |
S. Wang; W. Yuan; Z. Zhang; J. Wang; M. Unoki; |
542 | Image Steganography Based on Iterative Adversarial Perturbations Onto A Synchronized-Directions Sub-Image Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel steganographic scheme to incorporate synchronizing modification directions and iterative adversarial perturbations to enhance steganographic performance. |
X. Qin; S. Tan; W. Tang; B. Li; J. Huang; |
543 | Extending The Reverse JPEG Compatibility Attack to Double Compressed Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we provide mathematical analysis and demonstrate experimentally that this attack can be extended to double compressed images when the first compression quality is 93 or larger and the second quality equal or larger than the first quality. |
J. Butora; J. Fridrich; |
544 | Reversible Data Hiding in Jpeg Images for Privacy Protection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, an improved algorithm is proposed to conceal privacy information in JPEG images. |
Y. Huang; X. Cao; H. -T. Wu; Y. -m. Cheung; |
545 | A Layered Embedding-Based Scheme to Cope with Intra-Frame Distortion Drift In IPM-Based HEVC Steganography Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem, we propose a layered embedding scheme which embeds information into the intra-prediction modes (IPMs) of 4?4 intra-frame prediction units (PUs) in HEVC. |
X. Jia; J. Wang; Y. Liu; X. Kang; Y. -q. Shi; |
546 | Meta-Learning with Attention for Improved Few-Shot Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose meta-learning with attention mechanisms. |
Z. Hou; A. Walid; S. -Y. Kung; |
547 | B-Small: A Bayesian Neural Network Approach to Sparse Model-Agnostic Meta-Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a Bayesian neural network based MAML algorithm, which we refer to as the B-SMALLalgorithm. |
A. Madan; R. Prasad; |
548 | Deep Transform and Metric Learning Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We hence propose a novel Deep DL approach where each DL layer can be formulated and solved as a combination of one linear layer and a Recurrent Neural Network, where the RNN is flexibly regraded as a layer-associated learned metric. |
W. Tang; E. Chouzenoux; J. -C. Pesquet; H. Krim; |
549 | Robustness and Diversity Seeking Data-Free Knowledge Distillation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome this challenge, we propose robustness and diversity seeking data-free KD (RDSKD) in this paper. |
P. Han; J. Park; S. Wang; Y. Liu; |
550 | Ensemble Distillation Approaches for Grammatical Error Correction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper examines the application of both these distillation approaches to a sequence prediction task, grammatical error correction (GEC). |
Y. Fathullah; M. J. F. Gales; A. Malinin; |
551 | Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, for the first time, we show that the frozen classifier is transferable within the same dataset. |
S. Zhang; C. -T. Do; R. Doddipatla; E. Loweimi; P. Bell; S. Renals; |
552 | How Convolutional Neural Networks Deal with Aliasing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The question we aim to answer in this paper is simply: how and to what extent do CNNs counteract aliasing? |
A. H. Ribeiro; T. B. Sch�n; |
553 | Canet: Context-Aware Loss for Descriptor Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel loss function to introduce more context information to facilitate training. |
T. Chen; X. Hu; J. Xiao; G. Zhang; H. Ruan; |
554 | Progressive Multi-Stage Feature Mix for Person Re-Identification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a Progressive Multi-stage feature Mix network (PMM), which enables the model to find out the more precise and diverse features in a progressive manner. |
Y. Zhang; B. He; L. Sun; Q. Li; |
555 | Using Deep Image Priors to Generate Counterfactual Explanations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a novel regularization strategy based on an auxiliary loss estimator jointly trained with the predictor, which efficiently guides the prior to re-cover natural pre-images. |
V. Narayanaswamy; J. J. Thiagarajan; A. Spanias; |
556 | Elliptical Shape Recovery from Blurred Pixels Using Deep Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study the problem of ellipse recovery from blurred shape images. |
H. Zamani; P. Rostami; A. Amini; F. Marvasti; |
557 | Factorized CRF with Batch Normalization Based on The Entire Training Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study we show that it is feasible to calculate the mean and variance using the entire training dataset instead of standard BN for any network node obtained as a linear function of the input features. |
E. Goldman; J. Goldberger; |
558 | Evolutionary Quantization of Neural Networks with Mixed-Precision Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop a novel evolutionary based method to automatically determine the bit-widths of weights and activations in each convolutional layer, namely, Evolutionary Mixed-Precision Quantization (EMQ). |
Z. Liu; X. Zhang; S. Wang; S. Ma; W. Gao; |
559 | Evolving Quantized Neural Networks for Image Classification Using A Multi-Objective Genetic Algorithm Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this problem, this paper proposes a mixed precision quantization method combined with channel expansion of CNNs by using a multi-objective genetic algorithm, called MOGAQNN. |
Y. Wang; X. Wang; X. He; |
560 | Spectral Domain Convolutional Neural Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Although embedding CNN architectures in the spectral do-main is widely exploited to accelerate the training process, we demonstrate that it is also possible to use the spectral domain to reduce the memory footprint, a method we call Spectral Domain Convolutional Neural Network (SpecNet) that performs both the convolution and the activation operations in the spectral domain. |
B. Guan; J. Zhang; W. A. Sethares; R. Kijowski; F. Liu; |
561 | Parametric Spectral Filters for Fast Converging, Scalable Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, we propose the usage of spectral parametric functions to represent massive spectral domain filters with only a few trainable parameters. |
L. Wood; E. C. Larson; |
562 | Feature Reuse for A Randomization Based Neural Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a feature reuse approach for an existing multi-layer randomization based feedforward neural network. |
X. Liang; M. Skoglund; S. Chatterjee; |
563 | A ReLU Dense Layer to Improve The Performance of Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose ReDense as a simple and low complexity way to improve the performance of trained neural networks. |
A. M. Javid; S. Das; M. Skoglund; S. Chatterjee; |
564 | Nested Learning for Multi-Level Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we address this problem in the context of nested learning. |
R. Achddou; J. M. Di Martino; G. Sapiro; |
565 | Cross-Modal Representation Reconstruction for Zero-Shot Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a Cross-Modal Representation Reconstruction (CM-RR) framework to bridge the semantic gap between visual features and semantic attributes, as well as introducing a novel regularizer for automatically feature selection. |
Y. Wang; S. Zhao; |
566 | HIGCNN: Hierarchical Interleaved Group Convolutional Neural Networks for Point Clouds Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an efficient and lightweight neural network for point clouds analysis, named HIGCNN, which can achieve better performance but lower complexity compared to existing methods. |
J. Dang; J. Yang; |
567 | AutoKWS: Keyword Spotting with Differentiable Architecture Search Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to leverage recent advances in differentiable neural architecture search to discover more efficient networks. |
B. Zhang; W. Li; Q. Li; W. Zhuang; X. Chu; Y. Wang; |
568 | Embedding Semantic Hierarchy in Discrete Optimal Transport for Risk Minimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to incorporate the risk-aware inter-class correlation in a discrete optimal transport (DOT) training framework by configuring its ground distance matrix. |
Y. Ge; et al. |
569 | Identifying Spammers to Boost Crowdsourced Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To curb their effect, a novel spectral algorithm for spammer detection that utilizes second-order statistics of annotators, is developed and preliminary results on synthetic and real data showcase the potential of this approach. |
P. A. Traganitis; G. B. Giannakis; |
570 | A Rank-Constrained Clustering Algorithm with Adaptive Embedding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this article, a novel Rank-Constrained clustering algorithm with Adaptive Embedding called RCAE is proposed, where the spectral embedding and the clustering structure are learned simultaneously, so, the influence of noise on performance is greatly reduced. |
S. Pei; F. Nie; R. Wang; X. Li; |
571 | Towards Efficient Age Estimation By Embedding Potential Gender Features Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a simplified deep learning network for age estimation by simultaneously learning aging and potential gender features. |
Y. Deng; L. Fei; S. Teng; W. Zhang; D. Liu; Y. Hou; |
572 | Adversarial Attacks on Coarse-to-Fine Classifiers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we examine the susceptibility of coarse-to-fine hierarchical classifiers to such types of attacks. |
I. R. Alkhouri; G. K. Atia; |
573 | GDTW: A Novel Differentiable DTW Loss for Time Series Tasks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on DTW, we propose a novel loss function for time series data called Gumbel-Softmin based fast DTW (GDTW). |
X. Liu; N. Li; S. -T. Xia; |
574 | Hierarchical Recurrent Neural Network for Handwritten Strokes Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The paper presents an original solution to the online handwritten document processing in a free form, which is aimed at separating multi-class handwritten documents into texts, tables, formulas, drawings, etc. |
I. Degtyarenko; et al. |
575 | Robust Domain-Free Domain Generalization with Class-Aware Alignment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose DomainFree Domain Generalization (DFDG), a model-agnostic method to achieve better generalization performance on the unseen test domain without the need for source domain labels. |
W. Zhang; M. Ragab; R. Sagarna; |
576 | One-Bit Compressed Sensing Using Untrained Network Prior Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address the problem of one-bit compressed sensing using the data-driven deep learning approach. |
S. Kafle; G. Joseph; P. K. Varshney; |
577 | Deep Unfolding Network for Block-Sparse Signal Recovery Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we put forward a block-sparse reconstruction network named Ada-BlockLISTA based on the concept of deep unfolding. |
R. Fu; V. Monardo; T. Huang; Y. Liu; |
578 | REST: Robust LEarned Shrinkage-Thresholding Network Taming Inverse Problems with Model Mismatch Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider compressive sensing problems with model mismatch where one wishes to recover a sparse high-dimensional vector from low-dimensional observations subject to uncertainty in the measurement operator. |
W. Pu; C. Zhou; Y. C. Eldar; M. R. D. Rodrigues; |
579 | Unfolding Neural Networks for Compressive Multichannel Blind Deconvolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a learned-structured unfolding neural network for the problem of compressive sparse multichannel blind-deconvolution. |
B. Tolooshams; S. Mulleti; D. Ba; Y. C. Eldar; |
580 | Sparsity Driven Latent Space Sampling for Generative Prior Based Compressive Sensing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a proximal meta-learning (PML) algorithm to enforce sparsity in the latent-space while training the generator. |
V. Killedar; P. K. Pokala; C. Sekhar Seelamantula; |
581 | A Sparse Coding Approach to Automatic Diet Monitoring with Continuous Glucose Monitors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This article presents an approach to estimate dietary intake automatically by analyzing the post-prandial glucose response (PPGR) of a meal, as measured with continuous glucose monitors. |
A. Das; et al. |
582 | Speeding Up of Kernel-Based Learning for High-Order Tensors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a fast Grassmannian kernel-based method for high-order tensor learning based on the equivalence between the Tucker and the tensortrain decompositions. |
O. Karmouda; J. Boulanger; R. Boyer; |
583 | A Fast Randomized Adaptive CP Decomposition For Streaming Tensors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a fast adaptive algorithm for CAN- DECOMP/PARAFAC decomposition of streaming three-way tensors using randomized sketching techniques. |
L. T. Thanh; K. Abed-Meraim; N. L. Trung; A. Hafiane; |
584 | Rank-Revealing Block-Term Decomposition for Tensor Completion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, BTD is employed for the completion of a tensor from its partially observed entries. |
A. A. Rontogiannis; P. V. Giampouras; E. Kofidis; |
585 | Kernel Learning with Tensor Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we introduce a Tensor Network (TN) approach to learning kernel embeddings, with a TN serving to map the input to a low dimensional manifold, where a suitable base kernel function can be applied. |
K. Konstantinidis; S. Li; D. P. Mandic; |
586 | Fiber-Sampled Stochastic Mirror Descent for Tensor Decomposition with �-Divergence Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a unified stochastic mirror descent framework is developed for large-scale ?-divergence CPD. |
W. Pu; S. Ibrahim; X. Fu; M. Hong; |
587 | Regularized Recovery By Multi-Order Partial Hypergraph Total Variation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we take this divergence into consideration, and propose a multi-order hypergraph Laplacian and the corresponding total variation. |
R. Qu; J. He; H. Feng; C. Xu; B. Hu; |
588 | Learning Discriminative Features for Semi-Supervised Anomaly Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consolidate the model?s discriminative power by introducing a transfer learning scheme to anomaly detection, thereby the model suffers less perturbation caused by pollution. |
Z. Feng; J. Tang; Y. Dou; G. Wu; |
589 | RGLN: Robust Residual Graph Learning Networks Via Similarity-Preserving Mapping on Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a residual graph learning paradigm to infer edge connectivities and weights in graphs, which is cast as distance metric learning under a low-rank assumption and a similarity-preserving regularization. |
J. Tang; X. Gao; W. Hu; |
590 | Sequence-Level Self-Teaching Regularization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we extend the previous approach and propose a sequence self-teaching network to regularize the sequence-level information in speech recognition. |
E. Sun; L. Lu; Z. Meng; Y. Gong; |
591 | Wearing A Mask: Compressed Representations of Variable-Length Sequences Using Recurrent Neural Tangent Kernels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this gap, we extend existing methods that rely on the use of kernels to variable-length sequences via use of the Recurrent Neural Tangent Kernel (RNTK). |
S. Alemohammad; et al. |
592 | H-GPR: A Hybrid Strategy for Large-Scale Gaussian Process Regression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a hybrid strategy called H-GPR to combine these two well-established approaches. |
N. Li; Y. Gao; W. Li; Y. Jiang; S. -T. Xia; |
593 | Learning Optimal Lattice Codes for MIMO Communications Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel reinforcement learning approach to learning lattice codes for MIMO channels. |
L. Amor�s; M. Pitk�nen; |
594 | A Bayesian Interpretation of The Light Gated Recurrent Unit Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We derive a layerwise recurrence without the assumptions of previous work, and show that it leads to a standard recurrence with modest modifications to reflect use of log-probabilities. |
A. Bittar; P. N. Garner; |
595 | A Large-Dimensional Analysis of Symmetric SNE Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work provides first answers by leveraging a large dimensional statistics approach, where the number n and dimension p of the large-dimensional data are of the same magnitude. |
C. S�journ�; R. Couillet; P. Comon; |
596 | A Dynamical Systems Perspective on Online Bayesian Nonparametric Estimators with Adaptive Hyperparameters Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents and analyzes constant step size stochastic gradient algorithms in reproducing kernel Hilbert Space (RKHS), which encapsulates various adaptive nonlinear interpolation schemes. |
A. Koppel; A. S. Bedi; V. Krishnamurthy; |
597 | Online Multi-Hop Information Based Kernel Learning Over Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A multikernel-based approach is developed, which is capable of leveraging global network information, and scales well with network size as well. |
Z. Zong; Y. Shen; |
598 | Sparsity in Max-Plus Algebra and Applications in Multivariate Convex Regression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study concepts of sparsity in the max-plus algebra and apply them to the problem of multivariate convex regression. |
N. Tsilivis; A. Tsiamis; P. Maragos; |
599 | Complex-Valued Vs. Real-Valued Neural Networks for Classification Perspectives: An Example on Non-Circular Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by radar and especially Synthetic Aperture Radar (SAR) applications, we propose a statistical analysis of fully connected feed-forward neural networks performance in the cases where real and imaginary parts of the data are correlated through the non-circular property. |
J. A. Barrachina; C. Ren; C. Morisseau; G. Vieillard; J. . -P. Ovarlez; |
600 | High-Frequency Adversarial Defense for Speech and Audio Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We explore a smoothing approach based on additive noise masking in priority high frequencies. |
R. Olivier; B. Raj; M. Shah; |
601 | Learning Separable Time-Frequency Filterbanks for Audio Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we aim to learn audio representations directly from raw audio, and at the same time mitigate its training burden by employing a light-weight architecture. |
J. Pu; Y. Panagakis; M. Pantic; |
602 | Upsampling Artifacts in Neural Audio Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we address this gap by studying this problem from the audio signal processing perspective. |
J. Pons; S. Pascual; G. Cengarle; J. Serr�; |
603 | Deep Convolutional and Recurrent Networks for Polyphonic Instrument Classification from Monophonic Raw Audio Waveforms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we attempt to recognize musical instruments in polyphonic audio by only feeding their raw waveforms into deep learning models. |
K. Avramidis; A. Kratimenos; C. Garoufis; A. Zlatintsi; P. Maragos; |
604 | Learning Audio Embeddings with User Listening Data for Content-Based Music Recommendation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To combat this problem, we first explore user listening history and demographics to construct a user embedding representing the user’s music preference. |
K. Chen; B. Liang; X. Ma; M. Gu; |
605 | Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a simple yet efficient neural network architecture to exploit both acoustic and lexical information from speech. |
Z. Peng; Y. Lu; S. Pan; Y. Liu; |
606 | Neural Audio Fingerprint for High-Specific Audio Retrieval Based on Contrastive Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we generate a low-dimensional representation from a short unit segment of audio, and couple this fingerprint with a fast maximum inner-product search. |
S. Chang; et al. |
607 | Self-Training and Pre-Training Are Complementary for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we show that pseudo-labeling and pre-training with wav2vec 2.0 are complementary in a variety of labeled data setups. |
Q. Xu; et al. |
608 | Unsupervised Discriminative Learning of Sounds for Audio Event Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: On several audio event classification benchmarks, we show a fast and effective alternative that pre-trains the model unsupervised, only on audio data and yet delivers on-par performance with ImageNet pre-training. |
S. Hornauer; K. Li; S. X. Yu; S. Ghaffarzadegan; L. Ren; |
609 | Similarity Analysis of Self-Supervised Speech Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we aim to provide a comparative study of some of the most representative self-supervised algorithms. |
Y. -A. Chung; Y. Belinkov; J. Glass; |
610 | Joint Masked CPC And CTC Training For ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we demonstrate a single-stage training of ASR models that can utilize both unlabeled and labeled data. |
C. Talnikar; T. Likhomanenko; R. Collobert; G. Synnaeve; |
611 | A Comparison of Discrete Latent Variable Models for Speech Representation Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a comparison of two different approaches which are broadly based on predicting future time-steps or auto-encoding the input signal. |
H. Zhou; A. Baevski; M. Auli; |
612 | Federated Learning from Big Data Over Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper formulates and studies a novel algorithm for federated learning from large collections of local datasets. |
Y. Sarcheshmehpour; M. Leinonen; A. Jung; |
613 | Efficient Client Contribution Evaluation for Horizontal Federated Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper an efficient method is proposed to evaluate the contributions of federated participants. |
J. Zhao; X. Zhu; J. Wang; J. Xiao; |
614 | A Quantitative Metric for Privacy Leakage in Federated Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel method to approximate the mutual information between the high-dimensional gradients and batched input data. |
Y. Liu; X. Zhu; J. Wang; J. Xiao; |
615 | DP-SIGNSGD: When Efficiency Meets Privacy and Robustness Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we bridge this gap by presenting an improved method called DP-SIGNSGD, which can meet all the aforementioned properties. |
L. Lyu; |
616 | Federated Algorithm with Bayesian Approach: Omni-Fedge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider the problem of Federated Learning (FL) under non-i.i.d data setting. |
S. A. Kesanapalli; B. N. Bharath; |
617 | Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a framework by which the degree of non-IID-ness can be varied, consequently illustrating a trade-off between model quality and the computational cost of federated training, which we capture through a novel metric. |
D. Guliani; F. Beaufays; G. Motta; |
618 | Cross-Silo Federated Training in The Cloud with Diversity Scaling and Semi-Supervised Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel aggregation algorithm that takes update diversity into account and significantly outperforms Federated Averaging (FedAvg). |
K. Nandury; A. Mohan; F. Weber; |
619 | Gradual Federated Learning Using Simulated Annealing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we put forth a new update strategy based on the simulated annealing (SA) algorithm, in which the user devices choose their training parameters between the global evaluation model and their local models probabilistically. |
L. T. Nguyen; B. Shim; |
620 | Optimal Importance Sampling for Federated Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We derive optimal importance sampling strategies for both agent and data selection and show that under convexity and Lipschitz assumptions, non-uniform sampling without replacement improves the performance of the original FedAvg algorithm. |
E. Rizk; S. Vlaski; A. H. Sayed; |
621 | Multi-Tier Federated Learning for Vertically Partitioned Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose Tiered Decentralized Coordinate Descent (TDCD), a communication-efficient decentralized training algorithm for such two-tiered networks. |
A. Das; S. Patterson; |
622 | Energy Minimization for Federated Learning with IRS-Assisted Over-the-Air Computation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the deployment of federated learning (FL) over an over-the-air computation (AirComp) and intelligent reflecting surface (IRS) based wireless network. |
Y. Hu; et al. |
623 | Adaptive Quantization of Model Updates for Communication-Efficient Federated Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an adaptive quantization strategy called AdaQuantFL that aims to achieve communication efficiency as well as a low error floor by changing the number of quantization levels during the course of training. |
D. Jhunjhunwala; A. Gadhikar; G. Joshi; Y. C. Eldar; |
624 | HebbNet: A Simplified Hebbian Learning Framework to Do Biologically Plausible Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce a new Hebbian learning based neural network, called HebbNet. |
M. Gupta; A. Ambikapathi; S. Ramasamy; |
625 | T-k-means: A ROBUST AND STABLE K-means VARIANT Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a robust and stable k-means variant, dubbed the t-k-means, as well as its fast version to alleviate those problems. |
Y. Li; Y. Zhang; Q. Tang; W. Huang; Y. Jiang; S. -T. Xia; |
626 | Adaptive Feature Weight Learning For Robust Clustering Problem with Sparse Constraint Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To deal with this situation, we propose a novel weight learning mechanism to adaptively reweight each feature in the data. |
F. Nie; W. Chang; X. Li; J. Xu; G. Li; |
627 | Assisted Learning: Cooperative AI with Autonomy Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a method named ASCII for an agent to improve its classification performance through assistance from other agents, without sharing proprietary data and model information. |
J. Zhou; X. Xian; N. Li; J. Ding; |
628 | Geom-Spider-EM: Faster Variance Reduced Stochastic Expectation Maximization for Nonconvex Finite-Sum Optimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an extension of the Stochastic Path-Integrated Differential EstimatoR EM (SPIDER-EM) and derive complexity bounds for this novel algorithm, designed to solve smooth nonconvex finite-sum optimization problems. |
G. Fort; E. Moulines; H. -T. Wai; |
629 | Learning A Tree of Neural Nets Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In particular, we show that, rather than improving a ResNet by making it deeper, it is better to construct a tree of small ResNets. |
A. Zharmagambetov; M. �. Carreira-Perpi��n; |
630 | Corrupted Contextual Bandits: Online Learning with Corrupted Context Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to address the corrupted-context setting, we propose to combine the standard contextual bandit approach with a classical multi-armed bandit mechanism. |
D. Bouneffouf; |
631 | Training A Bank of Wiener Models with A Novel Quadratic Mutual Information Cost Function Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a novel training methodology to adapt parameters of a bank of Wiener models (BWMs), i.e., a bank of linear filters followed by a static memoryless nonlinearity, using full pdf information of the projected outputs and the desired signal. |
B. Hu; J. C. Principe; |
632 | Information and Regularization in Restricted Boltzmann Machines Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we study mutual information in Restricted Boltzmann Machines (RBM) and its relationship with the different regularization techniques. |
M. Vera; L. R. Vega; P. Piantanida; |
633 | Deep Deterministic Information Bottleneck with Matrix-Based Entropy Functional Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce the matrix-based R?nyi?s a-order entropy functional to parameterize Tishby et al. information bottleneck (IB) principle [1] with a neural network. |
X. Yu; S. Yu; J. C. Pr�ncipe; |
634 | Transitive Transfer Sparse Coding for Distant Domain Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel framework called transitive transfer sparse coding (TTSC) to solve the two distant domains transfer learning problem. |
L. Feng; F. Qian; X. He; Y. Fan; H. Cai; G. Hu; |
635 | Fast Local Representation Learning with Adaptive Anchor Graph Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to solve this issue, we propose a fast local representation learning with adaptive anchor graph to learn local structure information through similarity matrix in anchor-based graph. |
C. Zhang; F. Nie; Z. Wang; R. Wang; X. Li; |
636 | Learning On Heterogeneous Graphs Using High-Order Relations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an approach for learning on heterogeneous graphs without using meta-paths. |
S. H. Lee; F. Ji; W. P. Tay; |
637 | Incomplete Multi-View Subspace Clustering with Low-Rank Tensor Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these issues, we propose a novel Incomplete Multi-view Subspace Clustering with Low-rank Tensor (IMSCLT) method, which could be the first tensor-based incomplete multi-view clustering method to the best of our knowledge. |
J. Liu; S. Teng; W. Zhang; X. Fang; L. Fei; Z. Zhang; |
638 | Channel-Wise Mix-Fusion Deep Neural Networks for Zero-Shot Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a channel-wise mix-fusion ZSL model (CMFZ) to contextualize the ZSL classifier’s discriminative information by incorporating much richer visual semantic information from both objects and their semantic surrounding environments. |
G. Wang; N. Guau; H. Ye; X. Yi; H. Cheng; J. Zhu; |
639 | Online Unsupervised Learning Using Ensemble Gaussian Processes with Random Features Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work develops an efficient online approach based on random features by replacing spatial with spectral subsampling. |
G. V. Karanikolas; Q. Lu; G. B. Giannakis; |
640 | Dimension Selected Subspace Clustering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a new dimension sketching algorithm is proposed, which aims to select informative dimensions that have significant effects on the clustering results. |
S. Li; Y. Luo; J. Chambers; W. Wang; |
641 | Deep Ensemble Siamese Network For Incremental Signal Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a new Deep Ensemble Siamese Network (DESN) is constructed for unknown category detection and incremental accumulation of signals from the detected category. |
C. Yang; S. Yang; |
642 | Non-Recursive Graph Convolutional Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, in this paper, we propose a novel architecture named Non-Recursive Graph Convolutional Network (NRGCN) to improve both the training efficiency and the learning performance of GCNs in the context of node classification. |
H. Chen; Z. Deng; Y. Xu; Z. Li; |
643 | Ego-Based Entropy Measures for Structural Representations on Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose VNEstruct, a simple approach, based on entropy measures of the neighborhood?s topology, for generating low-dimensional structural representations, that is time- efficient and robust to graph perturbations. |
G. Dasoulas; G. Nikolentzos; K. Seaman; A. Virmaux; M. Vazirgiannis; |
644 | Symmetric Sub-graph Spatio-Temporal Graph Convolution and Its Application in Complex Activity Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we analyze hand skeleton-based complex activities by modeling dynamic hand skeletons through a spatiotemporal graph convolutional neural network (ST-GCN). |
P. Das; A. Ortega; |
645 | Progressive Spatio-Temporal Graph Convolutional Network for Skeleton-Based Human Action Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a method to automatically find a compact and problem-specific topology for spatio-temporal graph convolutional networks in a progressive manner. |
N. Heidari; A. Iosifidis; |
646 | Sparse-Coded Dynamic Mode Decomposition on Graph for Prediction of River Water Level Distribution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes a method for estimating dynamics on graph by using dynamic mode decomposition (DMD) and sparse approximation with graph filter banks (GFBs). |
Y. ARAI; S. MURAMATSU; H. YASUDA; K. HAYASAKA; Y. OTAKE; |
647 | Graph Frequency Analysis of COVID-19 Incidence to Identify County-Level Contagion Patterns in The United States Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We thus conduct a graph frequency analysis to inves- tigate the spread patterns of COVID-19 in different US counties. |
Y. Li; G. Mateos; |
648 | Generalized Polytopic Matrix Factorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This article aims to characterize all eligible polytopic sets that are suitable for the PMF framework. |
G. Tatli; A. T. Erdogan; |
649 | Exact Linear Convergence Rate Analysis for Low-Rank Symmetric Matrix Completion Via Gradient Descent Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper performs a local analysis of the exact linear convergence rate of gradient descent for factorization-based symmetric matrix completion. |
T. Vu; R. Raich; |
650 | Structured Support Exploration for Multilayer Sparse Matrix Factorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address two problems with the application of proximal algorithms to sparse matrix factorization. |
Q. -T. Le; R. Gribonval; |
651 | Optimal Selection of Matrix Shape and Decomposition Scheme for Neural Network Compression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We formulate this problem as a mixed-integer optimization over the weights, ranks, and decompositions schemes; and we provide an efficient alternating optimization algorithm involving two simple steps: a step over the weights of the neural network (solved by SGD), and a step over the ranks and decomposition schemes (solved by an SVD). |
Y. Idelbayev; M. �. Carreira-Perpi��n; |
652 | Sparse Graph Based Sketching for Fast Numerical Linear Algebra Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study sketching matrices that are obtained from bipartite graphs that are sparse, i.e., have left degree s that is small. |
D. Hu; S. Ubaru; A. Gittens; K. L. Clarkson; L. Horesh; V. Kalantzis; |
653 | Cold Start Revisited: A Deep Hybrid Recommender with Cold-Warm Item Harmonization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we show that a straightforward application of parametric models may lead to discrepancies between the cold and warm items? distributions in the CF space. |
O. Barkan; R. Hirsch; O. Katz; A. Caciularu; Y. Weill; N. Koenigstein; |
654 | On A Guided Nonnegative Matrix Factorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For this reason, we propose an approach based upon the nonnegative matrix factorization (NMF) model, deemed Guided NMF, that incorporates user-designed seed word supervision. |
J. Vendrow; J. Haddock; E. Rebrova; D. Needell; |
655 | Nonnegative Unimodal Matrix Factorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a new Nonnegative Matrix Factorization (NMF) model called Nonnegative Unimodal Matrix Factorization (NuMF), which adds on top of NMF the unimodal condition on the columns of the basis matrix. |
A. M. S. Ang; N. Gillis; A. Vandaele; H. D. Sterck; |
656 | Kernel Orthogonal Nonnegative Matrix Factorization: Application to Multispectral Document Image Decomposition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a new kernel orthogonal NMF method that does not suffer from the pre-image issue. |
A. Rahiche; M. Cheriet; |
657 | Random Projection Streams for (Weighted) Nonnegative Matrix Factorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we thus investigate an alternative framework to structured random projections?named random projection streams (RPS)?which (i) are based on classical random compression strategies only?and are thus data-independent?and (ii) can benefit from the above fast techniques. |
F. Yahaya; M. Puigt; G. Delmaire; G. Roussel; |
658 | Multivariate Non-Negative Matrix Factorization with Application to Energy Disaggregation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the proposed architecture the baseline Non-negative Matrix Factorization approach is expanded utilizing multi-variate signals. |
P. A. Schirmer; I. Mporas; |
659 | Continuous-Time Self-Attention in Neural Differential Equation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study presents a new continuous-time attention to improve sequential learning where the region of interest in continuous-time state trajectory over observed as well as missing samples is sufficiently attended. |
J. -T. Chien; Y. -H. Chen; |
660 | Blind Deinterleaving of Signals in Time Series with Self-Attention Based Soft Min-Cost Flow Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an end-to-end learning approach to address deinterleaving of patterns in time series, in particular, radar signals. |
O. Can; Y. Z. G�rb�z; B. Yildirim; A. A. Alatan; |
661 | Attention on Attention Sparse Dense Convolutional Network for Financial Signal Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For a better solution, we propose a novel Attention on Attention Sparse Dense Convolutional Network (AoA-SDCN), which strengthens time decay characteristics by adding dense connections at close points. |
T. Zhu; J. Li; X. Liu; Y. Jiang; S. -T. Xia; |
662 | Stock Movement Prediction and Portfolio Management Via Multimodal Learning with Transformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a novel high performing multimodal deep learning architecture(Trans-DiCE) for stock movement prediction utilizing financial indicators and news data. |
D. Daiya; C. Lin; |
663 | A Quaternion-Valued Variational Autoencoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel VAE defined in the quaternion domain, which exploits the properties of quaternion algebra to improve performance while significantly reducing the number of parameters required by the network. |
E. Grassucci; D. Comminiello; A. Uncini; |
664 | Learning A Sparse Generative Non-Parametric Supervised Autoencoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper concerns the supervised generative non parametric autoencoder. |
M. Barlaud; F. Guyard; |
665 | DAG-GAN: Causal Structure Learning with Generative Adversarial Nets Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider DAG structure learning from the perspective of distributional optimization and design an adversarial framework named DAG-GAN to detect the DAG structure from data. |
Y. Gao; L. Shen; S. -T. Xia; |
666 | Relaxed Wasserstein with Applications to GANs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new class of Relaxed Wasserstein (RW) distances by generalizing Wasserstein-1 distance with Bregman cost functions. |
X. Guo; J. Hong; T. Lin; N. Yang; |
667 | Environment-Independent Wi-Fi Human Activity Recognition with Adversarial Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we in this paper present WiHARAN, a Wi-Fi-based activity recognition system that can learn environment-independent features from Channel State Information (CSI) traces. |
Z. Wang; S. Chen; W. Yang; Y. Xu; |
668 | A Robust to Noise Adversarial Recurrent Model for Non-Intrusive Load Monitoring Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose EnerGAN++, an adversarially trained model for robust energy disaggregation. |
M. Kaselimi; A. Voulodimos; N. Doulamis; A. Doulamis; E. Protopapadakis; |
669 | Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We add an activation regularizer and a virtual interpolation method to improve the data generation efficiency. |
X. Qu; J. Wang; J. Xiao; |
670 | Sequential Adversarial Anomaly Detection with Deep Fourier Kernel Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel adversarial detector for the anomalous sequence when there are only one-class training samples. |
S. Zhu; H. S. Yuchi; M. Zhang; Y. Xie; |
671 | Incorporate Maximum Mean Discrepancy in Recurrent Latent Space for Sequential Generative Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we incorporate maximum mean discrepancy in the recurrent structure for distribution regularization. |
Y. Zhang; Y. Wang; Y. Dong; |
672 | FMA-ETA: Estimating Travel Time Entirely Based on FFN with Attention Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem, we propose a novel, brief and effective framework mainly based on feed-forward network (FFN) for ETA, FFN with Multifactor Attention (FMA-ETA). |
Y. Sun; et al. |
673 | A Unified Approach to Translate Classical Bandit Algorithms to Structured Bandits Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel approach to gradually estimate the hidden ? * and use the estimate together with the mean reward functions to substantially reduce exploration of sub-optimal arms. |
S. Gupta; S. Chaudhari; S. Mukherjee; G. Joshi; O. Yagan; |
674 | Near-Optimal Algorithms for Piecewise-Stationary Cascading Bandits Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Considering piecewise-stationary environments, two efficient algorithms, GLRT-CascadeUCB and GLRT-CascadeKL-UCB, are developed. |
L. Wang; H. Zhou; B. Li; L. R. Varshney; Z. Zhao; |
675 | Optimum Feature Ordering for Dynamic Instance�Wise Joint Feature Selection and Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on these properties, we propose a fast algorithm and demonstrate its effectiveness compared to the state?of?the?art using 4 publicly available datasets. |
Y. W. Liyanage; D. Zois; |
676 | POLA: Online Time Series Prediction By Adaptive Learning Rates Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose POLA (Predicting Online by Learning rate Adaptation) to automatically regulate the learning rate of recurrent neural network models to adapt to changing time series patterns across time. |
W. Zhang; |
677 | Singer Identification Using Deep Timbre Feature Learning with KNN-NET Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study the issue of automatic singer identification (SID) in popular music recordings, which aims to recognize who sang a given piece of song. |
X. Zhang; J. Qian; Y. Yu; Y. Sun; W. Li; |
678 | Implicit HRTF Modeling Using Temporal Convolutional Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a data-driven approach to learn HRTFs implicitly with a neural network that achieves state of the art results compared to traditional approaches but relies on a much simpler data capture that can be performed in arbitrary, non-anechoic rooms. |
I. D. Gebru; et al. |
679 | Improving The Classification of Rare Chords With Unlabeled Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we explore techniques to improve performance for rare classes in the task of Automatic Chord Recognition (ACR). |
M. Bortolozzo; R. Schramm; C. R. Jung; |
680 | Loopnet: Musical Loop Synthesis Conditioned on Intuitive Musical Parameters Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Taking such criteria into account, we present LoopNet, a feed-forward generative model for creating loops conditioned on intuitive parameters. |
P. Chandna; A. Ramires; X. Serra; E. G�mez; |
681 | Micaugment: One-Shot Microphone Style Transfer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a method to perform one-shot microphone style transfer. |
Z. Borsos; Y. Li; B. Gfeller; M. Tagliasacchi; |
682 | Wasserstein Barycenter Transport for Acoustic Adaptation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a multi-source domain adaptation algorithm called Wasserstein Barycenter Transport, which transports the source domains to a target domain by creating an intermediate domain using the Wasserstein barycenter. |
E. F. Montesuma; F. -M. Ngol� Mboula; |
683 | Efficient Adversarial Audio Synthesis VIA Progressive Upsampling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel generative model called PUGAN, which progressively synthesizes high-quality audio in a raw waveform. |
Y. Cho; M. Chang; S. Lee; H. Lee; G. J. Kim; J. Choo; |
684 | Multi-Channel Speech Enhancement Using Graph Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a different research direction by viewing each audio channel as a node lying in a non-Euclidean space and, specifically, a graph. |
P. Tzirakis; A. Kumar; J. Donley; |
685 | Multi-Decoder Dprnn: Source Separation for Variable Number of Speakers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an end-to-end trainable approach to single-channel speech separation with unknown number of speakers. |
J. Zhu; R. A. Yeh; M. Hasegawa-Johnson; |
686 | Data-Efficient Framework for Real-World Multiple Sound Source 2d Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose to use adversarial learning methods to close the gap between synthetic and real do-mains. |
G. Le Moing; et al. |
687 | Fusing Information Streams in End-to-End Audio-Visual Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new fusion strategy, incorporating reliability information in a decision fusion net that considers the temporal effects of the attention mechanism. |
W. Yu; S. Zeiler; D. Kolossa; |
688 | Cooperative Scenarios for Multi-Agent Reinforcement Learning in Wireless Edge Caching Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate multi-agent reinforcement learning (MARL), and identify four scenarios for cooperation. |
N. Garg; T. Ratnarajah; |
689 | Robust Deep Reinforcement Learning for Underwater Navigation with Unknown Disturbances Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a robust Proximal Policy Optimization agent and train it using simulations of an underwater medium: this agent shows an excellent performance when facing unknown disturbances, being able to approach the performance of the optimal agent which had an exact knowledge of the underwater disturbance. |
J. Parras; S. Zazo; |
690 | Online Hyper-Parameter Tuning for The Contextual Bandit Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We have presented here two algorithms that uses a bandit to find the optimal exploration of the contextual bandit algorithm, which we hope is the first step toward the automation of the multi-armed bandit algorithm. |
D. Bouneffouf; E. Claeys; |
691 | Double-Linear Thompson Sampling for Context-Attentive Bandits Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we analyze and extend an online learning frame-work known as Context-Attentive Bandit, motivated by various practical applications, from medical diagnosis to dialog systems, where due to observation costs only a small subset of a potentially large number of context variables can be observed at each iteration; however, the agent has a freedom to choose which variables to observe. |
D. Bouneffouf; R. Feraud; S. Upadhyay; Y. Khazaeni; I. Rish; |
692 | On The Marginal Benefit of Active Learning: Does Self-Supervision Eat Its Cake? Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this aim, this paper provides a novel algorithmic framework integrating self-supervised pretraining, active learning, and consistency-regularized self-training. |
Y. -C. Chan; M. Li; S. Oymak; |
693 | Robust Maml: Prioritization Task Buffer with Adaptive Learning Process for Model-Agnostic Meta-Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a more robust MAML based on an adaptive learning scheme and a prioritization task buffer (PTB) referred to as Robust MAML (RMAML) for improving scalability of training process and alleviating the problem of distribution mismatch. |
T. Nguyen; T. Luu; T. Pham; S. Rakhimkul; C. D. Yoo; |
694 | Introducing Deep Reinforcement Learning to Nlu Ranking Tasks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address these issues with a deep reinforcement learning approach that ranks suggestions from multiple experts in an online fashion. |
G. Yu; E. Barut; C. Su; |
695 | Temporal Link Prediction Via Reinforcement Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To deal with these challenges, we present a novel reinforcement learning approach with an update mechanism to integrate temporal information. |
Y. Tao; Y. Li; Z. Wu; |
696 | A Deep Reinforcement Learning Approach To Audio-Based Navigation In A Multi-Speaker Environment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we use deep reinforcement learning to create an autonomous agent that can navigate in a two-dimensional space using only raw auditory sensory information from the environment, a problem that has received very little attention in the reinforcement learning literature. |
P. Giannakopoulos; A. Pikrakis; Y. Cotronis; |
697 | Global-Localized Agent Graph Convolution for Multi-Agent Reinforcement Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we model the global and localized cooperation among agents by global and localized agent graphs and propose a novel graph convolutional reinforcement learning mechanism based on these two graphs which allows each agent to communicate with neighbors and all a-gents to cooperate at the high level. |
Y. Liu; Y. Dou; S. Shen; P. Qiao; |
698 | Gaussian Process Temporal-Difference Learning with Scalability and Worst-Case Performance Guarantees Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The present paper revisits policy evaluation via temporal-difference (TD) learning from the Gaussian process (GP) perspective. |
Q. Lu; G. B. Giannakis; |
699 | Self-Inference Of Others� Policies For Homogeneous Agents In Cooperative Multi-Agent Reinforcement Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose to let each agent infer the others? policies with its own model, given that the agents are homogeneous. |
Q. Lin; Q. Ling; |
700 | Semi-Supervised Batch Active Learning Via Bilevel Optimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel batch acquisition strategy for active learning in the setting where the model training is performed in a semi-supervised manner. |
Z. Borsos; M. Tagliasacchi; A. Krause; |
701 | Kernearl-Based Lifelong Policy Gradient Reinforcement Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a lifelong RL algorithm based on the kernel method to leverage nonlinear features of the data based on a popular union-of-subspace model. |
R. Mowakeaa; S. -J. Kim; D. K. Emge; |
702 | Policy Augmentation: An Exploration Strategy For Faster Convergence of Deep Reinforcement Learning Algorithms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a revolutionary algorithm, called Policy Augmentation, is introduced. |
A. Mahyari; |
703 | Graphcomm: A Graph Neural Network Based Method for Multi-Agent Reinforcement Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose GraphComm, a method makes use of the relation-ships among agents for MARL communication. |
S. Shen; et al. |
704 | In Situ Calibration of Cross-Sensitive Sensors in Mobile Sensor Arrays Using Fast Informed Non-Negative Matrix Factorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we assume a set of mobile geolocalized sensor arrays observing an area over time. |
O. V. thanh; M. Puigt; F. Yahaya; G. Delmaire; G. Roussel; |
705 | Multiphish: Multi-Modal Features Fusion Networks for Phishing Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a features fusion networks (MultiPhish) which is the first study on fusing multi-modal features with neural networks for the phishing detection task. |
L. Zhang; P. Zhang; L. Liu; J. Tan; |
706 | Failure Prediction By Confidence Estimation of Uncertainty-Aware Dirichlet Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, it is first shown that uncertainty-aware deep Dirichlet neural networks provide an improved separation between the confidence of correct and incorrect predictions in the true class probability (TCP) metric. |
T. Tsiligkaridis; |
707 | Two-Stage Framework for Seasonal Time Series Forecasting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a two-stage framework to forecast univariate seasonal time series. |
Q. Xu; Q. Wen; L. Sun; |
708 | Recursive Input and State Estimation: A General Framework for Learning from Time Series With Missing Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a single unifying framework, Recursive Input and State Estimation (RISE), for this general approach and reformulate existing models as specific instances of this framework. |
A. Garc�a-Dur�n; R. West; |
709 | On The Performance-Complexity Tradeoff in Stochastic Greedy Weak Submodular Optimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we study the tradeoff between performance and complexity when one resorts to random sampling strategies to reduce the query complexity of GREEDY. |
A. Hashemi; H. Vikalo; G. de Veciana; |
710 | Semi-Supervised Time Series Classification By Temporal Relation Prediction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a simple and effective method of Semi-supervised Time series classification architecture (termed as SemiTime) by gaining from the structure of unlabeled data in a self-supervised manner. |
H. Fan; F. Zhang; R. Wang; X. Huang; Z. Li; |
711 | Continuous Cnn For Nonuniform Time Series Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose the Continuous CNN (CCNN), which estimates the inherent continuous inputs by interpolation, and performs continuous convolution on the continuous input. |
H. Shi; et al. |
712 | Blend-Res2net: Blended Representation Space By Transformation of Residual Mapping with Restrained Learning for Time Series Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose Blend-Res2Net that blends two different representation spaces: H1(x) = F(x) + Trans(x) and H2(x) = F(Trans(x)) + x with the intention of learning over richer representation by capturing the temporal as well as the spectral signatures (Trans(?) represents the transformation function). |
A. Ukil; A. J. Jara; L. Marin; |
713 | Spatiotemporal Attention for Multivariate Time Series Prediction and Interpretation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel deep learning architecture, called spatiotemporal attention mechanism (STAM) for simultaneous learning of the most important time steps and variables. |
T. Gangopadhyay; S. Y. Tan; Z. Jiang; R. Meng; S. Sarkar; |
714 | Tabular Transformers for Modeling Multivariate Time Series Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we propose neural network models that represent tabular time series that can optionally leverage their hierarchical structure. |
I. Padhi; et al. |
715 | Real-Time Synchronization in Neural Networks for Multivariate Time Series Anomaly Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a mathematical formulation of neural network layers, which generate a synchronized representation from asynchronous multivariate input. |
A. Abdulaal; T. Lancewicki; |
716 | Fast Graph Kernel with Optical Random Features Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to leverage kernel random features within the graphlet framework, and establish a theoretical link with a mean kernel metric. |
H. Ghanem; N. Keriven; N. Tremblay; |
717 | Fast Hierarchy Preserving Graph Embedding Via Subspace Constraints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an inductive method, FastHGE, to learn node representations more efficiently and generalize to new nodes more easily. |
X. Chen; L. Du; M. Chen; Y. Wang; Q. Long; K. Xie; |
718 | Graph Embedding Using Multi-Layer Adjacent Point Merging Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a novel graph embedding method using a multi-layer adjacent point merging model. |
J. Huang; H. Kasai; |
719 | Node Attribute Completion in Knowledge Graphs with Multi-Relational Propagation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our approach, denoted as MRAP, imputes the values of missing attributes by propagating information across the multi-relational structure of a knowledge graph. |
E. Bayram; A. Garc�a-Dur�n; R. West; |
720 | UserReg: A Simple But Strong Model for Rating Prediction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a simple linear model based on Matrix Factorization (MF), called UserReg, which regularizes users’ latent representations with explicit feedback information for rating prediction. |
H. Zhang; I. Ganchev; N. S. Nikolov; M. Stevenson; |
721 | Toward Skills Dialog Orchestration with Online Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we study the task of online posterior dialog orchestration, where we define posterior orchestration as the task of selecting a subset of skills which most appropriately answer a user input using features extracted from both the user input and the individual skills. |
D. Bouneffouf; R. Feraud; S. Upadhyay; M. Agarwal; Y. Khazaeni; I. Rish; |
722 | Adaptive Re-Balancing Network with Gate Mechanism for Long-Tailed Visual Question Answering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, we propose a unified Adaptive Re-balancing Network (ARN) to take care of classification in both head and tail classes, exhaustively improving performance for VQA. |
H. Chen; R. Liu; H. Fang; X. Zhang; |
723 | Co-Capsule Networks Based Knowledge Transfer for Cross-Domain Recommendation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a CDR method with co-capsule networks based knowledge transfer to implement the recommendation for the cold-start users. |
H. Li; L. Yu; Y. Leng; Q. Du; |
724 | Modurec: Recommender Systems with Feature and Time Modulation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these problems, we propose Modurec: an autoencoder-based method that combines all available information using the feature-wise modulation mechanism, which has demonstrated its effectiveness in several fields. |
J. Maroto; C. Vignac; P. Frossard; |
725 | Sig2Sig: Signal Translation Networks to Take The Remains of The Past Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a signal translation networks, Sig2Sig, that converts from the new sensor signals to old ones in order to reuse the past model, which was trained on plenty of old sensor signals. |
S. Kim; H. Lee; J. Han; J. -H. Kim; |
726 | Solving A Class of Non-Convex Min-Max Games Using Adaptive Momentum Methods Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an adaptive momentum min-max algorithm that generalizes adaptive momentum methods to the non-convex min-max regime. |
B. Barazandeh; D. A. Tarzanagh; G. Michailidis; |
727 | Minimizing Weighted Concave Impurity Partition Under Constraints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we formulate and solve a variant of the partition problem called the minimum weighted concave impurity partition under constraint (MIPUC). |
T. Nguyen; T. Nguyen; |
728 | Constant Approximation Algorithm for Minimizing Concave Impurity Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a linear time algorithm with bounded guarantee based on the maximum likelihood principle. |
T. Nguyen; H. Le; T. Nguyen; |
729 | Fusing Multitask Models By Recursive Least Squares Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a transformation-based multi-task fusion method, called transformation fusion(TF), which is implemented by recursive least squares. |
X. Li; L. Shan; W. Wang; |
730 | Centrality Based Number of Cluster Estimation in Graph Clustering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we propose a new algorithm for estimating the number of clusters in a graph using the centrality measure. |
M. Shamsi; S. Beheshti; |
731 | Dependence-Guided Multi-View Clustering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel approach called dependence-guided multi-view clustering (DGMC). |
X. Dong; D. Wu; F. Nie; R. Wang; X. Li; |
732 | Improved Step-Size Schedules for Noisy Gradient Methods Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper shows that the diminishing step-size strategies can indeed be applied for a broad class of noisy gradient methods. |
S. Khirirat; X. Wang; S. Magn�sson; M. Johansson; |
733 | Respipe: Resilient Model-Distributed DNN Training at Edge Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we design ResPipe, a novel resilient model-distributed DNN training mechanism against delayed/failed workers. |
P. Li; E. Koyuncu; H. Seferoglu; |
734 | An Optimal Stochastic Compositional Optimization Method with Applications to Meta Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a new Stochastically Corrected Stochastic Compositional gradient method (SCSC). |
Y. Sun; T. Chen; W. Yin; |
735 | Decentralized Optimization on Time-Varying Directed Graphs Under Communication Constraints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a communication-efficient algorithm for decentralized convex optimization that rely on sparsification of local updates exchanged between neighboring agents in the network. |
Y. Chen; A. Hashemi; H. Vikalo; |
736 | Decentralized Deep Learning Using Momentum-Accelerated Consensus Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this context, we propose and analyze a novel decentralized deep learning algorithm where the agents interact over a fixed communication topology (without a central server). |
A. Balu; Z. Jiang; S. Y. Tan; C. Hedge; Y. M. Lee; S. Sarkar; |
737 | Demystifying Model Averaging for Communication-Efficient Federated Matrix Factorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the federated MF problem and propose a new MA based algorithm, named FedMAvg, by judiciously combining the alternating minimization technique and MA. |
S. Wang; R. C. Suwandi; T. -H. Chang; |
738 | Sample Efficient Subspace-Based Representations for Nonlinear Meta-Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work explores a more general class of nonlinear tasks with applications ranging from binary classification, generalized linear models and neural nets. |
H. I. Gulluk; Y. Sun; S. Oymak; M. Fazel; |
739 | Multi-Task Learning Via Sharing Inexact Low-Rank Subspace Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study the relations among multiple tasks by properly learning their shared common subspace. |
X. Wang; F. Nie; |
740 | On The Adversarial Robustness of Principal Component Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the adversarial robustness of principal component analysis (PCA) algorithms. |
Y. Li; F. Li; L. Lai; J. Wu; |
741 | Fast Manifold Landmarking Using Extreme Eigen-Pairs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we select landmarks to minimize the condition number (? |
F. Wang; G. Cheung; Y. Wang; W. -T. Tan; |
742 | Affine Projection Subspace Tracking Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider the problem of estimating and tracking an R-dimensional subspace with relevant information embedded in an N-dimensional ambient space, given that N>>R. |
M. Vil�; C. A. L�pez; J. Riba; |
743 | A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a hierarchical subspace model for acoustic unit discovery. |
B. Yusuf; L. Ondel; L. Burget; J. Cernock�; M. Sara�lar; |
744 | Independent Vector Analysis Using Semi-Parametric Density Estimation Via Multivariate Entropy Maximization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a new flexible and efficient multivariate PDF estimation technique based on the maximum entropy principle and apply this technique to the development of an effective IVA algorithm that successfully matches multivariate latent sources from a wide range of distributions. |
L. P. Damasceno; C. C. Cavalcante; T. Adali; Z. Boukouvalas; |
745 | ICA with Orthogonality Constraint: Identifiability And A New Efficient Algorithm Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we derive the identifiability conditions, starting from the orthogonal ICA maximum likelihood cost function. |
B. Gabrielson; M. A. B. S. Akhonda; Z. Boukouvalas; S. -J. Kim; T. Adali; |
746 | Blind Extraction of Moving Sources Via Independent Component and Vector Analysis: Examples Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper is devoted to the recently proposed mixing model with constant separating vector (CSV) for Blind Source Extraction of moving sources using the FastDIVA algorithm, which is an extension of the famous FastICA and FastIVA for static mixtures. |
N. Amor; J. Cmejla; V. Kautsk�; Z. Koldovsk�; T. Kounovsk�; |
747 | Single Channel Voice Separation for Unknown Number of Speakers Under Reverberant and Noisy Settings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a unified network for voice separation of an unknown number of speakers. |
S. E. Chazan; L. Wolf; E. Nachmani; Y. Adi; |
748 | Unsupervised Musical Timbre Transfer for Notification Sounds Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a method to transform artificial notification sounds into various musical timbres. |
J. Yang; T. Cinquin; G. S�r�s; |
749 | Visual Privacy Protection Via Mapping Distortion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose the mapping distortion based protection (MDP) and its augmentation-based extension (AugMDP) to protect the data privacy by modifying the original dataset. |
Y. Li; P. Liu; Y. Jiang; S. -T. Xia; |
750 | L-Red: Efficient Post-Training Detection of Imperceptible Backdoor Attacks Without Access to The Training Set Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a Lagrangian-based RED (L-RED) that does not require knowledge of the number of source classes (or whether an attack is present). |
Z. Xiang; D. J. Miller; G. Kesidis; |
751 | Multi-View Contrastive Learning for Online Knowledge Distillation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We there-fore propose Multi-view Contrastive Learning (MCL) for OKD to implicitly capture correlations of feature embeddings encoded by multiple peer networks, which provide various views for understanding the input data instances. |
C. Yang; Z. An; Y. Xu; |
752 | Dynamic Texture Recognition Via Nuclear Distances on Kernelized Scattering Histogram Spaces Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on the conjecture that the most distinctive characteristic of a dynamic texture is the appearance of its individual frames, this work proposes to describe dynamic textures as kernelized spaces of frame-wise feature vectors computed using the Scattering transform. |
A. Sagel; J. W�rmann; H. Shen; |
753 | Clustering A Collection of Networks With Mixtures of L1-Sparse Graphical Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we consider a mixture model of sparse Gaussian graphical models and develop an exact EM algorithm that improves considerably on a previous approximation. |
Z. Yue; V. Solo; |
754 | Taking A Closer Look at Synthesis: Fine-Grained Attribute Analysis for Person Re-Identification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To facilitate development in this field, we reviewed the previously developed synthetic dataset GPR and built an improved one (GPR+) with larger number of identities and distinguished attributes. |
S. Xiang; Y. Fu; G. You; T. Liu; |
755 | Unified Clustering and Outlier Detection on Specialized Hardware Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a novel QUBO formulation of the unified clustering and outlier detection problem and use the Fujitsu Digital Annealer, a specialized CMOS hardware, to solve it. |
E. Cohen; H. Ushijima-Mwesigwa; A. Mandal; A. Roy; |
756 | Class-Imbalanced Classifiers Using Ensembles of Gaussian Processes And Gaussian Process Latent Variable Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, binary classifiers based on Gaussian processes are chosen as bases for inferring the predictive distributions of test latent variables. |
L. Yang; C. Heiselman; J. G. Quirk; P. M. Djuric; |
757 | Improving Deep Learning Sound Events Classifiers Using Gram Matrix Feature-Wise Correlations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new Sound Event Classification (SEC) method which is inspired in recent works for out-of-distribution detection. |
A. J. Neto; A. G. C. Pacheco; D. C. Luvizon; |
758 | Adversarially Robust Classification Based on GLRT Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore, in the setting of classical composite hypothesis testing, a defense strategy based on the generalized likelihood ratio test (GLRT), which jointly estimates the class of interest and the adversarial perturbation. |
B. Puranik; U. Madhow; R. Pedarsani; |
759 | Cross-Corpus Speech Emotion Recognition Using Joint Distribution Adaptive Regression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the research of cross-corpus speech emotion recognition (SER), in which the training and testing speech signals in cross-corpus SER belong to dierent speech corpus. |
J. Zhang; L. Jiang; Y. Zong; W. Zheng; L. Zhao; |
760 | Meta-Cognition-Based Simple And Effective Approach To Object Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore a meta-cognitive learning strategy for object detection to improve generalization ability while at the same time maintaining detection speed. |
S. P. Kumar; C. Gautam; S. Sundaram; |
761 | Graphnet: Graph Clustering with Deep Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel deep graph clustering framework to tackle these two issues. |
X. Zhang; J. Mu; H. Liu; X. Zhang; |
762 | Attention Enhanced Spatial Temporal Neural Network For HRRP Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel Attention Enhanced Convolutional Gated Recurrent Unit network (AC-GRU) for HRRP recognition which improves the representation of the spatial and temporal co-occurrence in the HRRP sequences. |
Y. Chu; Z. Guo; |
763 | DHCN: Deep Hierarchical Context Networks For Image Annotation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce in this paper DHCN: a novel Deep Hierarchical Context Network that leverages different sources of contexts including geometric and semantic relationships. |
M. Jiu; H. Sahbi; |
764 | Online Classification of Dynamic Multilayer-Network Time Series in Riemannian Manifolds Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work exploits Riemannian manifolds to introduce a geometric framework for online state and community classification in dynamic multilayer networks where nodes are annotated with time series. |
C. Ye; K. Slavakis; J. Nakuci; S. F. Muldoon; J. Medaglia; |
765 | On The Power of Deep But Naive Partial Label Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we challenge this view by revealing the hidden power of the oldest and naivest PLL method when it is instantiated with deep neural networks. |
J. Seo; J. S. Huh; |
766 | Advances in Morphological Neural Networks: Training, Pruning and Enforcing Shape Constraints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study an emerging class of neural networks, the Morphological Neural networks, from some modern perspectives. |
N. Dimitriadis; P. Maragos; |
767 | Adversarial Learning Via Probabilistic Proximity Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a model-agnostic defense approach wherein the true class label of the falsified instance is inferred by analyzing its proximity to each class as measured based on class-conditional data distributions. |
J. Hollis; J. Kim; R. Raich; |
768 | Class Aware Robust Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To improve robust accuracy of the important class(es), we are the first to propose a novel adversarial training method with class imbalance taken into account. |
Z. Xia; B. Chen; T. Dai; S. -T. Xia; |
769 | Non-Singular Adversarial Robustness of Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we formalize the notion of non-singular adversarial robustness for neural networks through the lens of joint perturbations to data inputs as well as model weights. |
Y. -L. Tsai; C. -Y. Hsu; C. -M. Yu; P. -Y. Chen; |
770 | Towards Adversarial Robustness Via Compact Feature Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we explore hypothesis that reducing the size of the model?s feature representation while maintaining its generalizability would discard spurious features while retaining perceptually relevant ones. |
M. A. Shah; R. Olivier; B. Raj; |
771 | Adversarial Examples Detection Beyond Image Space Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To detect both few-perturbation attacks and large-perturbation attacks, we propose a method beyond image space by a two-stream architecture, in which the image stream focuses on the pixel artifacts and the gradient stream copes with the confidence artifacts. |
K. Chen; et al. |
772 | Strong Data Augmentation Sanitizes Poisoning and Backdoor Attacks Without An Accuracy Tradeoff Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: However, we find that strong data augmentations, such as mixup and CutMix, can significantly diminish the threat of poisoning and backdoor attacks without trading off performance. |
E. Borgnia; et al. |
773 | Contrastive Predictive Coding Supported Factorized Variational Autoencoder For Unsupervised Learning Of Disentangled Speech Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we address disentanglement of style and content in speech signals. |
J. Ebbers; M. Kuhlmann; T. Cord-Landwehr; R. Haeb-Umbach; |
774 | Contrastive Separative Coding for Self-Supervised Representation Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To extract robust deep representations from long sequential modeling of speech data, we propose a self-supervised learning approach, namely Contrastive Separative Coding (CSC). |
J. Wang; M. W. Y. Lam; D. Su; D. Yu; |
775 | Contrastive Semi-Supervised Learning for ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by the successes of contrastive representation learning for both computer vision and speech applications, and more recently for supervised learning of visual objects [1], we propose Contrastive Semi-supervised Learning (CSL). |
A. Xiao; C. Fuegen; A. Mohamed; |
776 | Contrastive Learning of General-Purpose Audio Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce COLA, a self-supervised pre-training approach for learning a general-purpose representation of audio. |
A. Saeed; D. Grangier; N. Zeghidour; |
777 | SEQ-CPC : Sequential Contrastive Predictive Coding for Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by the contrastive predictive coding (CPC), we propose a feature representation scheme for automatic speech recognition (ASR), which encodes sequential dependency information from raw audio signals. |
Y. Chen; et al. |
778 | On Scaling Contrastive Representations for Low-Resource Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We explore the performance of such systems without fine-tuning by training a state-of-the-art speech recognizer on the fixed representations from the computationally demanding wav2vec 2.0 framework. |
L. Borgholt; T. M. S. Tax; J. D. Havtorn; L. Maal�e; C. Igel; |
779 | Convex Neural Autoregressive Models: Towards Tractable, Expressive, and Theoretically-Backed Models for Sequential Forecasting and Generation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Consequently, we introduce techniques to derive tractable, expressive, and theoretically-interpretable models that are nearly equivalent to neural autoregressive models. |
V. Gupta; B. Bartan; T. Ergen; M. Pilanci; |
780 | Inertial Proximal Deep Learning Alternating Minimization for Efficient Neutral Network Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work develops an improved DLAM by the well-known inertial technique, namely iPDLAM, which predicts a point by linearization of current and last iterates. |
L. Qiao; T. Sun; H. Pan; D. Li; |
781 | Kalman Optimizer for Consistent Gradient Descent Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose Kalman Optimizor (KO), an efficient stochastic optimization algorithm that adopts Kalman filter to make consistent estimation of the local gradient by solving an adaptive filtering problem. |
X. Yang; |
782 | Kalmannet: Data-Driven Kalman Filtering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an offline training method, and numerically illustrate that KalmanNet can achieve optimal performance without full knowledge of the model parameters. |
G. Revach; N. Shlezinger; R. J. G. van Sloun; Y. C. Eldar; |
783 | HCGM-Net: A Deep Unfolding Network for Financial Index Tracking Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on sparse index tracking and employ a Frank-Wolfe-based algorithm which we translate into a deep neural network, a strategy known as deep unfolding. |
R. Pauwels; E. Tsiligianni; N. Deligiannis; |
784 | Augmenting Transferred Representations for Stock Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we show that using transfer learning can help with this task, by pre-training a model to extract universal features on the full universe of stocks of the S&P500 index and then transferring it to another model to directly learn a trading rule. |
E. Fons; P. Dawson; X. -j. Zeng; J. Keane; A. Iosifidis; |
785 | A Framework for Pruning Deep Neural Networks Using Energy-Based Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a framework for pruning DNNs based on a population-based global optimization method. |
H. Salehinejad; S. Valaee; |
786 | Prototype-Based Personalized Pruning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a dynamic personalization method called prototype-based personalized pruning (PPP). |
J. Kim; S. Chang; S. Yun; N. Kwak; |
787 | Tensor Reordering for CNN Compression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We show how parameter redundancy in Convolutional Neural Network (CNN) filters can be effectively reduced by pruning in spectral domain. |
M. Ulicny; V. A. Krylov; R. Dahyot; |
788 | Pruning of Convolutional Neural Networks Using Ising Energy Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an Ising energy model within an optimization framework for pruning convolutional kernels and hidden units. |
H. Salehinejad; S. Valaee; |
789 | Reweighted Dynamic Group Convolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by the previous work, a new reweighted dynamic group convolution (RDGC) structure, including a reweighted pruning module and a survival loss, is proposed in this work for more precise channel pruning. |
W. Chen; C. Wang; Z. Zhang; Z. Huo; L. Gao; |
790 | Layer-Wise Interpretation of Deep Neural Networks Using Identity Initialization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an interpretation method for a deep multilayer perceptron, which is the most general architecture of NNs, based on identity initialization (namely, initialization using identity matrices). |
S. Kubota; H. Hayashi; T. Hayase; S. Uchida; |
791 | Detection of Post-Traumatic Stress Disorder Using Learned Time-Frequency Representations from Pupillometry Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study investigates the utility of pupillometry as a biomarker to detect PTSD in a sample of 39 adults with (n = 22) and without (n = 17) PTSD. |
B. Taha; M. Kirk; P. Ritvo; D. Hatzinakos; |
792 | Subject-Invariant Eeg Representation Learning For Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an adversarial deep domain adaptation approach for emotion recognition from electroencephalogram (EEG) signals. |
S. Rayatdoost; Y. Yin; D. Rudrauf; M. Soleymani; |
793 | Towards Parkinson�s Disease Prognosis Using Self-Supervised Learning and Anomaly Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose to tackle the problem as a semi-supervised anomaly detection task, where we model the physiological patterns of healthy subjects instead. |
H. Jiang; W. Y. Bryan Lim; J. Shyuan Ng; Y. Wang; Y. Chi; C. Miao; |
794 | In-Bed Pressure-Based Pose Estimation Using Image Space Representation Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address this challenge by presenting a novel end-to-end framework capable of accurately locating body parts from vague pressure data. |
V. Davoodnia; S. Ghorbani; A. Etemad; |
795 | Towards The Development of Subject-Independent Inverse Metabolic Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we describe an approach to monitor diet automatically, by analyzing fluctuations in glucose after a meal is consumed. |
S. Sajjadi; et al. |
796 | Human-Expert-Level Brain Tumor Detection Using Deep Learning with Data Distillation And Augmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome these challenges, we propose a new method for training a deep neural network that distills particularly representative training examples and augments the training data by mixing these samples from one class with those from the same and other classes to create additional training samples. |
D. Lu; N. Polomac; I. Gacheva; E. Hattingen; J. Triesch; |
797 | Multimodal Punctuation Prediction with Contextual Dropout Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We first present a transformer-based approach for punctuation prediction that achieves 8% improvement on the IWSLT 2012 TED Task, beating the previous state of the art [1]. We next describe our multimodal model that learns from both text and audio, which achieves 8% improvement over the text-only algorithm on an internal dataset for which we have both the audio and transcriptions. |
A. Silva; B. -J. Theobald; N. Apostoloff; |
798 | Multi-Modal Label Dequantized Gaussian Process Latent Variable Model for Ordinal Label Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents multi-modal label dequantized Gaussian process latent variable model (mLDGP) for ordinal label estimation. |
M. Matsumoto; K. Maeda; N. Saito; T. Ogawa; M. Haseyama; |
799 | Generative Information Fusion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we demonstrate the ability to exploit sensing modalities for mitigating an unrepresented modality or for potentially re-targeting resources. |
K. Tran; W. Sakla; H. Krim; |
800 | Self-Augmented Multi-Modal Feature Embedding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To exploit this fact, we propose the use of self-augmentation and combine it with multi-modal feature embedding. |
S. Matsuo; S. Uchida; B. K. Iwana; |
801 | Optimize What Matters: Training DNN-Hmm Keyword Spotting Model Using End Metric Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We address this loss-metric mismatch with a novel end-to-end training strategy that learns the DNN parameters by optimizing for the detection score. |
A. Shrivastava; A. Kundu; C. Dhir; D. Naik; O. Tuzel; |
802 | Co-Attentional Transformers for Story-Based Video Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel co-attentional transformer model to better capture long-term dependencies seen in visual stories such as dramas and measure its performance on the video question answering task. |
B. Bebensee; B. -T. Zhang; |
803 | Deep Generative Demixing: Error Bounds for Demixing Subgaussian Mixtures of Lipschitz Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we investigate the subgaussian demixing problem for two Lipschitz signals, with GNN demixing as a special case. |
A. Berk; |
804 | Towards An Intrinsic Definition of Robustness for A Classifier Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we point out that averaging the radius of robustness of samples in a validation set is a statistically weak measure. |
T. Giraudon; V. Gripon; M. L�we; F. Vermet; |
805 | Phase Transitions for One-Vs-One and One-Vs-All Linear Separability in Multiclass Gaussian Mixtures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present precise formulae characterizing the phase transitions as a function of the data geometry and the number of classes. |
G. R. Kini; C. Thrampoulidis; |
806 | Leaky Integrator Dynamical Systems and Reachable Sets Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work applies the controllability matrix of control theory to quickly identify a reduced size replacement reservoir. |
B. Whiteaker; P. Gerstoft; |
807 | Benign Overfitting in Binary Classification of Gaussian Mixtures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper studies benign-overfitting for data generated from a popular binary Gaussian mixtures model (GMM) and classifiers trained by support-vector machines (SVM). |
K. Wang; C. Thrampoulidis; |
808 | An Order-Optimal Adaptive Test Plan for Noisy Group Testing Under Unknown Noise Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an adaptive test plan consisting of a hierarchy of biased random walks guided by a local sequential test which together lend adaptivity and agnosticism to the unknown noise model. |
S. Salgia; Q. Zhao; |
809 | SapAugment: Learning A Sample Adaptive Policy for Data Augmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To formalize these intuitions, we propose a novel method to learn a Sample-Adaptive Policy for Augmentation ? SapAugment. |
T. -Y. Hu; et al. |
810 | Hierarchical Coded Elastic Computing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose two hierarchical coded elastic computing schemes that can further speed up the system by exploiting stragglers and effectively allocating tasks among available nodes. |
S. Kiani; T. Adikari; S. C. Draper; |
811 | Synthesize & Learn: Jointly Optimizing Generative and Classifier Networks for Improved Drowsiness Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We focus on the problem of alleviating the class imbalance problem by using generative adversarial networks (GAN) to synthesize examples of sparse classes directly in the feature-space. |
S. Banerjee; A. Joshi; A. Ghoneim; S. Kyal; T. Mishra; |
812 | A Joint Convolutional and Spatial Quad-Directional LSTM Network for Phase Unwrapping Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a novel Convolutional Neural Network (CNN) that incorporates a Spatial Quad-Directional Long Short Term Memory (SQD-LSTM) for phase unwrapping, by formulating it as a regression problem. |
M. V. Perera; A. De Silva; |
813 | Integrated Classification and Localization of Targets Using Bayesian Framework In Automotive Radars Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Compared to state-of-the-art using independent classification and tracking, in this paper, we propose an integrated tracker and classifier leading to a novel Bayesian framework. |
A. Dubey; A. Santra; J. Fuchs; M. L�bke; R. Weigel; F. Lurz; |
814 | A DNN Autoencoder for Automotive Radar Interference Mitigation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a novel interference mitigation approach using an autoencoder in combination with a traditional interference detection filter is introduced. |
S. Chen; J. Taghia; T. Fei; U. K�hnau; N. Pohl; R. Martin; |
815 | DURAS: Deep Unfolded Radar Sensing Using Doppler Focusing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose Deep Unfolded Radar Sensing (DURAS), a model-based deep learning architecture to address this problem. |
P. Goyal; S. Mulleti; A. Gupta; Y. C. Eldar; |
816 | NNAKF: A Neural Network Adapted Kalman Filter for Target Tracking Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the present paper we advocate a novel method to increase Q during maneuvers (and hence the Kalman gains) based on a recurrent neural network (RNN). |
S. Jouaber; S. Bonnabel; S. Velasco-Forero; M. Pilt�; |
817 | Multi-Sample Online Learning for Spiking Neural Networks Based on Generalized Expectation Maximization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: While prior work used single-sample estimators obtained from a single run of the network, this paper proposes to leverage multiple compartments that sample independent spiking signals while sharing synaptic weights. |
H. Jang; O. Simeone; |
818 | Probabilistic Graph Neural Networks for Traffic Signal Control Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents a variational graph learning model TSC-GNN (Traffic Signal Control via probabilistic Graph Neural Networks) to learn the latent representations of agents and generate Q-value while taking traffic uncertainty conditions into account. |
T. Zhong; Z. Xu; F. Zhou; |
819 | Task-Aware Neural Architecture Search Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel framework for neural architecture search, utilizing a dictionary of models of base tasks and the similarity between the target task and the atoms of the dictionary; hence, generating an adaptive search space based on the base models of the dictionary. |
C. P. Le; M. Soltani; R. Ravier; V. Tarokh; |
820 | F-Net: Fusion Neural Network for Vehicle Trajectory Prediction in Autonomous Driving Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, based on recurrent neural networks and convolutional neural networks, we propose a fusion neural network architecture named F-Net to deal with vehicle trajectory prediction on highway and urban scenarios in autonomous driving applications. |
J. Wang; P. Wang; C. Zhang; K. Su; J. Li; |
821 | Unsupervised Reconstruction of Sea Surface Currents from AIS Maritime Traffic Data Using Learnable Variational Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we show that an unsupervised variational learning scheme provides new means to elucidate how AIS data streams can be converted into sea surface currents. |
S. Bena�chouche; C. Le Goff; Y. Guichoux; F. Rousseau; R. Fablet; |
822 | A Compact Joint Distillation Network for Visual Food Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In view of this, this paper proposes a joint distillation framework that targets to achieve a high visual food recognition accuracy using a compact network. |
Z. Heng; K. -H. Yap; A. C. Kot; |
823 | Pipeline Safety Early Warning Method for Distributed Signal Using Bilinear CNN and LightGBM Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we utilized a novel machine learning method based on the spatiotemporal features of distributed optical fiber sensor signals to monitor the safety of oil and gas pipelines in real time. |
Y. Yang; Y. Li; H. Zhang; |
824 | Deep Learning Based Hybrid Precoding in Dual-Band Communication Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a deep learning-based method that uses spatial and temporal information extracted from the sub-6GHz band to predict/track beams in the millimeter-wave (mmWave) band. |
R. Ismayilov; R. L. G. Cavalcante; S. Stanczak; |
825 | Deep Learning-Based Cross-Layer Resource Allocation for Wired Communication Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a cross-layer resource allocation (RA) scheme based on deep learning is introduced for multi-tone multi-user wired communication systems such as, for instance, digital subscriber line (DSL) systems under the current G.fast standard. |
P. Behmandpoor; J. Verdyck; M. Moonen; |
826 | ATVIO: Attention Guided Visual-Inertial Odometry Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the paper, we propose a novel attention guided deep framework for visual-inertial odometry (ATVIO) to improve the performance of VIO. |
L. Liu; G. Li; T. H. Li; |
827 | Feature Integration Via Semi-Supervised Ordinally Multi-Modal Gaussian Process Latent Variable Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a method of feature integration via semi-supervised ordinally multi-modal Gaussian process latent variable model (Semi-OMGP). |
K. Kamikawa; K. Maeda; T. Ogawa; M. Haseyama; |
828 | A Multi-Layer Multi-Channel Attentive Network for Gender and Age Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper a multi-layer multi-channel attentive network based on the idea of divide-and-conquer is proposed. |
J. Chen; H. Yu; Y. Kang; |
829 | Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Whereas standard tests for assessing perceived quality make use of native listeners, we assume that noise-reduced speech and residual noise may affect native and non-native listeners of a target language in different ways. |
B. Naderi; G. Mittag; R. Z. Jim�nez; S. M�ller; |
830 | Melody Harmonization Using Orderless Nade, Chord Balancing, and Blocked Gibbs Sampling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we apply the concept of orderless NADE, which takes the melody and its partially masked chord sequence as the input of the BiLSTM-based networks to learn the masked ground truth, to the training process. |
C. -E. Sun; Y. -W. Chen; H. -S. Lee; Y. -H. Chen; H. -M. Wang; |
831 | Cross-Domain Semi-Supervised Deep Metric Learning for Image Sentiment Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a novel method on image sentiment analysis called cross-domain semi-supervised deep metric learning (CDSS-DML). |
Y. Liang; K. Maeda; T. Ogawa; M. Haseyama; |
832 | Audiovisual Highlight Detection in Videos Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we test the hypothesis that interesting events in unstructured videos are inherently audiovisual. |
K. Mundnich; A. Fenster; A. Khare; S. Sundaram; |
833 | Teacher-Assisted Mini-Batch Sampling for Blind Distillation Using Metric Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The proposed framework introduces metric learning to blind distillation. |
N. Inoue; |
834 | Rule-Embedded Network for Audio-Visual Voice Activity Detection in Live Musical Video Streams Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a rule-embedded network to fuse the audio-visual (A-V) inputs for better detection of the target voice. |
Y. Hou; Y. Deng; B. Zhu; Z. Ma; D. Botteldooren; |
835 | Reinforcement Stacked Learning with Semantic-Associated Attention for Visual Question Answering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, depending on these problems, a semantic-associated attention method and a reinforcement stacked learning mechanism are proposed. |
X. Xiao; C. Zhang; S. Xiang; C. Pan; |
836 | Hierarchical Refined Attention for Scene Text Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel framework named hierarchical refined attention network (HRAN) for STR. |
M. Zhang; M. Ma; P. Wang; |
837 | Collaborative Learning to Generate Audio-Video Jointly Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Towards this, we propose a method that demonstrates that we are able to generate naturalistic samples of video and audio data by the joint correlated generation of audio and video modalities. |
V. K. Kurmi; V. Bajaj; B. N. Patro; K. S. Venkatesh; V. P. Namboodiri; P. Jyothi; |
838 | An Attention-Seq2Seq Model Based on CRNN Encoding for Automatic Labanotation Generation from Motion Capture Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an attention-seq2seq model based on Convolutional Recurrent Neural Network (CRNN). |
M. Li; Z. Miao; X. -P. Zhang; W. Xu; |
839 | Show and Speak: Directly Synthesize Spoken Description of Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a new model, referred to as the show and speak (SAS) model that, for the first time, is able to directly synthesize spoken descriptions of images, bypassing the need for any text or phonemes. |
X. Wang; S. Feng; J. Zhu; M. Hasegawa-Johnson; O. Scharenborg; |
840 | Drawgan: Text to Image Synthesis with Drawing Generative Adversarial Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel drawing generative adversarial networks (DrawGAN) for text-to-image synthesis. |
Z. Zhang; J. Zhou; W. Yu; N. Jiang; |
841 | Disentangling Subject-Dependent/-Independent Representations for 2D Motion Retargeting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel network to separate subject-dependent and -independent motion features and to reconstruct a new skeleton with or without subject-dependent motion features. |
F. Xie; G. Irie; T. Matsubayashi; |
842 | Network and Content-Dependent Bitrate Ladder Estimation for Adaptive Bitrate Video Streaming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a method is presented to estimate bitrate ladders on the basis of both content complexity and network traces. |
P. Lebreton; K. Yamagishi; |
843 | Ultra-Low Bitrate Video Conferencing Using Deep Image Animation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we propose a novel deep learning approach for ultra-low bitrate video compression for video conferencing applications. |
G. Konuko; G. Valenzise; S. Lathuili�re; |
844 | Hierarchical Bit-Wise Differential Coding (HBDC) of Point Cloud Attributes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Targeting both computing and coding efficiencies, we propose in this work a novel hierarchical bit-wise differential coding scheme to compress point cloud attributes. |
Y. Huang; B. Wang; C. . -C. J. Kuo; H. Yuan; J. Peng; |
845 | Learning-Based Lossless Compression of 3D Point Cloud Geometry Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a learning-based, lossless compression method for static point cloud geometry, based on context-adaptive arithmetic coding. |
D. T. Nguyen; M. Quach; G. Valenzise; P. Duhamel; |
846 | Image Coding with Neural Network-Based Colorization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by the excellent results obtained with deep learning-based solutions in the area of automatic colorization, this paper proposes an image coding solution integrating a deep learning-based colorization process to estimate the chrominance components based on the decoded luminance which is regularly encoded with a conventional image coding standard. |
D. Lopes; J. Ascenso; C. Brites; F. Pereira; |
847 | Joint Reinforcement Learning and Game Theory Bitrate Control Method for 360-Degree Dynamic Adaptive Streaming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A joint reinforcement learning (RL) and game theory method is presented for segment-level continuous bitrate selection and tile-level bitrate allocation in tile-based 360-degree streaming to increase users? quality of experience (QoE). |
X. Wei; M. Zhou; S. Kwong; H. Yuan; T. Xiang; |
848 | HCAG: A Hierarchical Context-Aware Graph Attention Model for Depression Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose HCAG, a novel Hierarchical Context-Aware Graph attention model for ADD. |
M. Niu; K. Chen; Q. Chen; L. Yang; |
849 | When Face Recognition Meets Occlusion: A New Benchmark Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we pioneer a simulated occlusion face recognition dataset. |
B. Huang; et al. |
850 | A Triplet Appearance Parsing Network for Person Re-Identification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study develops a Triplet person Appearances Parsing Framework (TAPF) which eliminates the surrounding interference factors of bounding boxes for person re-identification. |
M. Xiong; et al. |
851 | Part-Aligned Network with Background for Misaligned Person Search Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a part-aligned network with background (PANB) to address this misalignment issue. |
X. Zhong; Y. Liu; W. Huang; X. Wang; B. Ma; J. Yuan; |
852 | Learning Pose-Adaptive Lip Sync with Cascaded Temporal Convolutional Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we design a cascaded temporal convolutional network to successively generate mouth shapes and corresponding jawlines based on audio signals and template headposes. |
R. Zheng; B. Song; C. Ji; |
853 | Assessment of Bipolar Disorder Using Heterogeneous Data of Smartphone-Based Digital Phenotyping Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to reduce the cost of social and medical resources, this study collects the user?s data by the App on smartphones, consisting of location data (GPS), self-report scales, daily mood, sleeping time and records of multi-media (text, speech, video) which are heterogeneous digital phenotyping data, to build a database. |
H. -Y. Su; C. -H. Wu; C. -R. Liou; E. C. -L. Lin; P. See Chen; |
854 | Multi-Granularity Feature Interaction and Relation Reasoning for 3D Dense Alignment and Face Reconstruction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a multi-granularity feature interaction and relation reasoning network (MFIRRN) which can recover a detail-rich 3D face and perform more accurate dense alignment in an unconstrained environment. |
L. Li; X. Li; K. Wu; K. Lin; S. Wu; |
855 | Independent Sign Language Recognition with 3d Body, Hands, and Face Reconstruction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we employ SMPL-X, a contemporary parametric model that enables joint extraction of 3D body shape, face and hands information from a single image. |
A. Kratimenos; G. Pavlakos; P. Maragos; |
856 | Multimodal Cross- and Self-Attention Network for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel Multimodal Cross- and Self-Attention Network (MCSAN) to tackle this problem. |
L. Sun; B. Liu; J. Tao; Z. Lian; |
857 | Multi-Target DoA Estimation with An Audio-Visual Fusion Mechanism Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: With this motivation, we propose to use neural networks with audio and visual signals for multi-speaker localization. |
X. Qian; M. Madhavi; Z. Pan; J. Wang; H. Li; |
858 | Improving Multimodal Speech Enhancement By Incorporating Self-Supervised and Curriculum Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a co-attention based framework that incorporates self-supervised and curriculum learning to derive the target speech in noisy environments. |
Y. Cheng; M. He; J. Yu; R. Feng; |
859 | Autoencoder for Vibrotactile Signal Compression Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by the recent success of deep neural network (DNN) based autoencoder, we make the first attempt to apply autoencoder for lossy compression of haptic vibrotactile signals, where a convolutional neural network (CNN) and a rate-distortion (RD) function are used as the transform and cost functions, respectively. |
Z. Li; R. Hassen; Z. Wang; |
860 | Cross-Modal Knowledge Distillation For Fine-Grained One-Shot Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we adopt domain- specific knowledge to fill the gap of insufficient annotated data. |
J. Zhao; X. Lin; Y. Yang; J. Yang; L. He; |
861 | Learning Audio-Visual Correlations From Variational Cross-Modal Generation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a novel Variational AutoEncoder (VAE) framework that consists of Multiple encoders and a Shared decoder (MS-VAE) with an additional Wasserstein distance constraint to tackle the problem. |
Y. Zhu; Y. Wu; H. Latapie; Y. Yang; Y. Yan; |
862 | ECCL: Explicit Correlation-Based Convolution Boundary Locator for Moment Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a new boundary-determining strategy called explicit correlation-based convolution boundary locator (ECCL), which can handle any lengths of videos and moments while leveraging fine-grained matching relationships. |
X. Liu; X. Nie; J. Teng; F. Hao; Y. Yin; |
863 | COOPNet: Multi-Modal Cooperative Gender Prediction in Social Media User Profiling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel text-image cooperation framework (COOPNet), a bridge connection network architecture that exchanges information between texts and images. |
L. Li; K. Hu; Y. Zheng; J. Liu; K. A. Lee; |
864 | Robust Latent Representations Via Cross-Modal Translation and Alignment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this limitation, we aim to improve the testing performance of uni-modal systems using multiple modalities during training only. |
V. Rajan; A. Brutti; A. Cavallaro; |
865 | Semi-Supervised Multimodal Image Translation for Missing Modality Imputation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a semi-supervised algorithm for multimodal learning with missing data, namely Cyclic Autoencoder (CycAE). |
W. Sun; F. Ma; Y. Li; S. -L. Huang; S. Ni; L. Zhang; |
866 | Deep Adversarial Quantization Network for Cross-Modal Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a seamless multimodal binary learning method for cross-modal retrieval. |
Y. Zhou; Y. Feng; M. Zhou; B. Qiang; L. Hou U; J. Zhu; |
867 | Scalable Discriminative Discrete Hashing For Large-Scale Cross-Modal Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a supervised discrete-based cross-modal hashing method, named Scalable Discriminative Discrete Hashing (SDDH), for cross-modal retrieval, where 1) the discrete hash codes are directly obtained by multi-modal features and semantic labels so that the quantization errors are dramatically reduced, and 2) the discrete hash codes simultaneously preserve the heterogeneous similarity and manifold information in the original space by employing matrix factoring with orthogonal and balanced constraints. |
J. Qin; L. Fei; J. Zhu; J. Wen; C. Tian; S. Wu; |
868 | Hierarchical Similarity Learning for Language-Based Product Image Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the cross-modal similarity measurement, and propose a novel Hierarchical Similarity Learning (HSL) network. |
Z. Ma; F. Liu; J. Dong; X. Qu; Y. He; S. Ji; |
869 | Bidirectional Focused Semantic Alignment Attention Network for Cross-Modal Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to further accurately capture the multi-modal semantic information, a bidirectional focused semantic alignment attention network (BFSAAN) is proposed to handle cross-modal retrieval tasks. |
S. Cheng; L. Wang; A. Du; Y. Li; |
870 | Detection of Audio-Video Synchronization Errors Via Event Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a new method and a large-scale database to detect audio-video synchronization(A/V sync) errors in tennis videos. |
J. P. Ebeneze; Y. Wu; H. Wei; S. Sethuraman; Z. Liu; |
871 | FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address the dilemma, a novel proposed corner-aware convolution in which the sampling positions tightly cover the text area is utilized to encode an initial corner prediction into the feature maps, which can be further used to produce a refined corner prediction. |
X. Qin; Y. Zhou; Y. Guo; D. Wu; W. Wang; |
872 | DoA Estimation of A Hidden RF Source Exploiting Simple Backscatter Radio Tags Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work emulates a multi-antenna system using a singleantenna receiver and exploiting the beauty and simplicity of backscatter radio. |
G. Vougioukas; A. Bletsas; |
873 | Probability of Resolution of G-MUSIC: An Asymptotic Approach Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the outlier production mechanism of the G-MUSIC Direction-of-Arrival estimation technique is investigated using tools from Random Matrix Theory. |
D. Schenck; X. Mestre; M. Pesavento; |
874 | A Partially-Relaxed Robust DOA Estimator Under Non-Gaussian Low-Rank Interference and Noise Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel robust DOA estimator from the data collected at the sensor array under the corruption of non-Gaussian interference and noise. |
M. Trinh-Hoang; M. N. El Korso; M. Pesavento; |
875 | Non-Coherent DOA Estimation of Off-Grid Signals With Uniform Circular Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: However, this may not be correct in practice; in order to address this issue, an off-grid model involved with a bias vector is proposed and an efficient two-step method based on this model is developed. |
Z. Wan; W. Liu; |
876 | Enhanced Standard Esprit For Overcoming Imperfections In DOA Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The proposed methods use statistics of the subspace perturbation. |
M. Esfandiari; S. A. Vorobyov; |
877 | Constrained Tensor Decomposition for 2d DOA Estimation In Transmit Beamspace Mimo Radar with Subarrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a constrained tensor decomposition method that enables two dimensional (2D) direction of arrival (DOA) estimation for transmit beamspace (TB) Multiple-Input Multiple-Output (MIMO) radar with subarrays is proposed. |
F. Xu; S. A. Vorobyov; |
878 | Alternating Projections Gridless Covariance-Based Estimation For DOA Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a gridless sparse iterative covariance-based estimation method based on alternating projections for direction-of-arrival (DOA) estimation. |
Y. Park; P. Gerstoft; |
879 | Synthetic Data For Dnn-Based Doa Estimation of Indoor Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the use of different room impulse response (RIR) simulation methods for synthesizing training data for deep neural network-based direction of arrival (DOA) estimation of speech in reverberant rooms.Different sets of synthetic RIRs are obtained using the image source method (ISM) and more advanced methods including diffuse reflections and/or source directivity. |
F. B. Gelderblom; Y. Liu; J. Kvam; T. A. Myrvoll; |
880 | Direction Of Arrival Estimation For Non-Coherent Sub-Arrays Via Joint Sparse And Low-Rank Signal Recovery Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider a more challenging DOA estimation task where the array is composed of non-coherent sub-arrays (i.e., sub-arrays that observe different unknown phase shifts due to using low-cost unsynchronized local oscillators). |
T. Tirer; O. Bialer; |
881 | Sparsity And Nonnegativity Constrained Krylov Approach For Direction Of Arrival Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an approach that imposes two additional constraints to the inverse problem, namely sparsity and nonnegativity of the solution. |
H. Baali; A. Bouzerdoum; A. Khelif; |
882 | Hybrid Analog-Digital MIMO Radar Receivers With Bit-Limited ADCs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we study reduced cost MIMO radar receivers restricted to operate with low resolution ADCs. |
F. Xi; N. Shlezinger; Y. C. Eldar; |
883 | Sparse Array Transceiver Design for Enhanced Adaptive Beamforming in MIMO Radar Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we examine the active sparse array design enabling the maximum signal to interference plus noise ratio (MaxSINR) beamforming at the MIMO radar receiver. |
S. A. Hamza; W. Zhai; X. Wang; M. G. Amin; |
884 | Sparse Parameter Estimation for PMCW MIMO Radar Using Few-Bit ADCs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we consider target parameter estimation of phase-modulated continuous-wave (PMCW) multiple-input multiple-output (MIMO) radars with few-bit analog-to-digital converters (ADCs). |
C. -Y. Wu; J. Li; T. F. Wong; |
885 | Parameter Identifiability Of Spatial-Smoothing-Based Bistatic Mimo Radar Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we are devoted to establishing more accurate conditions by studying the positive definiteness of smoothed target covariance matrix. |
J. Shi; F. Wen; Y. Liu; Q. Shen; Z. Li; Z. Liu; |
886 | Parameter Estimation for Coherent Passive MIMO Radar with Unknown Signals Under Direct Path Influence Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper studies the problem of joint target position and velocity estimation for a coherent passive radar system. |
Z. Wang; Q. He; |
887 | Riemannian Geometric Optimization Methods for Joint Design of Transmit Sequence and Receive Filter of MIMO Radar Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel optimization framework to solve the resultant non-convex problem on a Riemannian product manifold. |
J. Li; G. Liao; Y. Huang; A. Nehorai; |
888 | High Accuracy Tracking of Targets Using Massive MIMO Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an accurate and novel multipath-resilient system to track the targets by leveraging the large number of antennas in massive MIMO systems. |
X. Zeng; F. Zhang; B. Wang; K. J. Ray Liu; |
889 | Admm-Based Fast Algorithm for Robust Multi-Group Multicast Beamforming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We develop an alternating direction method of multipliers (ADMM)based fast algorithm to solve this problem directly with convergence guarantee. |
N. Mohamadi; M. Dong; S. ShahbazPanahi; |
890 | Scalable and Distributed MMSE Algorithms for Uplink Receive Combining in Cell-Free Massive MIMO Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the problem of optimal uplink receive combining is tackled by providing an efficient distributed MMSE algorithm, with a minimal number of exchanged parameters between the APs and the network center. |
R. Van Rompaey; M. Moonen; |
891 | Antenna Selection for Massive MIMO Systems Based on POMDP Framework Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To avoid the computational complexity associated with the value iteration algorithm, we herein propose to use the simple myopic antenna selection policy based on the fact that for any arbitrary number of antennas and RF chains, under the assumption of positively correlated two-state Markov channel model, the myopic policy is optimal. |
S. Sharifi; S. Shahbaz Panahi; M. Dong; |
892 | RIS-Aided Joint Localization and Synchronization with A Single-Antenna Mmwave Receiver Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we show that thanks to the use of a reconfigurable intelligent surface (RIS), joint localization and synchronization is possible with only downlink MISO transmissions. |
A. Fascista; A. Coluccia; H. Wymeersch; G. Seco-Granados; |
893 | Joint Channel, Data, and Phase-Noise Estimation in MIMO-OFDM Systems Using A Tensor Modeling Approach Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a two-stage tensor-based receiver for joint channel, phase-noise (PN), and data estimation in MIMO-OFDM systems. |
B. Sokal; P. R. B. Gomes; A. L. F. de Almeida; M. Haardt; |
894 | Robust Steerable Differential Beamformers with Null Constraints for Concentric Circular Microphone Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we extend our recently developed beamforming method for CMAs to the design of differential beamformers with CCMAs, which takes advantage of the symmetric null constraints from the beampattern. |
X. Wang; G. Huang; I. Cohen; J. Benesty; J. Chen; |
895 | Close-Talking Recording with Planarly Distributed Microphones Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper provides a close-talking recording method with microphones in arbitrary planar distributions based on sound pressure interpolation. |
T. Okamoto; |
896 | (W)Earable Microphone Array and Ultrasonic Echo Localization for Coarse Indoor Environment Mapping Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a microphone array structure for spherical sound incidence angle tracking that can be attached to headphones or directly integrated into earphones. |
F. Pfreundtner; J. Yang; G. S�r�s; |
897 | Characterization of Mems Microphone Sensitivity and Phase Distributions with Applications in Array Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Hence, this work demonstrates a free-field comparison method for measuring these variations in a batch of arrays. |
P. W. A. Wijnings; S. Stuijk; R. Scholte; H. Corporaal; |
898 | Directional Sparse Filtering Using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an algorithm based on the directional sparse filtering (DSF) framework that utilizes the Lehmer mean with learnable weights to adaptively account for source imbalance. |
K. Watcharasupat; A. H. T. Nguyen; C. -H. Ooi; A. W. H. Khong; |
899 | Distributed Speech Separation in Spatially Unconstrained Microphone Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a distributed algorithm that can process spatial information in a spatially unconstrained microphone array. |
N. Furnon; R. Serizel; I. Illina; S. Essid; |
900 | An Adaptive Non-Linear Process for Under-Determined Virtual Microphone Beamforming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose to extend existing virtual microphone signal interpolation by employing an adaptive non-linear (ANL) process for acoustic beamforming. |
M. Bekrani; A. H. T. Nguyen; A. W. H. Khong; |
901 | Window Beamformer for Sparse Concentric Circular Array Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes a practical fixed beamformer for a sparse concentric circular array (CCA) of microphones. |
R. Sharma; I. Cohen; B. Berdugo; |
902 | Single-Point Array Response Control with Minimum Pattern Deviation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a beampattern synthesis scheme based on the single-point array response control with minimum pattern deviation (SPARC-MPD) method. |
X. Ai; L. Gan; |
903 | Focusing-Based Wideband Adaptive Beamforming Using Covariance Matrix Reconstruction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To maximize the overall performance of focusing-based beamformer, this paper presents an adaptive focusing transformation-based beamforming algorithm using covariance matrix reconstruction. |
P. Chen; W. Wang; J. Gao; |
904 | Bayesian Multiple Change-Point Detection of Propagating Events Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we take a Bayesian approach and model the detection of spatial events as a Bayesian multiple change point detection problem. |
T. Halme; E. Nitzan; V. Koivunen; |
905 | One-Bit Autocorrelation Estimation With Non-Zero Thresholds Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an autocorrelation estimator based on a one-bit quantizer with a non-zero threshold. |
C. -L. Liu; Z. -M. Lin; |
906 | A Novel Bayesian Approach for The Two-Dimensional Harmonic Retrieval Problem Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we propose a novel Bayesian strategy to address the two dimensional harmonic retrieval problem, through remodeling and reparameterization of the standard data model. |
R. R. Pote; B. D. Rao; |
907 | On Overfitting in Discrete Super-Resolution Recovery Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper studies the overfitting in discrete super-resolution problem. |
W. Lu; H. Qiao; |
908 | SIML: Sieved Maximum Likelihood for Array Signal Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a Sieved Maximum Likelihood (SiML) method. |
M. Simeoni; P. Hurley; |
909 | Estimation of Groundwater Storage Variations in Indus River Basin Using Grace Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a data processing framework that processes and combines these data-sets to provide an estimate of GWS changes. |
Y. Sattar; Z. Khalid; |
910 | Temporal Exemplar Channels In High-Multipath Environments Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: As a result, in this work, we introduce a machine learning (ML) based exemplar extraction approach on mmWave wireless spatial-channel measurements. |
M. Kashef; P. Vouras; R. Jones; R. Candell; K. A. Remley; |
911 | Multi-Vehicle Velocity Estimation Using IEEE 802.11ad Waveform Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a multi-target velocity estimation technique using IEEE 802.11ad waveform in a vehicle-to-vehicle (V2V) scenario. |
G. Han; S. Kim; J. Choi; |
912 | Real-Time Interaural Time Delay Estimation Via Onset Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a novel real-time estimation method inspired by the precedence effect. |
E. Ren; G. C. Ornelas; H. -A. Loeliger; |
913 | EKFNet: Learning System Noise Statistics from Measurement Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, to reduce the time and manpower spent on fine-tuning an extended Kalman filter (EKF), we propose a new learning framework, EKFNet, for automatically estimating the best process and measurement noise covariance pair from the real measurement data. |
L. Xu; R. Niu; |
914 | Sliding-Capon Based Convolutional Beamspace for Linear Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A new method to design the filter for convolutional beamspace (CBS), called Capon-CBS, is proposed. |
P. -C. Chen; P. P. Vaidyanathan; |
915 | Target Detection from Distributed Passive Sensors: Semi-Labeled Data Quantization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work considers the case where both the intensity data and the label (distance) values are coarsely quantized to decrease communication cost. |
Z. Sutton; P. Willett; S. Marano; |
916 | Sparse Factorization-Based Detection of Off-the-Grid Moving Targets Using FMCW Radars Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the application of continuous sparse signal reconstruction algorithms for the estimation of the ranges and speeds of multiple moving targets using an FMCW radar. |
G. M. de Galland; T. Feuillen; L. Vandendorpe; L. Jacques; |
917 | A Robust Copula Model for Radar-Based Landmine Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a robust copula model for landmine detection based on a likelihood ratio test. |
A. D. Pambudi; F. Ahmad; A. M. Zoubir; |
918 | Radar Clutter Classification Using Expectation-Maximization Method Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the problem of classifying radar clutter returns into statistically homogeneous subsets is addressed. |
S. Han; P. Addabbo; D. Orlando; G. Ricci; |
919 | A Meta-Learning Framework for Few-Shot Classification of Remote Sensing Scene Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we provide a metalearning framework for few-shot classification of RS scene. |
P. Zhang; Y. Bai; D. Wang; B. Bai; Y. Li; |
920 | Differential Convolution Feature Guided Deep Multi-Scale Multiple Instance Learning for Aerial Scene Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a deep multi-scale multiple instance learning (DMSMIL) framework to tackle the above challenges. |
B. Zhou; J. Yi; Q. Bi; |
921 | Generalized Thinned Coprime Array for DOA Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a generalized thinned coprime array by introducing the flexible inter-element spacings, where the conventional one can be seen as a special case. |
J. Shi; Y. Liu; F. Wen; Z. Liu; P. Hu; Z. Gong; |
922 | TCLA Array: A New Sparse Array Design with Less Mutual Coupling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper aims to introduce a new proposed sparse array that takes into account all these considerations. |
A. M. A. Shaalan; J. Du; Y. -H. Tu; |
923 | Low Mutual Coupling Sparse Array Design Using ULA Fitting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a general sparse array (SA) design principle, called uniform linear array (ULA) fitting, is proposed. |
W. Shi; Y. Li; S. A. Vorobyov; |
924 | Low-Rank and Sparse Decomposition for Joint DOA Estimation and Contaminated Sensors Detection with Sparsely Contaminated Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we consider sparsely contaminated arrays in which only a few sensors are contaminated by sensor gain and phase errors, and moreover, the number of contaminated sensors as well as their positions are unknown. |
H. Huang; A. M. Zoubir; |
925 | Fundamental Trade-Offs in Noisy Super-Resolution with Synthetic Apertures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The main contribution of this paper is to rigorously establish that nested arrays provide lower Cram?r-Rao bounds than a ULA (with the same number of sensors) in the low SNR regime, and therefore can lead to better resolvability of closely spaced sources. |
S. Shahsavari; J. Millhiser; P. Pal; |
926 | Enhanced Blind Calibration of Uniform Linear Arrays with One-Bit Quantization By Kullback-Leibler Divergence Covariance Fitting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel solution approach for the blind calibration problem, namely, without using known calibration signals. |
A. Weiss; A. Yeredor; |
927 | Non-Iterative Blind Calibration of Nested Arrays with Asymptotically Optimal Weighting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel blind calibration method for 2-level nested arrays. |
A. Weiss; A. Yeredor; |
928 | Sensor Networks TDOA Self-Calibration: 2D Complexity Analysis and Solutions Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we consider 2D networks with synchronized receivers but unsynchronized transmitters and the corresponding calibration techniques, known as Time-Difference-Of-Arrival (TDOA) techniques. |
L. Ferranti; K. �str�m; M. Oskarsson; J. Boutellier; J. Kannala; |
929 | Fast and Robust Stratified Self-Calibration Using Time-Difference-Of-Arrival Measurements Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we study the problem of estimating receiver and sender positions using time-difference-of-arrival measurements. |
M. Larsson; G. Flood; M. Oskarsson; K. �str�m; |
930 | Stability Analysis of The RC-PLMS Adaptive Beamformer Using A Simple Transfer Function Approximation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a discrete time transfer function approximation for the reduced complexity parallel least mean square (RC-pLMS) adaptive beamforming algorithm. |
G. Akkad; A. Mansour; B. El Hassan; E. Inaty; |
931 | On The Asymptotic Performance of One-Bit Co-Array-Based Music Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, to provide valuable insights into the performance of DoA estimation from one-bit SLA measurements, we derive an asymptotic closed-form expression for the performance of One-Bit Co-Array-Based MUSIC (OBCAB-MUSIC). |
S. Sedighi; B. Shankar; M. Soltanalian; B. Ottersten; |
932 | Kld Minimization-Based Constrained Measurement Filtering For Two-Step TDOA Indoor Tracking Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an enhanced two-step method for tracking an indoor point target using the time difference of arrival (TDOA) measurements from an ultra wideband (UWB) positioning system. |
R. Huang; L. Yang; J. Tao; Y. Xue; |
933 | A Correntropy Based Algorithm for Robust Localization in Wireless Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a correntropy based algorithm is proposed for localization and tracking of a mobile station in wireless networks. |
M. Sedighizad; B. Seyfe; S. Valaee; |
934 | MuG: A Multipath-Exploited and Grid-Free Localisation Method Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a localisation method in which both LoS and NLoS paths are exploited for much more general settings. |
H. Liu; W. Dai; Y. Shen; |
935 | Sparse Bayesian Learning for Acoustic Source Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the CS method of sparse Bayesian learning (SBL) is used to find the DOAs. |
R. Pandey; S. Nannuru; A. Siripuram; |
936 | Automatic Fine-Grained Localization of Utility Pole Landmarks on Distributed Acoustic Sensing Traces Based on Bilinear Resnets Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose two machine learning approaches to automate this procedure for large-scale implementation. |
Y. Lu; Y. Tian; S. Han; E. Cosatto; S. Ozharar; Y. Ding; |
937 | SSLIDE: Sound Source Localization for Indoors Based on Deep Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents SSLIDE, Sound Source Localization for Indoors using DEep learning, which applies deep neural networks (DNNs) with encoder-decoder structure to localize sound sources with random positions in a continuous space. |
Y. Wu; R. Ayyalasomayajula; M. J. Bianco; D. Bharadia; P. Gerstoft; |
938 | Physical-Layer Security Via Distributed Beamforming in The Presence of Adversaries with Unknown Locations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We study the problem of securely communicating a sequence of information bits with a client in the presence of multiple adversaries at unknown locations in the environment. |
Y. Savas; A. Hashemi; A. P. Vinod; B. M. Sadler; U. Topcu; |
939 | Canonical Polyadic Tensor Decomposition With Low-Rank Factor Matrices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a constrained canonical polyadic (CP) tensor decomposition method with low-rank factor matrices. |
A. -H. Phan; P. Tichavsk�; K. Sobolev; K. Sozykin; D. Ermilov; A. Cichocki; |
940 | A Diffusion FXLMS Algorithm for Multi-Channel Active Noise Control and Variable Spatial Smoothing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this problem, a new Diff filtered-x least mean squares (Diff-FxLMS) algorithm that incorporates the knowledge of spatial smoothness is proposed. |
Y. J. Chu; S. C. Chan; C. M. Mak; M. Wu; |
941 | ADAPT-Then-Combine Full Waveform Inversion for Distributed Subsurface Imaging In Seismic Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a distributed implementation of the full waveform inversion (FWI) for distributed imaging of subsurfaces in seismic networks. |
B. -S. Shin; D. Shutin; |
942 | Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to The Spatial Domain Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel audiovisual data fusion framework for speaker localization by assigning individual dynamic stream weights to specific regions in the localization space. |
J. Wissing; et al. |
943 | Towards Robust Training of Multi-Sensor Data Fusion Network Against Adversarial Examples in Semantic Segmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To achieve adversarial robust multi-sensor data fusion networks, we propose here a novel robust training scheme called Multi-Sensor Cumulative Learning (MSCL). |
Y. Yu; H. J. Lee; B. C. Kim; J. U. Kim; Y. M. Ro; |
944 | Data-Driven Adaptive Network Resource Slicing for Multi-Tenant Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel framework for network slicing with the goal of maximizing the expected utilities of tenants in the backhaul and Radio Access Network (RAN), where we reconfigure slices according to the time-varying user traffic and channel states. |
N. Reyhanian; H. Farmanbar; Z. -Q. Luo; |
945 | Distributed Scheduling Using Graph Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome this limitation, we propose a distributed MWIS solver based on graph convolutional networks (GCNs). |
Z. Zhao; G. Verma; C. Rao; A. Swami; S. Segarra; |
946 | Efficient Power Allocation Using Graph Neural Networks and Deep Algorithm Unfolding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We study the problem of optimal power allocation in a single-hop ad hoc wireless network. |
A. Chowdhury; G. Verma; C. Rao; A. Swami; S. Segarra; |
947 | A Sample-Efficient Scheme for Channel Resource Allocation in Networked Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Leveraging symmetry, quasi-convexity, and the method of Kernel density estimation, we propose a data-driven algorithm, which is guaranteed to converge to a globally optimal solution. |
M. M. Vasconcelos; U. Mitra; |
948 | An Efficient Linear Programming Rounding-and-Refinement Algorithm for Large-Scale Network Slicing Problem Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider the network slicing problem which attempts to map multiple customized virtual network requests (also called services) to a common shared network infrastructure and allocate network resources to meet diverse service requirements, and propose an efficient two-stage algorithm for solving this NP-hard problem. |
W. -K. Chen; Y. -F. Liu; Y. -H. Dai; Z. -Q. Luo; |
949 | Efficient Migration to The Next Generation of Networks Based on Digital Annealing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the network migration problem is considered as a set of circuit migration problems in which two technicians simultaneously migrate the two ends of a circuit in order to minimize the total accumulated sites in-service and total technician travels. |
M. Javad-Kalbasi; S. Valaee; |
950 | A Technique for OFDM Symbol Slicing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents an orthonormal transform that splits the Orthogonal Frequency Division Multiplex (OFDM) symbol into slices with ranked rate and decoding complexity. |
A. Perez-Neira; M. A. Lagunas; |
951 | Communication Over Block Fading Channels � An Algorithmic Perspective On Optimal Transmission Schemes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper approaches this issue from a fundamental, algorithmic point of view by studying whether or not it is in principle possible to construct or find such optimal transmission schemes algorithmically (without putting any constraints on the computational complexity of such algorithms). |
H. Boche; R. F. Schaefer; H. Vincent Poor; |
952 | Secure UAV Communications Under Uncertain Eavesdroppers Locations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider the UAV-enabled physical-layer secure communications. |
S. Wang; F. Kong; Q. Li; |
953 | On Strategic Jamming in Distributed Detection Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the optimal jamming strategy by an adversary in distributed detection networks is investigated. |
C. Quan; B. Geng; P. K. Varshney; |
954 | Real Number Signal Processing Can Detect Denial-of-Service Attacks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the general computation framework of Blum-Shub-Smale machines which allows the processing and storage of arbitrary reals. |
H. Boche; R. F. Schaefer; H. Vincent Poor; |
955 | A Hybrid Approach to Coded Compressed Sensing Where Coupling Takes Place Via The Outer Code Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This article seeks to advance coded compressed sensing (CCS) as a practical scheme for unsourced random access. |
J. R. Ebert; V. K. Amalladinne; J. -F. Chamberland; K. R. Narayanan; |
956 | Globally Optimal Beamforming for Rate Splitting Multiple Access Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider globally optimal precoder design for rate splitting multiple access in Gaussian multiple-input single-output downlink channels with respect to weighted sum rate and energy efficiency maximization. |
B. Matthiesen; Y. Mao; P. Popovski; B. Clerckx; |
957 | Beam Focusing for Multi-User MIMO Communications with Dynamic Metasurface Antennas Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study the potential of beam focusing, feasible in near-field operation, for multi-user MIMO systems, where the base station is equipped with a DMA. |
H. Zhang; N. Shlezinger; F. Guidi; D. Dardari; M. F. Imani; Y. C. Eldar; |
958 | Pushing The Limit of Type I Codebook For Fdd Massive Mimo Beamforming: A Channel Covariance Reconstruction Approach Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, in this paper, using Type I codebook, we leverage advances in cutting plane method to optimize the CSI reconstruction at the base station (BS), in order to close the gap between these two codebook based beamforming schemes. |
K. Li; Y. Li; L. Cheng; Q. Shi; Z. -Q. Luo; |
959 | First-Order Fast Algorithm for Structurally Optimal Multi-Group Multicast Beamforming in Large-Scale Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on the optimal multicast beamforming structure, we propose a fast first-order algorithm to obtain the beamforming solution. |
C. Zhang; M. Dong; B. Liang; |
960 | Analog Beamforming With Antenna Selection For Large-Scale Antenna Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a joint design of analog beamforming with antenna selection (AS) or antenna placement (AP) for an analog beamforming system. |
A. Arora; C. G. Tsinos; B. Shankar Mysore R; S. Chatzinotas; B. Ottersten; |
961 | Beamforming for Bidirectional Mimo Full Duplex Under The Joint Sum Power and Per Antenna Power Constraints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel beamforming design to maximize the weighted sum-rate (WSR) with alternating optimization under the limited dynamic range (LDR) noise model. |
C. K. Sheemar; D. Slock; |
962 | Iterative Reweighted Algorithms for Joint User Identification and Channel Estimation in Spatially Correlated Massive MTC Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We develop a computationally efficient alternating direction method of multipliers (ADMM) approach to solve it. |
H. Djelouat; M. Leinonen; M. Juntti; |
963 | Millimeter Wave MIMO Channel Estimation with 1-bit Spatial Sigma-Delta Analog-to-Digital Converters Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, we present a new method for modeling the quantization noise by leveraging the deterministic input-output relation of the 1-bit spatial sigma-delta modulator. |
R. S. P. Sankar; S. Prabhakar Chepuri; |
964 | An Efficient Algorithm For Device Detection And Channel Estimation In Asynchronous IOT Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper targets at two practical issues along this line that have not been addressed before: asynchronous transmission from uncoordinated users and efficient algorithms for real-time implementation in systems with a massive number of devices. |
L. Liu; Y. -F. Liu; |
965 | Kalman Filter Based MIMO CSI Phase Recovery for COTS Wifi Devices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we directly utilize the modeling of the phase distortions caused by the hardware impairments and propose an adaptive CSI estimation approach based on Kalman filter (KF) with maximum-a-posteriori (MAP) estimation that considers the CSI from the previous time. |
C. Li; J. Brauer; A. Sezgin; C. Zenger; |
966 | Improved Atomic Norm Based Channel Estimation for Time-Varying Narrowband Leaked Channels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, improved channel gain delay estimation strategies are investigated when practical pulse shapes with finite block length and transmission bandwidth are employed. |
J. Li; U. Mitra; |
967 | Bayesian Massive MIMO Channel Estimation with Parameter Estimation Using Low-Resolution ADCs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we treat both signals and parameters as random variables and recover them jointly within the AMP framework. |
S. Huang; D. Qiu; T. D. Tran; |
968 | Optimal Detection in The Presence of Non-Gaussian Jamming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider a scenario in which a transmitter sends complex multidimensional symbols to a receiver in the presence of a proactive continuous jammer emitting a zero-mean complex Gaussian signal over an unknown complex Gaussian channel. |
K. A. Almahorg; R. H. Gohary; |
969 | An Efficient Active Set Algorithm for Covariance Based Joint Data and Activity Detection for Massive Random Access with Massive MIMO Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a computationally efficient algorithm to solve the joint data and activity detection problem for massive random access with massive multiple-input multiple-output (MIMO). |
Z. Wang; Z. Chen; Y. -F. Liu; F. Sohrab; W. Yu; |
970 | Neural Layered Min-Sum Decoding for Protograph LDPC Codes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, layered min-sum (MS) iterative decoding is formulated as a customized neural network following the sequential scheduling of check node (CN) updates. |
D. Zhang; et al. |
971 | Integer Carrier Frequency Offset Estimation in OFDM with Zadoff-Chu Sequences Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, we present a Zadoff-Chu sequence-based approach to overcome the ambiguity associated with estimating very high levels of integer Doppler. |
J. D. Roth; D. A. Garren; R. C. Robertson; |
972 | Plug-And-Play Learned Gaussian-mixture Approximate Message Passing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose learned Gaussian-mixture AMP (L-GM-AMP) – a plug-and-play compressed sensing (CS) recovery algorithm suitable for any i.i.d. source prior. |
O. Musa; P. Jung; G. Caire; |
973 | Low-Latency Polar Decoder Using Overlapped SCL Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel scheduling method that reduces the latency of polar decoders significantly. |
D. Kam; B. Y. Kong; Y. Lee; |
974 | Modular Binary Tree Architecture for Distributed Large Intelligent Surface Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a modular architecture that allows combining different LIS panels using a binary tree. |
J. V. Alegr�a; F. Rusek; J. R. S�nchez; O. Edfors; |
975 | Stochastic Successive Weighted Sum-Rate Maximization for Multiuser MIMO Systems with Finite-Alphabet Inputs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by the stochastic successive upper-bound minimization (SSUM) method [1], this paper proposes a stochastic successive inexact lower-bound maximization (SSILM) algorithm for the WSRM problem with finite-alphabet inputs. |
X. Guan; X. Zhao; Q. Shi; |
976 | Rate 1 Quasi Orthogonal Universal Transmission and Combining for MIMO Systems Achieving Full Diversity Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work addresses general multiple-input multiple-output systems and develops combined diversity transmission and combining schemes that achieve rate one and full diversity with reduced decoding complexity, while being universal in the sense that the operations performed at both transmission ends are channel independent. |
B. Avraham; U. Erez; E. Domanovitz; |
977 | Energy Efficiency Optimization Technique for SWIPT-Enabled Multi-Group Multicasting Systems with Heterogeneous Users Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: An algorithm based on Dinkelback method, slack-variable replacement, and second-order conic programming (SOCP)/semi-definite relaxation (SDR) is proposed to obtain a suitable solution for the above-mentioned fractional-objective dependent non-convex problem. |
S. Gautam; S. Chatzinotas; B. Ottersten; |
978 | Multi-Branch Tomlinson-Harashima Precoding for Rate Splitting Based Systems with Multiple Antennas Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a multi-branch (MB) scheme for a RS-based multiple-antenna system, which creates patterns to order the transmitted symbols and enhances the overall sum rate performance compared to existing approaches. |
A. R. Flores; R. C. de Lamare; B. Clerckx; |
979 | Divide and Conquer: One-bit MIMO-OFDM Detection By Inexact Expectation Maximization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we study the expectation-maximization (EM) approach for one-bit MIMO-OFDM detection. |
M. Shao; W. -K. Ma; |
980 | Differential Chaos Shift Keying-Based Wireless Power Transfer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we investigate differential chaos shift keying (DCSK), a communication-based waveform, in the context of wireless power transfer (WPT). |
P. Mukherjee; C. Psomas; I. Krikidis; |
981 | VGAI: End-to-End Learning of Vision-Based Decentralized Controllers for Robot Swarms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to learn decentralized controllers based solely on raw visual inputs. |
T. -K. Hu; F. Gama; T. Chen; Z. Wang; A. Ribeiro; B. M. Sadler; |
982 | Recognition of Dynamic Hand Gesture Based on Mm-Wave Fmcw Radar Micro-Doppler Signatures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a convolutional neural network (CNN) for dynamic HGR based on a millimeter-wave Frequency Modulated Continuous Wave (FMCW) radar which operates at 77GHz. |
W. Jiang; Y. Ren; Y. Liu; Z. Wang; X. Wang; |
983 | Dynamic Resource Optimization for Adaptive Federated Learning at The Wireless Network Edge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The aim of this paper is to propose a novel dynamic resource allocation strategy for energy-efficient federated learning at the wireless network edge, with latency and learning performance guarantees. |
P. D. Lorenzo; C. Battiloro; M. Merluzzi; S. Barbarossa; |
984 | Deep Weighted MMSE Downlink Beamforming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: With the success of deep unfolding in trading off complexity and performance, we propose to apply deep unfolding to the WMMSE algorithm. |
L. Pellaco; M. Bengtsson; J. Jald�n; |
985 | Deep Generative Model Learning For Blind Spectrum Cartography with NMF-Based Radio Map Disaggregation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our method integrates the favorable traits of model and data-driven approaches, which substantially ?shrinks? the state space. |
S. Shrestha; X. Fu; M. Hong; |
986 | Mitigating Clipping Distortion in OFDM Using Deep Residual Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel technique, using residual neural networks and soft clipping, to deterministically limit the peak amplitude of the signal, thus lowering its PAPR and circumventing PA distortion. |
M. S. Omar; X. Ma; |
987 | A Low-Complexity Admm-Based Massive Mimo Detectors Via Deep Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a deep neural network (DNN)-based massive MIMO detection method which can overcome the above limitation. |
I. N. Tiba; Q. Zhang; J. Jiang; Y. Wang; |
988 | Real-Time Radio Modulation Classification With An LSTM Auto-Encoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a learning framework based on an LSTM denoising autoencoder designed to extract robust and stable features from the noisy received signals, and detect the underlying modulation scheme. |
Z. Ke; H. Vikalo; |
989 | Deep Active Learning Approach to Adaptive Beamforming for MmWave Initial Alignment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a deep learning approach to the adaptive and sequential beamforming design problem for the initial access phase in a mmWave environment with a single-path channel model. |
F. Sohrabi; Z. Chen; W. Yu; |
990 | Learning to Continuously Optimize Wireless Resource in Episodically Dynamic Environment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a continual learning (CL) framework for wireless systems, which can incrementally adapt the learning models to the new episodes, without forgetting models learned from the previous episodes. |
H. Sun; W. Pu; M. Zhu; X. Fu; T. -H. Chang; M. Hong; |
991 | Adaptive Contention Window Design Using Deep Q-Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: More precisely, our goal is to design an intelligent node that can dynamically adapt its minimum CW (MCW) parameter to maximize a network-level utility knowing neither the MCWs of other nodes nor how these change over time. |
A. Kumar; G. Verma; C. Rao; A. Swami; S. Segarra; |
992 | On Information Asymmetry in Online Reinforcement Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we study the system of two interacting non-cooperative Q-learning agents, where one agent has the privilege of observing the other’s actions. |
E. Tampubolon; H. Ceribasi; H. Boche; |
993 | Jamming Strategy Generation for Hidden Communication Modes Via Graph Convolution Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we focus on the less studied hidden mode jamming problem. |
F. Kong; Q. Li; H. Shao; |
994 | Contrastive Self-Supervised Learning for Wireless Power Control Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new approach for power control in wireless networks using self-supervised learning. |
N. Naderializadeh; |
995 | Measure-Transformed Covariance Test for Robust Spectrum Sensing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop a new robust spectrum sensing method for MIMO cognitive radios in the presence of heavy-tailed noise. |
Y. Sorek; K. Todros; |
996 | Searching for Anomalies with Multiple Plays Under Delay and Switching Costs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We develop a policy, dubbed consecutive controlled sensing (CCS), to achieve this goal. |
T. Lambez; K. Cohen; |
997 | Robust Estimation of High-order Phase Dynamics Using Variational Bayes Inference Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new nonlinear phase estimator to obtain more robust tracks. |
F. Fabozzi; S. Bidon; S. Roche; |
998 | Robust PCA Through Maximum Correntropy Power Iterations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we introduce a robust formulation of PCA based on the maximum correntropy criterion (MCC). |
J. P. Chereau; B. Scalzo; D. P. Mandic; |
999 | Score-Based Change Detection For Gradient-Based Learning Machines Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a generic score-based change detection method that can detect a change in any number of components of a machine learning model trained via empirical risk minimization. |
L. Liu; J. Salmon; Z. Harchaoui; |
1000 | Super-Resolution Of Periodic Signals From Short Sequences Of Samples Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel algorithm which does not require the signal to be bandlimited and it can cope with additive noise in the samples. |
M. W. Rupniewski; |
1001 | Quickest Change Detection With Time Inconsistent Anticipatory Agents In Cyber-Physical Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We show that the interaction between anticipatory agents and sequential quickest detection results in unusual (nonconvex) structure of the quickest change detection policy. |
V. Krishnamurthy; |
1002 | Treatment Effect Estimation Using Invariant Risk Minimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a new way to estimate the ITE using the domain generalization framework of invariant risk minimization (IRM). |
A. Shah; K. Ahuja; K. Shanmugam; D. Wei; K. R. Varshney; A. Dhurandhar; |
1003 | An F-Test for Polynomial Frequency Modulation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a semi-parametric multitaper F -test for the detection of line components which have been modulated by a polynomial of a given degree. |
K. Blanchette; W. Burr; G. Takahara; |
1004 | Quickest Joint Detection and Classification of Faults in Statistically Periodic Processes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: An algorithm is proposed to detect and classify a change in the distribution of a stochastic process that has periodic statistical behavior. |
T. Banerjee; S. Padhy; A. Taha; E. John; |
1005 | An Asymptotically Pointwise Optimal Procedure For Sequential Joint Detection And Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve the optimal stopping problem, we propose an asymptotically pointwise optimal (APO) stopping rule, i.e., a stopping rule that is optimal when the tolerated detection and estimation errors tend to zero. |
D. Reinhard; M. Fau�; A. M. Zoubir; |
1006 | Locally Optimal Detection of Stochastic Targeted Universal Adversarial Perturbations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we derive the locally optimal generalized likelihood ratio test based detector for detecting stochastic targeted universal adversarial perturbations to a classifier?s input. |
A. Goel; P. Moulin; |
1007 | A Decentralized Variance-Reduced Method for Stochastic Optimization Over Directed Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a decentralized first-order stochastic optimization method Push-SAGA for finite-sum minimization over a strongly connected directed graph. |
M. I. Qureshi; R. Xin; S. Kar; U. A. Khan; |
1008 | On Distributed Composite Tests with Dependent Observations in WSN Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: As the sensors observe statistically spatial dependent samples, which makes difficult the implementation of fully distributed detection procedures, we propose a simpler algorithm for making a decision about the true hypothesis. |
J. A. Maya; L. Rey Vega; |
1009 | Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper considers the policy evaluation problem in reinforcement learning with agents of a decentralized and directed network. |
Z. Wu; H. Shen; T. Chen; Q. Ling; |
1010 | On The Effect of Spatial Correlation on Distributed Energy Detection of A Stochastic Process Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: As those parameters can be easily estimated at each sensor node, we propose a modified generalized likelihood ratio test (GLRT). |
J. A. Maya; L. Rey Vega; |
1011 | Provably Fast Asynchronous And Distributed Algorithms For Pagerank Centrality Computation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We study asynchronous and distributed algorithms which are operated by aggregating information from local neighbors iteratively. |
Y. He; H. -T. Wai; |
1012 | Decentralized Optimization Over Noisy, Rate-Constrained Networks: How We Agree By Talking About How We Disagree Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We develop a novel algorithm for this scenario: Decentralized Lazy Mirror Descent with Differential Exchanges (DLMD-DiffEx), which guarantees convergence of the local estimates to the optimal solution. |
R. Saha; S. Rini; M. Rao; A. Goldsmith; |
1013 | A Multiple Access Channel Game Using Latency Metric Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The paper considers a multi-access channel scenario, where several users communicate with a base station, and investigates power allocation is a game-theoretic framework. |
A. Garnaev; A. Petropulu; W. Trappe; |
1014 | Linear Computation Coding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For linear functions, we present an algorithm to reduce the computational cost of multiplying an arbitrary given matrix with an unknown vector. |
R. R. M�ller; B. G�de; A. Bereyhi; |
1015 | Spectral Folding And Two-Channel Filter-Banks On Arbitrary Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we extend this theory to arbitrary graphs and positive semi-definite variation operators. |
E. Pavez; B. Girault; A. Ortega; P. A. Chou; |
1016 | Sparse Time-Frequency Representation Via Atomic Norm Minimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a method of estimating a sparse T-F representation using atomic norm. |
T. Kusano; K. Yatabe; Y. Oikawa; |
1017 | Message Transmission Over Rapidly Time-Varying Channels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a new method on transmitting data over linear time-variant (LTV) channels when no channel state estimation (CSI) is accessible. |
A. Kaplan; V. Pohl; |
1018 | A Robust and Efficient Multi-Scale Seasonal-Trend Decomposition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a general and efficient multi-scale seasonal-trend decomposition algorithm for time series with multiple seasonality. |
L. Yang; Q. Wen; B. Yang; L. Sun; |
1019 | Noise-Assisted Multivariate Variational Mode Decomposition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work combines MVMD with the noise injection paradigm to propose an efficient alternative for both VMD and MVMD, i.e., the noise-assisted MVMD (NA-MVMD), that aims at relaxing the requirement of presetting K, as well as improving the quality of the resulting decomposition. |
C. A. Zisou; G. K. Apostolidis; L. J. Hadjileontiadis; |
1020 | Approximate Weighted C R Coded Matrix Multiplication Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To enable distributed low rank approximation, we generalize the approximate matrix CR-multiplication to accommodate weighted block sampling, and we introduce a weighted coded matrix multiplication method. |
N. Charalambides; M. Pilanci; A. O. Hero; |
1021 | Periodic Signal Denoising: An Analysis-Synthesis Framework Based on Ramanujan Filter Banks and Dictionaries Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes to use a hybrid analysis-synthesis framework for denoising discrete-time periodic signals. |
P. Kulkarni; P. P. Vaidyanathan; |
1022 | Compressive Signal Recovery Under Sensing Matrix Errors Combined With Unknown Measurement Gains Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an alternating minimisation algorithm for on-the-fly signal recovery in the case when errors (a) and (b) occur jointly. |
J. Vora; A. Rajwade; |
1023 | Grid Optimization for Matrix-Based Source Localization Under Inhomogeneous Sensor Topology Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper finds that the localization performance degrades when the spatial pattern of the sensors is highly non-uniform, and the uniform grid formation is only a suboptimal solution. |
H. Sun; J. Chen; |
1024 | MSR-GAN: Multi-Segment Reconstruction Via Adversarial Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Thus, we propose to approximate it using Gumbel-Softmax reparametrization trick. |
M. Zehni; Z. Zhao; |
1025 | Count Sketch with Zero Checking: Efficient Recovery of Heavy Components Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we carefully analyze count sketch and illustrate that its default recovery method, namely median filtering, has a distinct error pattern of reporting false positives. |
G. Zhou; Z. Tian; |
1026 | Numerical Solution of Stochastic Differential Equations in Stiefel Manifolds Via Tangent Space Parametrization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we achieve this by extending the so-called ‘tangent space parameterization? (TaSP) for ODEs to SDEs. |
Z. Wang; V. Solo; |
1027 | On The Accuracy Limit of Joint Time-Delay/Doppler/Acceleration Estimation with A Band-Limited Signal Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Considering a generic band-limited signal formulation, in this contribution we derive a new closed-form Cram?r-Rao bound (CRB) expression for joint time-delay/Doppler/acceleration estimation. |
H. McPhee; L. Ortega; J. Vil�-Valls; E. Chaumette; |
1028 | Automatic Order Selection in Autoregressive Modeling with Application in EEG Sleep-Stage Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the order selection problem for autoregressive models from a new perspective. |
F. Nassif; S. Beheshti; |
1029 | New Variants of DFA Based on Loess and Lowess Methods: Generalization of The Detrending Moving Average Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we recall the main principles of some of these methods, provide explanations on the behaviours of the algorithms and analyze the relevance of new variants based on the Savitzky-Golay filter, also known as the LOESS approach, and the LOWESS. |
B. Berthelot; E. Grivel; P. Legrand; |
1030 | Parameter Estimation for Student�s T VAR Model with Missing Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an algorithmic framework to estimate the parameters of a VAR model with heavy-tailed Student?s t distributed innovations from incomplete data based on the stochastic approximation expectation maximization (SAEM) algorithm coupled with a Markov Chain Monte Carlo (MCMC) procedure. |
R. Zhou; J. Liu; S. Kumar; D. P. Palomar; |
1031 | Fast and Robust ADMM for Blind Super-Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To do so, we adapt an operator splitting approach ADMM and combine it with a novel preconditioning scheme. |
Y. Ran; W. Dai; |
1032 | Nonstationary Portfolios: Diversification in The Spectral Domain Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we reformulate the portfolio optimization problem in the spectral domain to cater for the nonstationarity inherent to asset price movements and, in this way, allow for optimal capital allocations to be time-varying. |
B. Scalzo; A. Arroyo; L. Stankovic; D. P. Mandic; |
1033 | A Tyler-Type Estimator of Location and Scatter Leveraging Riemannian Optimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a stable algorithm based on Riemannian optimization for this problem. |
A. Collas; F. Bouchard; A. Breloy; C. Ren; G. Ginolhac; J. . -P. Ovarlez; |
1034 | Statistical Properties of A Modified Welch Method That Uses Sample Percentiles Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present and analyze an alternative, more robust approach to the Welch?s overlapped segment averaging (WOSA) spectral estimator. |
F. Schwock; S. Abadi; |
1035 | Switched Hawkes Processes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider a switched Hawkes process which can be used to model systems in which the parameters of the process dynamically change depending on some (known) external state. |
N. Nadagouda; M. A. Davenport; |
1036 | An Adaptive Regularization Approach to Portfolio Optimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we address the regularization issue from a different perspective. |
T. Ballal; A. S. Abdelrahman; A. H. Muqaibel; T. Y. Al-Naffouri; |
1037 | Active Estimation From Multimodal Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The paper considers the problem of estimating a covariate parameter shared by multiple statistical models. |
A. Mukherjee; A. Tajer; P. -Y. Chen; P. Das; |
1038 | Network Classifiers Based on Social Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes a new way of combining independently trained classifiers over space and time. |
V. Bordignon; S. Vlaski; V. Matta; A. H. Sayed; |
1039 | Bayes-Optimal Methods for Finding The Source of A Cascade Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We study the problem of estimating the source of a network cascade. |
A. Sridhar; H. V. Poor; |
1040 | Private Wireless Federated Learning with Anonymous Over-the-Air Computation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the wireless FL scenario, we show that the privacy of the system can be boosted by exploiting over-the-air computation (OAC) and anonymizing the transmitting devices. |
B. Hasircioglu; D. G�nd�z; |
1041 | Scalable Multilevel Quantization for Distributed Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A scalable algorithm is derived for multilevel quantization of sensor observations in distributed sensor networks, which consist of a number of sensors transmitting a summary information of their observations to the fusion center for a final decision. |
G. G�l; M. Ba�ler; |
1042 | Stability of Algebraic Neural Networks to Small Perturbations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we study stability of AlgNNs on the framework of algebraic signal processing. |
A. Parada-Mayorga; A. Ribeiro; |
1043 | Resolution Limits of 20 Questions Search Strategies for Moving Targets Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We are interested in the minimal resolution of any non-adaptive searching procedure with a finite number of queries and derive approximations to this optimal resolution via the second-order asymptotic analysis. |
L. Zhou; A. Hero; |
1044 | Gramian-Based Adaptive Combination Policies for Diffusion Learning Over Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an adaptive combination strategy for distributed learning over diffusion networks. |
Y. E. Erginbas; S. Vlaski; A. H. Sayed; |
1045 | Graph-Adaptive Incremental Learning Using An Ensemble of Gaussian Process Experts Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To allow for uncertainty quantification, which is of utmost importance in safety-critical applications, this work tackles the SSL task in a Gaussian process (GP) based Bayesian framework to propagate the distribution of nonparametric function estimates. |
K. D. Polyzos; Q. Lu; G. B. Giannakis; |
1046 | Fast Decentralized Linear Functions Via Successive Graph Shift Operators Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In contrast, this paper develops a decentralized method to compute linear transformations in a small number of iterations. |
S. Mollaebrahim; D. Romero; B. Beferull-Lozano; |
1047 | Online Learning of Time-Varying Signals and Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The aim of this paper is to propose a method for online learning of time-varying graphs from noisy observations of smooth graph signals collected over the vertices. |
S. Sardellitti; S. Barbarossa; P. Di Lorenzo; |
1048 | Kernel Regression on Graphs in Random Fourier Features Space Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes an efficient batch-based implementation for kernel regression on graphs (KRG) using random Fourier features (RFF) and a low-complexity online implementation. |
V. R. M. Elias; V. C. Gogineni; W. A. Martins; S. Werner; |
1049 | Graph-Homomorphic Perturbations for Private Decentralized Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an alternative scheme, which constructs perturbations according to a particular nullspace condition, allowing them to be invisible (to first order in the step-size) to the network centroid, while preserving privacy guarantees. |
S. Vlaski; A. H. Sayed; |
1050 | Variance-Constrained Learning for Stochastic Graph Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an alternating primal-dual learning algorithm that updates the primal variable (SGNN parameters) with gradient descent and the dual variable with gradient ascent. |
Z. Gao; E. Isufi; A. Ribeiro; |
1051 | Graph Neural Network for Large-Scale Network Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we adopt GNN for a classic but challenging nonlinear regression problem, namely the network localization. |
W. Yan; D. Jin; Z. Lin; F. Yin; |
1052 | Graphon and Graph Neural Network Stability Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We analyze GNN stability using kernel objects called graphons. |
L. Ruiz; Z. Wang; A. Ribeiro; |
1053 | Graph Neural Networks for Decentralized Controllers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a framework using graph neural networks (GNNs) to learn decentralized controllers from data. |
F. Gama; E. Tolstaya; A. Ribeiro; |
1054 | Nonlinear State-Space Generalizations of Graph Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we approach GCNNs from a state-space perspective revealing that the graph convolutional module is a minimalistic linear state-space model, in which the state update matrix is the graph shift operator. |
L. Ruiz; F. Gama; A. Ribeiro; E. Isufi; |
1055 | Wide and Deep Graph Neural Networks with Distributed Online Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes the Wide and Deep GNN (WD-GNN), a novel architecture that can be easily updated with distributed online learning mechanisms. |
Z. Gao; A. Ribeiro; F. Gama; |
1056 | Design of Graph Signal Sampling Matrices for Arbitrary Signal Subspaces Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a design method of sampling matrices for graph signals that guarantees perfect recovery for arbitrary graph signal subspaces. |
J. Hara; K. Yamada; S. Ono; Y. Tanaka; |
1057 | Graph Signal Denoising Using Nested-Structured Deep Algorithm Unrolling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a deep algorithm unrolling (DAU) based on a variant of the alternating direction method of multiplier (ADMM) called Plug-and-Play ADMM (PnP-ADMM) for denoising of signals on graphs. |
M. Nagahama; K. Yamada; Y. Tanaka; S. H. Chan; Y. C. Eldar; |
1058 | Identifying First-Order Lowpass Graph Signals Using Perron Frobenius Theorem Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our aim is to determine if the graph filter generating the graph signals is first-order lowpass without knowing the graph topology. |
Y. He; H. -T. Wai; |
1059 | Graph Signal Denoising Via Unrolling Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an interpretable graph neural network framework to denoise single or multiple noisy graph signals. |
S. Chen; Y. C. Eldar; |
1060 | Adaptive Subsampling of Multidomain Signals with Product Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an adaptive subsampling method for multidomain signals based on the constrained learning of a product graph. |
T. Gnassounou; P. Humbert; L. Oudre; |
1061 | Robust Graph-Filter Identification with Graph Denoising Regularization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Tailored to those setups, this paper presents a robust formulation for the problem of graph-filter identification from input-output observations. |
S. Rey; A. G. Marques; |
1062 | Fast and Provable Robust PCA VIA Normalized Coherence Pursuit Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We study the Innovation Values computed by the Innovation Search algorithm under a quadratic cost function and it is proved that Innovation Values with the new cost function are equivalent to Leverage Scores. |
M. Rahmani; P. Li; |
1063 | Aligning Sets of Temporal Signals with Riemannian Geometry and Koopman Operator Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider the problem of aligning data sets of short temporal signals without any a-priori known correspondence. |
O. Rahamim; R. Talmon; |
1064 | Weight Identification Through Global Optimization in A New Hysteretic Neural Network Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this contribution, we propose to consider new nonlinear activation functions, whose outputs depend both from the current and past inputs through a hysteresis effect. |
E. Leroy; A. Marmin; M. Castella; L. Duval; |
1065 | Multiview Variational Graph Autoencoders for Canonical Correlation Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel multiview canonical correlation analysis model based on a variational approach. |
Y. Kaloga; P. Borgnat; S. P. Chepuri; P. Abry; A. Habrard; |
1066 | Cognitive Memory Constrained Human Decision Making Based on Multi-source Information Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The objective of this work is to study how humans make decisions based on internal and external sources of information under cognitive memory limitations. |
B. Geng; Q. Chen; P. K. Varshney; |
1067 | Binary Control and Digital-to-Analog Conversion Using Composite NUV Priors and Iterative Gaussian Message Passing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The paper proposes a new method to determine a binary control signal for an analog linear system such that the state, or some output, of the system follows a given target trajectory. |
R. Keusch; H. Malmberg; H. -A. Loeliger; |
1068 | Outlier-Robust Kernel Hierarchical-Optimization RLS on A Budget with Affine Constraints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a non-parametric learning framework to combat outliers in online, multi-output, and nonlinear regression tasks. |
K. Slavakis; M. Yukawa; |
1069 | Adaptive Real-Time Filter for Partially-Observed Boolean Dynamical Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this manuscript, we present a real-time joint state and parameter estimation framework for POBDS. |
M. Imani; S. F. Ghoreishi; |
1070 | Improving The Energy-Efficiency of A Kalman Filter Using Unreliable Memories Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The degradation in estimation performance caused by the noise in the memory is theoretically investigated. Updated equations are then developed for the Kalman filter, taking into account the new source of noise from the unreliable memory. |
J. Kern; E. Dupraz; A. A�ssa-El-Bey; F. Leduc-Primeau; |
1071 | Parallel Iterated Extended and Sigma-Point Kalman Smoothers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a set of parallel formulas that replace the existing sequential ones in order to achieve lower time (span) complexity. |
F. Yaghoobi; A. Corenflos; S. Hassan; S. S�rkk�; |
1072 | Wiener Filter on Meet/Join Lattices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we extend DLSP with Wiener filtering for denoising and demonstrate it on two prototypical applications. |
B. Seifert; C. Wendler; M. P�schel; |
1073 | Learning Bollob�s-Riordan Graphs Under Partial Observability Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we discover that, over first-order vector autoregressive systems with a stable Laplacian combination matrix, graph learning is achievable under partial observability, when the network topology is drawn according to a popular preferential attachment model known as the Bollob?s-Riordan model. |
M. Cirillo; V. Matta; A. H. Sayed; |
1074 | Learning Sparse Graph Laplacian with K Eigenvector Prior Via Iterative Glasso and Projection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, given an empirical covariance matrix ${\mathbf{\bar C}}$ computed from data as input, we consider an eigen-structural assumption on the graph Laplacian matrix L: the first K eigenvectors of L are pre-selected, e.g., based on domain-specific criteria, and the remaining eigenvectors are then learned from data. |
S. Bagheri; G. Cheung; A. Ortega; F. Wang; |
1075 | Learning Mixed Membership from Adjacency Graph Via Systematic Edge Query: Identifiability and Algorithm Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A lightweight scalable algorithm is proposed, and its performance characterizations are presented. |
S. Ibrahim; X. Fu; |
1076 | Convergence Analysis of The Graph-Topology-Inference Kernel LMS Algorithm Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a convergence analysis of the graph-topology-inference KLMS algorithm. |
M. Moscu; R. Borsoi; C. Richard; |
1077 | An Efficient Alternating Direction Method for Graph Learning from Smooth Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we cast the graph learning formulation as a nonsmooth, strictly convex optimization problem and develop an efficient alternating direction method of multipliers to solve it. |
X. Wang; C. Yao; H. Lei; A. M. -C. So; |
1078 | Topological Volterra Filters Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We apply TVFs to inverse filtering and recommender systems. |
G. Leus; M. Yang; M. Coutino; E. Isufi; |
1079 | Network Topology Inference with Graphon Spectral Penalties Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present the cases where the graphon model is assumed to be known and the more practical setting where the relevant features of the model are inferred from auxiliary network observations. |
T. M. Roddenberry; M. Navarro; S. Segarra; |
1080 | Network Topology Change-Point Detection from Graph Signals with Prior Spectral Signatures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider the problem of sequential graph topology change-point detection from graph signals. |
C. Kaushik; T. M. Roddenberry; S. Segarra; |
1081 | Online Time-Varying Topology Identification Via Prediction-Correction Algorithms Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address the problem of dynamic topology identification by building on recent results from time-varying optimization, devising a general-purpose online algorithm operating in non-stationary environments. |
A. Natali; M. Coutino; E. Isufi; G. Leus; |
1082 | Graph Learning Under Spectral Sparsity Constraints Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new signal processing based inference model and a new learning criterion that allow for wideband frequency variation in the data and derive an algorithm for graph inference. |
B. Subbareddy; A. Siripuram; J. Zhang; |
1083 | A Graph Learning Algorithm Based On Gaussian Markov Random Fields And Minimax Concave Penalty Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a graph learning framework to produce sparse and accurate graphs from network data. |
T. Koyakumaru; M. Yukawa; E. Pavez; A. Ortega; |
1084 | Figlearn: Filter and Graph Learning Using Optimal Transport Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We hence introduce a novel graph signal processing framework for jointly learning the graph and its generating filter from signal observations. |
M. Minder; Z. Farsijani; D. Shah; M. E. Gheche; P. Frossard; |
1085 | Construction of Unit-Norm Tight Frame Based Preconditioner for Sparse Coding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the problem of constructing suitable preconditioner to improve the performance of the SR system. |
H. Bai; C. Hong; X. Li; |
1086 | Sparse High-Order Portfolios Via Proximal Dca And Sca Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study the cardinality constrained mean-variance-skewness-kurtosis (MVSKC) model for sparse high-order portfolio optimization. |
J. Wang; Z. Deng; T. Zheng; A. M. -C. So; |
1087 | A Convex Penalty for Block-Sparse Signals with Unknown Structures Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel convex penalty for block-sparse signals whose block partitions are unknown a priori. |
H. Kuroda; D. Kitahara; A. Hirabayashi; |
1088 | Event-Driven Modulo Sampling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a cascade model comprising a modulo non-linearity in series with an integrate-and-fire (IF) event-driven encoder. |
D. Florescu; F. Krahmer; A. Bhandari; |
1089 | No Relaxation: Guaranteed Recovery of Finite-Valued Signals from Undersampled Measurements Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We develop a new computationally efficient decoding algorithm that can operate at the optimal downsampling factor under mild conditions on the filter. |
P. Sarangi; P. Pal; |
1090 | Error Estimates in Second-Order Continuous-Time Sigma-Delta Modulators Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a general framework to study this error: under regularity assumptions on the input and the filtering kernel, we prove for a second-order CT-? |
D. Surroop; P. Combes; P. Martin; |
1091 | Banraw: Band-Limited Radar Waveform Design Via Phase Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a uniqueness result which states that a band- limited signal can be recovered from at least 3B measurements where B is the bandwidth from the radar ambiguity function (AF). |
S. Pinilla; K. V. Mishr; B. M. Sadler; H. Arguello; |
1092 | Sub-NYQUIST Multichannel Blind Deconvolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present necessary and sufficient conditions for the unique identification. |
S. Mulleti; K. Lee; Y. C. Eldar; |
1093 | Modified Arcsine Law for One-Bit Sampled Stationary Signals with Time-Varying Thresholds Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For the first time in the literature, this paper introduces an approach to extending the arcsine law to the case where one-bit ADCs apply time-varying thresholds. |
A. Eamaz; F. Yeganegi; M. Soltanalian; |
1094 | Near-Optimal Resampling in Particle Filters Using The Ising Energy Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Instead of utilizing resampling procedures that rely on asymptotic convergence properties, we show that intelligently selecting and replicating a set of samples can better represent the posterior approximation and improve the overall performance of the PF. |
M. T. Rahman; M. Javad-Kalbasi; S. Valaee; |
1095 | Time-Domain Concentration and Approximation of Computable Bandlimited Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we provide a different definition that uses the time-domain concentration of the signals. |
H. Boche; U. J. M�nich; |
1096 | Guaranteed Reconstruction from Integrate-and-Fire Neurons with Alpha Synaptic Activation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In contrast to state-of-the-art methods we propose an IF-TEM where we employ a biologically inspired and smooth sampling kernel, the alpha synaptic function, and show that perfect reconstruction can be achieved using this kernel. |
M. Hilton; R. Alexandru; P. L. Dragotti; |
1097 | Social Learning Under Inferential Attacks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we consider the scenario where a subset of agents aims at driving the network beliefs to the wrong hypothesis. |
K. Ntemos; V. Bordignon; S. Vlaski; A. H. Sayed; |
1098 | Segregation in Social Networks: MARKOV Bridge Models and Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Specifically, we present a novel community-based graph model that represent segregation as a Markov bridge process. |
V. Krishnamurthy; R. Luo; B. Nettasinghe; |
1099 | Controlled Testing and Isolation for Suppressing Covid-19 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We argue that controlling the spread of the virus can be done by using active feedback to control testing for infection by actively testing individuals with a high probability of being infected. |
K. Cohen; A. Leshem; |
1100 | Two-Stage Graph-Constrained Group Testing: Theory and Application Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper, in contrast, formalizes an adaptive, two-stage framework that requires T(kM2 log(p/k)) tests, that is, a factor k smaller than that of the one-stage (non-adaptive) frameworks. |
S. Sihag; A. Tajer; U. Mitra; |
1101 | Unveiling Anomalous Nodes Via Random Sampling and Consensus on Graphs Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The present paper develops a graph-based sampling and consensus (GraphSAC) approach to effectively detect anomalous nodes in large-scale graphs. |
V. N. Ioannidis; D. Berberidis; G. B. Giannakis; |
1102 | Estimating Fiedler Value on Large Networks Based on Random Walk Observations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we describe an iterative scheme which is able to estimate the Fiedler value of a network when the topology is initially unknown. |
A. Reiffers-Masson; T. Chonavel; Y. Hayel; |
1103 | Orthogonality and Zero DC Tradeoffs in Biorthogonal Graph Filterbanks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: By exploiting a new extension of the admissibility condition in GraphBior we propose a new fundamental matrix with the goal of distributing the errors of GraphBior more uniformly across pixels with different node degrees. |
D. E. O. Tzamarias; E. Pavez; B. Girault; A. Ortega; I. Blanes; J. Serra-Sagrist�; |
1104 | Graph Signal Compression Via Task-Based Quantization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we study the joint design of graph signal sampling along with the quantization of these samples, for graph signal compression. |
P. Li; N. Shlezinger; H. Zhang; B. Wang; Y. C. Eldar; |
1105 | A Partially Collapsed Gibbs Sampler for Unsupervised Nonnegative Sparse Signal Restoration Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper the problem of restoration of unsupervised nonnegative sparse signals is addressed in the Bayesian framework. |
M. C. Amrouche; H. Carfantan; J. Idier; |
1106 | A Structure-Guided and Sparse-Representation-Based 3d Seismic Inversion Method Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, with the help of gradient structure tensor (GST) and dictionary learning and sparse representation (DLSR) technologies, we propose a 3D inversion approach (GST-DLSR) that considers both vertical and horizontal structural constraints. |
B. She; Y. Wang; G. Hu; |
1107 | Accelerating Frank-Wolfe with Weighted Average Gradients Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The present contribution broadens its scope by replacing the gradient per FW subproblem with a weighted average of gradients. |
Y. Zhang; B. Li; G. B. Giannakis; |
1108 | Yapa: Accelerated Proximal Algorithm for Convex Composite Problems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a primal proximal method derived from a three-operator splitting in a product space and accelerated with Anderson extrapolation. |
G. Chierchia; M. El Gheche; |
1109 | Data Discovery Using Lossless Compression-Based Sparse Representation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a data-driven sparse representation using orthonormal bases under the lossless compression constraint. |
E. Sabeti; P. X. K. Song; A. O. Hero; |
1110 | Safe Screening for Sparse Regression with The Kullback-Leibler Divergence Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we extend the GAP Safe screening rule to the l1-regularized Kullback-Leibler divergence which does not fulfil the regularity assumptions made in previous works. |
C. F. Dantas; E. Soubies; C. F�votte; |
1111 | On The Convergence of Randomized Bregman Coordinate Descent for Non-Lipschitz Composite Problems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new randomized Bregman (block) coordinate descent (RBCD) method for minimizing a composite problem, where the objective function could be either convex or nonconvex, and the smooth part are freed from the global Lipschitz-continuous (partial) gradient assumption. |
T. Gao; S. Lu; J. Liu; C. Chu; |
1112 | A Global Cayley Parametrization of Stiefel Manifold for Direct Utilization of Optimization Mechanisms Over Vector Spaces Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a global parametrization of the Stiefel manifold entirely by a single fixed vector space with the Cayley transform, say Global Cayley Parametrization (G-CP), to solve the problem through optimization over a vector space. |
K. Kume; I. Yamada; |
1113 | Training Logical Neural Networks By Primal�Dual Methods for Neuro-Symbolic Reasoning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a unified framework for solving this nonlinear programming problem by leveraging primal-dual optimization methods, and quantify the corresponding convergence rate to the Karush-Kuhn-Tucker (KKT) points of this problem. |
S. Lu; N. Khan; I. Y. Akhalwaya; R. Riegel; L. Horesh; A. Gray; |
1114 | Cooperative Parameter Tracking on The Unit Sphere Using Distributed Adapt-Then-Combine Particle Filters and Parallel Transport Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a new distributed Adapt-then-Combine (ATC) diffusion algorithm for cooperative tracking of an un-known state vector that evolves on the unit hypersphere. |
C. G. de Figueredo; C. J. Bordin; M. G. S. Bruno; |
1115 | Bayesian Estimation of A Tail-Index with Marginalized Threshold Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop a new method for estimating the tail-index found in extreme value statistics. |
D. E. Johnston; P. M. Djuric; |
1116 | Block Kalman Filter: An Asymptotic Block Particle Filter in The Linear Gaussian Case Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce the block based approach in the Kalman filter and show that the block particle filter asymptotically acts as the resulting block Kalman filter. |
R. Min; C. Garnier; F. Septier; J. Klein; |
1117 | Particle Gibbs Sampling for Regime-Switching State-Space Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we develop a particle Gibbs sampling algorithm for Bayesian learning in RS-SSMs. |
Y. El-Laham; L. Yang; H. J. Lynch; P. M. Djuric; M. F. Bugallo; |
1118 | Adaptive Importance Sampling Via Auto-Regressive Generative Models and Gaussian Processes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a class of adaptive importance sampling methods where the proposal distribution is constructed in a way that Gaussian processes are combined autoregressively. |
H. Wang; M. F. Bugallo; P. M. Djuric; |
1119 | Variational Parameter Learning in Sequential State-Space Model Via Particle Filtering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel algorithm, the particle filter variational inference (PF-VI) algorithm, which achieves closed-form learning of SSM parameters while tractably inferring the non-linear sequential states. |
C. Li; S. J. Godsill; |
1120 | Correlation-Based Robust Linear Regression with Iterative Outlier Removal Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we consider linear regression from the view of correlation and propose a robust regression algorithm. |
J. Ding; J. Wang; Y. Zhang; Y. Li; N. Zheng; |
1121 | On The Optimality of Backward Regression: Sparse Recovery and Subset Selection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present novel guarantees for the algorithm, propose an efficient, numerically stable implementation, and put forth Stepwise Regression with Replacement (SRR), a new family of two-stage algorithms that employs both forward and backward steps for compressed sensing problems. |
S. Ament; C. Gomes; |
1122 | General Total Variation Regularized Sparse Bayesian Learning for Robust Block-Sparse Signal Recovery Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel Sparse Bayesian Learning (SBL) method for block-sparse recovery based on popular CS based regularizers with the function input variable related to total variation (TV). |
A. Sant; M. Leinonen; B. D. Rao; |
1123 | Automatic Registration and Clustering of Time Series Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new method for automatic time series alignment within a clustering problem. |
M. Weylandt; G. Michailidis; |
1124 | Low-Rank on Graphs Plus Temporally Smooth Sparse Decomposition for Anomaly Detection in Spatiotemporal Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce an unsupervised tensor-based anomaly detection method that takes the sparse and temporally continuous nature of anomalies into account. |
S. E. Sofuoglu; S. Aviyente; |
1125 | A Parallel Algorithm for Phase Retrieval with Dictionary Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new formulation for the joint phase retrieval and dictionary learning problem with a reduced number of regularization parameters to be tuned. |
T. Liu; A. M. Tillmann; Y. Yang; Y. C. Eldar; M. Pesavento; |
1126 | Improving RNN Transducer Modeling for Small-Footprint Keyword Spotting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we improve the RNN-T modeling for small-footprint keyword spotting in three aspects. |
Y. Tian; H. Yao; M. Cai; Y. Liu; Z. Ma; |
1127 | Cascaded Encoders for Unifying Streaming and Non-Streaming ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents cascaded encoders for building a single E2E ASR model that can operate in both these modes simultaneously. |
A. Narayanan; et al. |
1128 | A Better and Faster End-to-end Model for Streaming ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we look at encouraging the E2E model to emit words early, through an algorithm called FastEmit [3]. |
B. Li; et al. |
1129 | Efficient Knowledge Distillation for RNN-Transducer Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop a distillation method for RNN-Transducer (RNN-T) models, a popular end-to-end neural network architecture for streaming speech recognition. |
S. Panchapagesan; D. S. Park; C. -C. Chiu; Y. Shangguan; Q. Liang; A. Gruenstein; |
1130 | Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To join the advantages of classical and end-to-end approaches for speech recognition, we present a simple, novel and competitive approach for phoneme-based neural transducer modeling. |
W. Zhou; S. Berger; R. Schl�ter; H. Ney; |
1131 | RNN-T Based Open-Vocabulary Keyword Spotting in Mandarin with Multi-Level Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an RNN Transducer (RNN-T) based keyword spotting system with a constrained attention mechanism biasing module that biases the RNN-T model towards a specific keyword of interest. |
Z. Liu; T. Li; P. Zhang; |
1132 | Advancing RNN Transducer Technology for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate a set of techniques for RNN Transducers (RNN-Ts) that were instrumental in lowering the word error rate on three different tasks (Switchboard 300 hours, conversational Spanish 780 hours and conversational Italian 900 hours). |
G. Saon; Z. T�ske; D. Bolanos; B. Kingsbury; |
1133 | Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we study the influence of the amount of label context on the model?s accuracy, and its impact on the efficiency of the decoding process. |
R. Prabhavalkar; et al. |
1134 | Simpleflat: A Simple Whole-Network Pre-Training Approach for RNN Transducer-Based End-to-End Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our solution is SimpleFlat (SF), a novel and simple whole-network pretraining approach for RNN-T. |
T. Moriya; et al. |
1135 | Echo State Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose automatic speech recognition (ASR) models inspired by echo state network (ESN) [1], in which a subset of recurrent neural networks (RNN) layers in the models are randomly initialized and untrained. |
H. Shrivastava; A. Garg; Y. Cao; Y. Zhang; T. Sainath; |
1136 | Using Synthetic Audio to Improve The Recognition of Out-of-Vocabulary Words in End-to-End Asr Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we use a text-to-speech (TTS) engine to provide synthetic audio for out-of-vocabulary (OOV) words. |
X. Zheng; Y. Liu; D. Gunceler; D. Willett; |
1137 | Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe a sequence-to-sequence neural network which directly generates speech waveforms from text inputs. |
R. J. Weiss; R. Skerry-Ryan; E. Battenberg; S. Mariooryad; D. P. Kingma; |
1138 | Patnet : A Phoneme-Level Autoregressive Transformer Network for Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Aiming at efficiently predicting acoustic features with high naturalness and robustness, this paper proposes PATNet, a neural acoustic model for speech synthesis using phoneme-level autoregression. |
S. Wang; Z. Ling; R. Fu; J. Yi; J. Tao; |
1139 | Multi-Rate Attention Architecture for Fast Streamable Text-to-Speech Spectrum Modeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a multi-rate attention architecture that breaks the latency and RTF bottlenecks by computing a compact representation during encoding and recurrently generating the attention vector in a streaming manner during decoding. |
Q. He; Z. Xiu; T. Koehler; J. Wu; |
1140 | End-to-End Text-to-Speech Using Latent Duration Based on VQ-VAE Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new TTS framework using explicit duration modeling that incorporates duration as a discrete latent variable to TTS and enables joint optimization of whole modules from scratch. |
Y. Yasuda; X. Wang; J. Yamagishd; |
1141 | Lightspeech: Lightweight and Fast Text to Speech with Neural Architecture Search Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose LightSpeech, which leverages neural architecture search (NAS) to automatically design more lightweight and efficient models based on FastSpeech. |
R. Luo; et al. |
1142 | A New High Quality Trajectory Tiling Based Hybrid TTS In Real Time Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A trajectory tiling based, hybrid TTS is revisited in this study for improving its synthesis performance. |
F. -L. Xie; X. -H. Li; W. -C. Su; L. Lu; F. K. Soong; |
1143 | Parallel Tacotron: Non-Autoregressive and Controllable TTS Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a non-autoregressive neural text-to-speech model augmented with a variational autoencoder-based residual encoder. |
I. Elias; et al. |
1144 | Fcl-Taco2: Towards Fast, Controllable and Lightweight Text-to-Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To achieve fast inference speed and small model size while maintain high-quality speech, we propose FCL-taco2, a Fast, Controllable and Lightweight (FCL) TTS model based on Tacotron2. |
D. Wang; et al. |
1145 | Prosodic Clustering for Phoneme-Level Prosody Control in End-to-End Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a method for controlling the prosody at the phoneme level in an autoregressive attention-based text-to-speech system. |
A. Vioni; et al. |
1146 | Improving Naturalness and Controllability of Sequence-to-Sequence Speech Synthesis By Learning Local Prosody Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we extended Tacotron2 with a pitch prediction task to capture discrete pitch-related representations. |
C. Gong; L. Wang; Z. Ling; S. Guo; J. Zhang; J. Dang; |
1147 | Multi-Speaker Emotional Speech Synthesis with Fine-Grained Prosody Modeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an end-to-end system for multi-speaker emotional speech synthesis. |
C. Lu; X. Wen; R. Liu; X. Chen; |
1148 | Emotion Controllable Speech Synthesis Using Emotion-Unlabeled Dataset with The Assistance of Cross-Domain Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel approach for emotional TTS synthesis on a TTS dataset without emotion labels. |
X. Cai; D. Dai; Z. Wu; X. Li; J. Li; H. Meng; |
1149 | Dual-Path Modeling for Long Recording Speech Separation in Meetings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we further extend the dual-path modeling framework for CSS task. A transformer-based dual-path system is proposed, which integrates transform layers for global modeling. |
C. Li; et al. |
1150 | Time-Domain Loss Modulation Based on Overlap Ratio for Monaural Conversational Speaker Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new loss function for speaker separation based on permutation invariant training that dynamically reweighs losses using the segment overlap ratio. |
H. Taherian; D. Wang; |
1151 | Continuous Speech Separation with Conformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper examines the use of Conformer architecture in lieu of recurrent neural networks for the separation model. |
S. Chen; et al. |
1152 | A Flow-Based Neural Network for Time Domain Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Thus, in this paper we propose a NF framework to directly model the enhancement process by density estimation of clean speech utterances conditioned on their noisy counterpart. |
M. Strauss; B. Edler; |
1153 | Sandglasset: A Light Multi-Granularity Self-Attentive Network for Time-Domain Speech Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a self-attentive network with a novel sandglass-shape, namely Sandglasset, which advances the state-of-the-art (SOTA) SS performance at significantly smaller model size and computational cost. |
M. W. Y. Lam; J. Wang; D. Su; D. Yu; |
1154 | TransMask: A Compact and Fast Speech Separation Model Based on Transformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To make these models more practical by reducing the model size and inference time while maintaining high separation quality, we propose a new transformer-based speech separation approach, called TransMask. |
Z. Zhang; B. He; Z. Zhang; |
1155 | One Shot Learning for Speech Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we use model-agnostic meta-learning(MAML) algorithm and almost no inner loop(ANIL) algorithm in Conv-TasNet to achieve this goal. |
Y. -K. Wu; K. -P. Huang; Y. Tsao; H. -y. Lee; |
1156 | Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and A Small Step Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We demonstrate the relative inseparability of noise and that this noisy speech paradigm leads to significant degradation of system performance. |
M. Maciejewski; J. Shi; S. Watanabe; S. Khudanpur; |
1157 | Speaker and Direction Inferred Dual-Channel Speech Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we employ ideas from auditory attention with two ears and propose a speaker and direction inferred speech separation network (dubbed SDNet) to solve the cocktail party problem. |
C. Li; J. Xu; N. Mesgarani; B. Xu; |
1158 | Speech Dereverberation Using Variational Autoencoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a statistical method for single-channel speech dereverberation using a variational autoencoder (VAE) for modelling the speech spectra. |
D. Baby; H. Bourlard; |
1159 | Real-Time Denoising and Dereverberation Wtih Tiny Recurrent U-Net Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose Tiny Recurrent U-Net (TRU-Net), a lightweight online inference model that matches the performance of current state-of- the-art models. |
H. -S. Choi; S. Park; J. H. Lee; H. Heo; D. Jeon; K. Lee; |
1160 | Weighted Magnitude-Phase Loss for Speech Dereverberation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new weighted magnitude-phase loss function, which is divided into a magnitude component and a phase component, to train a neural network to estimate complex ideal ratio masks. |
J. Zhang; M. D. Plumbley; W. Wang; |
1161 | Speaker Embeddings for Diarization of Broadcast Data In The Allies Challenge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The work reported in this paper compares the performance of different embeddings extracted from MFCCs or the raw signal for speaker diarization and broadcast media treated with compression and sub-sampling, operations which typically degrade performance. |
A. Larcher; et al. |
1162 | On The Detection of Pitch-Shifted Voice: Machines and Human Listeners Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a performance comparison between human listeners and a simple algorithm for the task of speech anomaly detection. |
D. Looney; N. D. Gaubitch; |
1163 | The Ins and Outs of Speaker Recognition: Lessons from VoxSRC 2020 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The goal of this work is robust speaker recognition of utterances recorded in these challenging environments. |
Y. Kwon; H. -S. Heo; B. -J. Lee; J. S. Chung; |
1164 | The Idlab Voxsrc-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose and analyse a large margin fine-tuning strategy and a quality-aware score calibration in text-independent speaker verification. |
J. Thienpondt; B. Desplanques; K. Demuynck; |
1165 | Analysis of The But Diarization System for Voxconverse Challenge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the system developed by the BUT team for the fourth track of the VoxCeleb Speaker Recognition Challenge, focusing on diarization on the VoxConverse dataset. |
F. Landini; et al. |
1166 | Microsoft Speaker Diarization System for The Voxceleb Speaker Recognition Challenge 2020 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the Microsoft speaker diarization system for monaural multi-talker recordings in the wild, evaluated at the diarization track of the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2020. |
X. Xiao; et al. |
1167 | Squeezing Value of Cross-Domain Labels: A Decoupled Scoring Approach for Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on this analysis, we present a decoupled scoring approach that can maximally squeeze the value of cross-domain labels and obtain optimal verification scores in the enrollment-test mismatch condition. |
L. Li; Y. Zhang; J. Kang; T. F. Zheng; D. Wang; |
1168 | Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we incorporate self-supervised learning strategy to the unsupervised domain adaptation system and proposed a self-supervised learning based domain adaptation approach (SSDA). |
Z. Chen; S. Wang; Y. Qian; |
1169 | Meta-Learning for Cross-Channel Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a meta speaker embedding network (MSEN) via meta-learning to generate channel-invariant utterance embeddings. |
H. Zhang; L. Wang; K. A. Lee; M. Liu; J. Dang; H. Chen; |
1170 | SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a synthesis based data augmentation method (SynAug) to expand the training set with more speakers and text-controlled synthesized speech. |
C. Du; B. Han; S. Wang; Y. Qian; K. Yu; |
1171 | Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a unit selection synthesis based data augmentation method to leverage the abundant text-independent data resources. |
H. Huang; X. Xiang; F. Zhao; S. Wang; Y. Qian; |
1172 | Improving Speaker Verification in Reverberant Environments Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we first introduce a new feature set that gives more details in the frequency dimension in the 2-D time-frequency space used to represent speech. |
X. Chen; S. A. Zahorian; |
1173 | Transformer-Transducers for Code-Switched Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an end-to-end ASR system using a transformer-transducer model architecture for code-switched speech recognition. |
S. Dalmia; Y. Liu; S. Ronanki; K. Kirchhoff; |
1174 | Wake Word Detection with Streaming Transformers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we explore the performance of several variants of chunk-wise streaming Transformers tailored for wake word detection in a recently proposed LF-MMI system, including looking-ahead to the next chunk, gradient stopping, different positional embedding methods and adding same-layer dependency between chunks. |
Y. Wang; H. Lv; D. Povey; L. Xie; S. Khudanpur; |
1175 | Capturing Multi-Resolution Context By Dilated Self-Attention Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a combination of restricted self-attention and a dilation mechanism, which we refer to as dilated self-attention. |
N. Moritz; T. Hori; J. Le Roux; |
1176 | Recent Developments on Espnet Toolkit Boosted By Conformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we present recent developments on ESPnet: End-to- End Speech Processing toolkit, which mainly involves a recently proposed architecture called Conformer, Convolution-augmented Transformer. |
P. Guo; et al. |
1177 | Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel large-context end-to-end automatic speech recognition (E2E-ASR) model and its effective training method based on knowledge distillation. |
R. Masumura; N. Makishima; M. Ihori; A. Takashima; T. Tanaka; S. Orihashi; |
1178 | End-to-End Multi-Channel Transformer for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we leverage the neural transformer architectures for multi-channel speech recognition systems, where the spectral and spatial information collected from different microphones are integrated using attention layers. |
F. -J. Chang; M. Radfar; A. Mouchtaris; B. King; S. Kunzmann; |
1179 | CASS-NAT: CTC Alignment-Based Single Step Non-Autoregressive Transformer for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a CTC alignment-based single step non-autoregressive transformer (CASS-NAT) for speech recognition. |
R. Fan; W. Chu; P. Chang; J. Xiao; |
1180 | Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this problem, we propose a CTC-enhanced NAR transformer, which generates target sequence by refining predictions of the CTC module. |
X. Song; Z. Wu; Y. Huang; C. Weng; D. Su; H. Meng; |
1181 | Transformer-Based End-to-End Speech Recognition with Local Dense Synthesizer Attention Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by the fact that dense synthesizer attention (DSA), which dispenses with dot products and pairwise interactions, achieved competitive results in many language processing tasks, in this paper, we first propose a DSA-based speech recognition, as an alternative to SA. |
M. Xu; S. Li; X. -L. Zhang; |
1182 | Developing Real-Time Streaming Transformer Transducer for Speech Recognition on Large-Scale Dataset Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we explored the potential of Transformer Transducer (T-T) models for the fist pass decoding with low latency and fast speed on a large-scale dataset. |
X. Chen; Y. Wu; Z. Wang; S. Liu; J. Li; |
1183 | Head-Synchronous Decoding for Transformer-Based Streaming ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To overcome these issues, here we propose a head-synchronous (HS) version of the DACS algorithm, where the boundary of attention is jointly detected by all the DACS heads in each decoder layer. |
M. Li; C. Zorila; R. Doddipatla; |
1184 | History Utterance Embedding Transformer LM for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose the history utterance embedding Transformer LM (HTLM), which includes an embedding generation network for extracting contextual information contained in the history utterances and a main Transformer LM for current prediction. |
K. Deng; G. Cheng; H. Miao; P. Zhang; Y. Yan; |
1185 | Maskcyclegan-VC: Learning Non-Parallel Voice Conversion with Filling in Frames Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: As an alternative, we propose MaskCycleGAN-VC, which is another extension of CycleGAN-VC2 and is trained using a novel auxiliary task called filling in frames (FIF). |
T. Kaneko; H. Kameoka; K. Tanaka; N. Hojo; |
1186 | Non-Parallel Many-To-Many Voice Conversion By Knowledge Transfer from A Text-To-Speech Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a simple but novel framework to train a nonparallel many-to-many voice conversion (VC) model based on the encoder-decoder architecture. |
X. YU; B. Mak; |
1187 | Non-Parallel Many-To-Many Voice Conversion Using Local Linguistic Tokens Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose the Local Linguistic Tokens (LLTs) model to learn high-quality speaker-invariant linguistic embeddings using the multi-head attention module, which has shown great success in extracting speaking style embeddings in Global Style Tokens (GSTs). |
C. Wang; Y. Yu; |
1188 | Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an open-source software for developing a nonparallel voice conversion (VC) system named crank. |
K. Kobayashi; W. -C. Huang; Y. -C. Wu; P. L. Tobing; T. Hayashi; T. Toda; |
1189 | Fragmentvc: Any-To-Any Voice Conversion By End-To-End Extracting and Fusing Fine-Grained Voice Fragments with Attention Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we proposed FragmentVC, in which the latent phonetic structure of the utterance from the source speaker is obtained from Wav2Vec 2.0, while the spectral features of the utterance(s) from the target speaker are obtained from log mel-spectrograms. |
Y. Y. Lin; C. -M. Chien; J. -H. Lin; H. -y. Lee; L. -s. Lee; |
1190 | Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel approach to any-to-one (A2O) voice conversion (VC) in a sequence-to-sequence (seq2seq) framework. |
W. -C. Huang; Y. -C. Wu; T. Hayashi; |
1191 | Towards Low-Resource Stargan Voice Conversion Using Weight Adaptive Instance Normalization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we aim at improving the data efficiency of the model and achieving a many-to-many non-parallel StarGAN-based voice conversion for a relatively large number of speakers with limited training samples. |
M. Chen; Y. Shi; T. Hain; |
1192 | Again-VC: A One-Shot Voice Conversion Using Activation Guidance and Adaptive Instance Normalization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose AGAIN-VC, an innovative VC system using Activation Guidance and Adaptive Instance Normalization. |
Y. -H. Chen; D. -Y. Wu; T. -H. Wu; H. -y. Lee; |
1193 | One-Shot Voice Conversion Based on Speaker Aware Module Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a speaker-aware voice conversion (SAVC) system realizing one-shot voice conversion without an adaptation stage. |
Y. Zhang; H. Che; J. Li; C. Li; X. Wang; Z. Wang; |
1194 | Zero-Shot Voice Conversion with Adjusted Speaker Embeddings and Simple Acoustic Features Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In our work, a newly designed neural network was used to adjust the speaker embeddings of unseen speakers. |
Z. Tan; J. Wei; J. Xu; Y. He; W. Lu; |
1195 | Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we build upon the neural text-to-speech (TTS) model, i.e., FastSpeech, and LPCNet neural vocoder to design a new cross-lingual VC framework named FastSpeech-VC. |
S. Zhao; H. Wang; T. H. Nguyen; B. Ma; |
1196 | Meta-Learning for Improving Rare Word Recognition in End-to-End ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we take on the challenge of rare word recognition in end-to-end (E2E) automatic speech recognition (ASR) by integrating a meta learning mechanism into an E2E ASR system, enabling few-shot adaptation. |
F. Lux; N. T. Vu; |
1197 | A Comparison of Methods for OOV-Word Recognition on A New Public Dataset Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We showcase very large improvements in OOV-word recognition and make both the data and code available. |
R. A. Braun; S. Madikeri; P. Motlicek; |
1198 | Convolutional Dropout and Wordpiece Augmentation for End-to-End Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a generalization of dropout, called convolutional dropout, where each neuron?s activation is replaced with a randomly-weighted linear combination of neuron values in its neighborhood. |
H. Xu; Y. Huang; Y. Zhu; K. Audhkhasi; B. Ramabhadran; |
1199 | Partially Overlapped Inference for Long-Form Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a more effective way of overlapped inference by aligning partially matched hypotheses. |
T. G. Kang; H. -G. Kim; M. -J. Lee; J. Lee; H. Lee; |
1200 | Focus on The Present: A Regularization Method for The ASR Source-Target Attention Layer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a novel method to diagnose the source-target attention in state-of-the-art end-to-end speech recognition models with joint connectionist temporal classification (CTC) and attention training. |
N. Chen; P. Zelasko; J. Villalba; N. Dehak; |
1201 | Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present Bifocal RNN-T, a new variant of the Recurrent Neural Network Transducer (RNN-T) architecture designed for improved inference time latency on speech recognition tasks. |
J. Macoskey; G. P. Strimel; A. Rastrow; |
1202 | FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a sequence-level emission regularization method, named FastEmit, that applies latency regularization directly on per-sequence probability in training transducer models, and does not require any alignment. |
J. Yu; et al. |
1203 | Sparsification Via Compressed Sensing for Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a compressed sensing based pruning (CSP) approach to effectively address those questions. |
K. Zhen; H. D. Nguyen; F. -J. Chang; A. Mouchtaris; A. Rastrow; |
1204 | Dynamic Sparsity Neural Networks for Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present Dynamic Sparsity Neural Networks (DSNN) that, once trained, can instantly switch to any predefined sparsity configuration at run-time. |
Z. Wu; D. Zhao; Q. Liang; J. Yu; A. Gulati; R. Pang; |
1205 | An Asynchronous WFST-Based Decoder for Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce asynchronous dynamic decoder, which adopts an efficient A* algorithm to incorporate big language models in the one-pass decoding for large vocabulary continuous speech recognition. |
H. Lv; Z. Chen; H. Xu; D. Povey; L. Xie; S. Khudanpur; |
1206 | Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an extremely lightweight phone-based transducer model with a tiny decoding graph on edge devices. |
Y. Zhang; S. Sun; L. Ma; |
1207 | Noise Level Limited Sub-Modeling for Diffusion Probabilistic Vocoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a simple but effective noise level-limited sub-modeling framework for diffusion probabilistic vocoders Sub-WaveGrad and Sub-DiffWave. |
T. Okamoto; T. Toda; Y. Shiga; H. Kawai; |
1208 | StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We therefore propose StyleMelGAN, a lightweight neural vocoder allowing synthesis of high-fidelity speech with low computational complexity. |
A. Mustafa; N. Pia; G. Fuchs; |
1209 | Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes voicing-aware conditional discriminators for Parallel WaveGAN-based waveform synthesis systems. |
R. Yamamoto; E. Song; M. -J. Hwang; J. -M. Kim; |
1210 | Universal Neural Vocoding with Parallel Wavenet Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a universal neural vocoder based on Parallel WaveNet, with an additional conditioning network called Audio Encoder. |
Y. Jiao; A. Gabrys; G. Tinchev; B. Putrycz; D. Korzekwa; V. Klimkov; |
1211 | Periodnet: A Non-Autoregressive Waveform Generation Model with A Structure Separating Periodic and Aperiodic Components Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose PeriodNet, a non-autoregressive (non-AR) waveform generation model with a new model structure for modeling periodic and aperiodic components in speech waveforms. |
Y. Hono; S. Takaki; K. Hashimoto; K. Oura; Y. Nankaku; K. Tokuda; |
1212 | LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel conditional convolution network, named location-variable convolution, to model the dependencies of the waveform sequence. |
Z. Zeng; J. Wang; N. Cheng; J. Xiao; |
1213 | Graphspeech: Syntax-Aware Graph Attention Network for Neural Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel neural TTS model, denoted as GraphSpeech, that is formulated under graph neural network framework. |
R. Liu; B. Sisman; H. Li; |
1214 | Syntactic Representation Learning For Neural Network Based TTS with Syntactic Parse Tree Traversal Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a syntactic representation learning method based on syntactic parse tree traversal to automatically utilize the syntactic structure information. |
C. Song; J. Li; Y. Zhou; Z. Wu; H. Meng; |
1215 | A Chapter-Wise Understanding System for Text-To-Speech in Chinese Novels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a chapter-wise understanding system for Chinese novels, to predict speaker and emotion tags automatically based on the chapter-level context. |
J. Pan; L. Wu; X. Yin; P. Wu; C. Xu; Z. Ma; |
1216 | A Universal Bert-Based Front-End Model for Mandarin Text-To-Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a universal BERT-based model that can be used for various tasks in the Mandarin front-end without changing its architecture. |
Z. Bai; B. Hu; |
1217 | Improving Prosody Modelling with Cross-Utterance Bert Embeddings for End-to-End Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to use the text embeddings of the neighboring sentences to improve the prosody generation for each utterance of a paragraph in an end-to-end fashion without using any explicit prosody features. |
G. Xu; W. Song; Z. Zhang; C. Zhang; X. He; B. Zhou; |
1218 | Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel multi-channel speech extraction system to simultaneously extract multiple clean individual sources from a mixture in noisy and reverberant environments. |
J. Zhang; C. Zorila; R. Doddipatla; J. Barker; |
1219 | ADL-MVDR: All Deep Learning MVDR Beamformer for Target Speech Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel all deep learning MVDR framework, where the matrix inversion and eigenvalue decomposition are replaced by two recurrent neural networks (RNNs), to resolve both issues at the same time. |
Z. Zhang; Y. Xu; M. Yu; S. -X. Zhang; L. Chen; D. Yu; |
1220 | Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose two methods for exploiting the multi-channel spatial information to extract the target speech. |
J. Han; X. Zhou; Y. Long; Y. Li; |
1221 | Speaker Activity Driven Neural Speech Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore the use of speaker activity information as an auxiliary clue for single-channel neural network-based speech extraction. |
M. Delcroix; K. Zmolikova; T. Ochiai; K. Kinoshita; T. Nakatani; |
1222 | Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Environments Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by it, we explicitly modeled the onset cue and verified the effectiveness in the speaker extraction task. |
Y. Hao; J. Xu; P. Zhang; B. Xu; |
1223 | Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a speaker extraction technique, that performs in multiple stages to take full advantage of short reference speech sample. |
M. Ge; C. Xu; L. Wang; E. S. Chng; J. Dang; H. Li; |
1224 | Neural Network-Based Virtual Microphone Estimator Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, as an alternative approach, we propose a neural network-based virtual microphone estimator (NN-VME). |
T. Ochiai; M. Delcroix; T. Nakatani; R. Ikeshita; K. Kinoshita; S. Araki; |
1225 | Joint Maximum Likelihood Estimation of Power Spectral Densities and Relative Acoustic Transfer Functions for Acoustic Beamforming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We use historical results to derive joint ML estimates (MLEs) of the RATFs and PSDs in the context of acoustic beam-forming. |
P. Hoang; Z. -H. Tan; J. M. de Haan; J. Jensen; |
1226 | Cue-Preserving MMSE Filter with Bayesian SNR Marginalization for Binaural Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we translate the single-channel approach into a binaural Bayesian SNR marginalization, based on a binaural a-priori SNR definition and a related hyperprior. |
S. Thaleiser; G. Enzner; |
1227 | Blind and Neural Network-Guided Convolutional Beamformer for Joint Denoising, Dereverberation, and Source Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an approach for optimizing a Convolutional BeamFormer (CBF) that can jointly perform denoising (DN), dereverberation (DR), and source separation (SS). |
T. Nakatani; R. Ikeshita; K. Kinoshita; H. Sawada; S. Araki; |
1228 | Real-Time Speech Enhancement for Mobile Communication Based on Dual-Channel Complex Spectral Mapping Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel approach to real-time speech enhancement for dual-microphone mobile phones. |
K. Tan; X. Zhang; D. Wang; |
1229 | Don�t Shoot Butterfly with Rifles: Multi-Channel Continuous Speech Separation with Early Exit Transformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To deal with this problem, we propose an early exit mechanism, which enables the Transformer model to handle different cases with adaptive depth. |
S. Chen; et al. |
1230 | Double Multi-Head Attention for Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we present Double Multi-Head Attention (MHA) pooling, which extends our previous approach based on Self MHA. |
M. India; P. Safari; J. Hernando; |
1231 | Graph Attention Networks for Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents a novel back-end framework for speaker verification using graph attention networks. |
J. -w. Jung; H. -S. Heo; H. -J. Yu; J. S. Chung; |
1232 | Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore an approach based on memory layers and multi-head attention mechanisms to improve in an efficient way the performance of text-dependent speaker verification (SV) systems. |
V. Mingote; A. Miguel; A. Ortega; E. Lleida; |
1233 | FoolHD: Fooling Speaker Identification By Highly Imperceptible Adversarial Disturbances Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a white-box steganography-inspired adversarial attack that generates imperceptible adversarial perturbations against a speaker identification model. |
A. S. Shamsabadi; F. S. Teixeira; A. Abad; B. Raj; A. Cavallaro; I. Trancoso; |
1234 | Adversarial Defense for Deep Speaker Recognition Using Hybrid Adversarial Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this concern, in this work, we propose a new defense mechanism based on a hybrid adversarial training (HAT) setup. |
M. Pal; A. Jati; R. Peri; C. -C. Hsu; W. AbdAlmageed; S. Narayanan; |
1235 | DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we propose a novel framework to disentangle speaker-related and domain-specific features and apply domain adaptation on the speaker-related feature space solely. |
M. Sang; W. Xia; J. H. L. Hansen; |
1236 | Playing A Part: Speaker Verification at The Movies Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The goal of this work is to investigate the performance of popular speaker recognition models on speech segments from movies, where often actors intentionally disguise their voice to play a character. |
A. Brown; J. Huh; A. Nagrani; J. S. Chung; A. Zisserman; |
1237 | Small Footprint Text-Independent Speaker Verification For Embedded Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a two-stage model architecture orders of magnitude smaller than common solutions (237.5K learning parameters, 11.5MFLOPS) reaching a competitive result of 3.31% Equal Error Rate (EER) on the well established VoxCeleb1 verification test set. |
J. Balian; R. Tavarone; M. Poumeyrol; A. Coucke; |
1238 | ASV-SUBTOOLS: Open Source Toolkit for Automatic Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a new open source toolkit for automatic speaker verification (ASV), named ASV-Subtools. |
F. Tong; et al. |
1239 | DEEPTALK: Vocal Style Encoding for Speaker Recognition and Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a prosody encoding network called DeepTalk for extracting vocal style features directly from raw audio data. |
A. Chowdhury; A. Ross; P. David; |
1240 | A Multi-View Approach to Audio-Visual Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: As these methods lack the ability to do cross-modal verification, we introduce a multi-view model which uses a shared classifier to map audio and video into the same space. |
L. Sari; K. Singh; J. Zhou; L. Torresani; N. Singhal; Y. Saraf; |
1241 | Top-Down Attention in End-to-End Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on this insight, we propose Top-Down SLU (TD-SLU), a new transformer-based E2E SLU model that uses top-down attention and an attention gate to fuse high-level NLU features with low-level ASR features, which leads to a better optimization of both tasks. |
Y. Chen; et al. |
1242 | Fine-Tuning of Pre-Trained End-to-End Speech Recognition with Generative Adversarial Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a novel framework for fine-tuning a pre-trained ASR model using the GAN objective where the ASR model acts as a generator and a discriminator tries to distinguish the ASR output from the real data. |
M. A. Haidar; M. Rezagholizadeh; |
1243 | A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we propose a general multi-task learning framework to leverage text data for ASR and ST tasks. |
Y. Tang; J. Pino; C. Wang; X. Ma; D. Genzel; |
1244 | Gaussian Kernelized Self-Attention for Long Sequence Data and Its Application to CTC-Based Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To mitigate this mismatch, we propose a new architecture, which is a variant of the Gaussian kernel, which itself is a shift-invariant kernel. |
Y. Kashiwagi; E. Tsunoo; S. Watanabe; |
1245 | Lattice-Free Mmi Adaptation of Self-Supervised Pretrained Acoustic Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose lattice-free MMI (LFMMI) for supervised adaptation of self-supervised pretrained acoustic model. |
A. Vyas; S. Madikeri; H. Bourlard; |
1246 | Intermediate Loss Regularization for CTC-Based Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a simple and efficient auxiliary loss function for automatic speech recognition (ASR) based on the connectionist temporal classification (CTC) objective. |
J. Lee; S. Watanabe; |
1247 | Code-Switch Speech Rescoring with Monolingual Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the code-switch speech recognition in mainland China, which is obviously different from the Hong Kong and Southeast Asia area in linguistic characteristics. |
G. Liu; L. Cao; |
1248 | Mixture of Informed Experts for Multilingual Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a novel variant of this approach, ?informed experts?, which attempts to tackle inter-task conflicts by eliminating gradients from other tasks in these task-specific parameters. |
N. Gaur; et al. |
1249 | Reducing Spelling Inconsistencies in Code-Switching ASR Using Contextualized CTC Loss Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose Contextualized Connectionist Temporal Classification (CCTC) loss to encourage spelling consistencies of a character-based non-autoregressive ASR which allows for faster inference. |
B. Naowarat; T. Kongthaworn; K. Karunratanakul; S. H. Wu; E. Chuangsuwanich; |
1250 | Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we further explore using a single model for multi-dialect speech recognition using ensemble modeling. |
A. Das; K. Kumar; J. Wu; |
1251 | Decoupling Pronunciation and Language for End-to-End Code-Switching Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a decoupled transformer model to use mono-lingual paired data and unpaired text data to alleviate the problem of code-switching data shortage. |
S. Zhang; J. Yi; Z. Tian; Y. Bai; J. Tao; Z. wen; |
1252 | AISpeech-SJTU Accent Identification System for The Accented English Speech Recognition Challenge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the AISpeech-SJTU system for the accent identification track of the Interspeech-2020 Accented English Speech Recognition Challenge. |
H. Huang; X. Xiang; Y. Yang; R. Ma; Y. Qian; |
1253 | Meta-Learning for Low-Resource Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Probing the learning process of generalized representations across languages, we propose a meta-learning approach for low-resource speech emotion recognition. |
S. Chopra; P. Mathur; R. Sawhney; R. R. Shah; |
1254 | Progressive Co-Teaching for Ambiguous Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by human and animal learning studies, we propose a novel method named Progressive Co-teaching (PCT) to learn speech emotion features from simple to difficult. |
Y. Yin; Y. Gu; L. Yao; Y. Zhou; X. Liang; H. Zhang; |
1255 | Emotion Recognition By Fusing Time Synchronous and Time Asynchronous Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a novel two-branch neural network model structure is proposed for multimodal emotion recognition, which consists of a time synchronous branch (TSB) and a time asynchronous branch (TAB). |
W. Wu; C. Zhang; P. C. Woodland; |
1256 | Speech Emotion Recognition Based on Listener Adaptive Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to mitigate this problem, we propose a Listener Adaptive (LA) model that reflects emotion recognition criteria of each listener. |
A. Ando; et al. |
1257 | Speech Emotion Recognition Using Semantic Information Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel framework that can capture both the semantic and the paralinguistic information in the signal. |
P. Tzirakis; A. Nguyen; S. Zafeiriou; B. W. Schuller; |
1258 | Compact Graph Architecture for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a deep graph approach to address the task of speech emotion recognition. |
A. Shirian; T. Guha; |
1259 | A Novel End-to-end Speech Emotion Recognition Network with Stacked Transformer Layers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In contrast with those previous works, herein we propose a novel strategy for global SER feature extraction by applying an additional enhancement module on top of the current SER pipeline. |
X. Wang; M. Wang; W. Qi; W. Su; X. Wang; H. Zhou; |
1260 | A Novel Attention-Based Gated Recurrent Unit and Its Efficacy in Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we explore using diverse activation functions within GRU and bi-directional GRU (BiGRU) cells in the context of speech emotion recognition (SER). |
S. T. Rajamani; K. T. Rajamani; A. Mallol-Ragolta; S. Liu; B. Schuller; |
1261 | MAEC: Multi-Instance Learning with An Adversarial Auto-Encoder-Based Classifier for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an adversarial auto-encoder-based classifier, which can regularize the distribution of latent representation to smooth the boundaries among categories. |
C. Fu; C. Liu; C. T. Ishi; H. Ishiguro; |
1262 | Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an attention module, named spectro-temporal-channel (STC) attention module that is integrated with CNN to improve representation learning ability. |
L. Guo; L. Wang; C. Xu; J. Dang; E. S. Chng; H. Li; |
1263 | Speech Emotion Recognition Using Quaternion Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our paper addresses this problem by proposing a quaternion convolutional neural network (QCNN) based speech emotion recognition (SER) model in which Mel-spectrogram features of speech signals are encoded in an RGB quaternion domain. |
A. Muppidi; M. Radfar; |
1264 | Domain-Adversarial Autoencoder with Attention Based Feature Level Fusion for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a domain-adversarial autoencoder to extract discriminative representations for SER. |
Y. Gao; J. Liu; L. Wang; J. Dang; |
1265 | Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we apply multiscale area attention in a deep convolutional neural network to attend emotional characteristics with varied granularities and therefore the classifier can benefit from an ensemble of attentions with different scales. |
M. Xu; F. Zhang; X. Cui; W. Zhang; |
1266 | CopyPaste: An Augmentation Method for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study proposes CopyPaste, a perceptually motivated novel augmentation procedure for SER. |
R. Pappagari; J. Villalba; P. Zelasko; L. Moro-Velazquez; N. Dehak; |
1267 | Contrastive Unsupervised Learning for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To circumvent this problem, we investigate how unsupervised representation learning on unlabeled datasets can benefit SER. |
M. Li; et al. |
1268 | Hierarchical Network Based on The Fusion of Static and Dynamic Features for Speech Emotion Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel hierarchical network called HNSD that can efficiently integrate the static and dynamic features for SER. |
Q. Cao; M. Hou; B. Chen; Z. Zhang; G. Lu; |
1269 | Multimodal Emotion Recognition with Capsule Graph Convolutional Based Representation Fusion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel representation fusion method, Capsule Graph Convolutional Network (CapsGCN). |
J. Liu; et al. |
1270 | Disentanglement for Audio-Visual Emotion Recognition Using Multitask Setup Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In particular, we developed a multitask framework to extract low-dimensional embeddings that aim to capture emotion specific information, while containing minimal information related to person identity. |
R. Peri; S. Parthasarathy; C. Bradshaw; S. Sundaram; |
1271 | Data Augmentation with Signal Companding for Detection of Logical Access Attacks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel data augmentation technique using a-law and mu-law based signal companding. |
R. K. Das; J. Yang; H. Li; |
1272 | Replay and Synthetic Speech Detection with Res2Net Architecture Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes to leverage a novel model structure, so-called Res2Net, to improve the anti-spoofing countermeasure?s generalizability. |
X. Li; et al. |
1273 | A Capsule Network Based Approach for Detection of Audio Spoofing Attacks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, as the first attempt, we introduce a capsule network to enhance the generalization of the detection system. |
A. Luo; E. Li; Y. Liu; X. Kang; Z. J. Wang; |
1274 | Cross-Teager Energy Cepstral Coefficients for Replay Spoof Detection on Voice Assistants Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The key idea of this work is optimal channel selection based on maximum cross-energies from a multichannel input, which is suitable for SSD task. |
R. Acharya; H. Kotta; A. T. Patil; H. A. Patil; |
1275 | End-to-End Anti-spoofing with RawNet2 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe modifications made to the original RawNet2 architecture so that it can be applied to anti-spoofing. |
H. Tak; J. Patino; M. Todisco; A. Nautsch; N. Evans; A. Larcher; |
1276 | Replay-Attack Detection Using Features With Adaptive Spectro-Temporal Resolution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, an adaptive spectro-temporal resolution is proposed to obtain the optimal scale in the feature space: the frequency resolution is adaptive to frequency discrimination, while the temporal resolution is adaptive to continuous phones. |
M. Liu; L. Wang; K. A. Lee; X. Chen; J. Dang; |
1277 | Improving Identification of System-Directed Speech Utterances By Deep Learning of ASR-Based Word Embeddings and Confidence Metrics Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we extend our previous work on the detection of system-directed speech utterances. |
V. Vilaysouk; A. Nour-Eldin; D. Connolly; |
1278 | BLSTM-Based Confidence Estimation for End-to-End Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we perform confidence estimation for end-to-end (E2E) ASR hypotheses. |
A. Ogawa; N. Tawara; T. Kano; M. Delcroix; |
1279 | Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we first examine how some commonly used regularisation methods influence the softmax-based confidence scores and study the overconfident behaviour of end-to-end models. Then we propose a lightweight and effective approach named confidence estimation module (CEM) on top of an existing end-to-end ASR model. |
Q. Li; et al. |
1280 | Learning Word-Level Confidence for Subword End-To-End ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes and studies two confidence models of increasing complexity to solve this problem. |
D. Qiu; et al. |
1281 | Neural Utterance Confidence Measure for RNN-Transducers and Two Pass Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose methods to compute confidence score on the predictions made by an end-to-end speech recognition model in a 2-pass framework. |
A. Gupta; et al. |
1282 | Detecting Adversarial Attacks on Audiovisual Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an efficient and straightforward detection method based on the temporal correlation between audio and video streams. |
P. Ma; S. Petridis; M. Pantic; |
1283 | REDAT: Accent-Invariant Representation for End-To-End ASR By Domain Adversarial Training with Relabeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by the proof of equivalence, we introduce reDAT, a novel technique based on DAT, which relabels data using either unsupervised clustering or soft labels. |
H. Hu; et al. |
1284 | AISpeech-SJTU ASR System for The Accented English Speech Recognition Challenge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the AISpeech-SJTU ASR system for the Interspeech-2020 Accented English Speech Recognition Challenge (AESRC). |
T. Tan; Y. Lu; R. Ma; S. Zhu; J. Guo; Y. Qian; |
1285 | End-To-End Multi-Accent Speech Recognition with Unsupervised Accent Modelling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to grapple with such an issue, we first investigate and improve the current mainstream end-to-end multi-accent speech recognition technologies. In addition, we propose two unsupervised accent modelling methods, which convert accent information into a global embedding, and use it to improve the performance of the end-to-end multi-accent speech recognition systems. |
S. Li; B. Ouyang; D. Liao; S. Xia; L. Li; Q. Hong; |
1286 | A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer’s Disease Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a comparative study of different acoustic and linguistic features for the AD detection using various classifiers. |
J. Li; et al. |
1287 | Synthesis of New Words for Improved Dysarthric Speech Recognition on An Expanded Vocabulary Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a data augmentation method using voice conversion that allows dysarthric ASR systems to accurately recognize words outside of the training set vocabulary. |
J. Harvill; D. Issa; M. Hasegawa-Johnson; C. Yoo; |
1288 | Development of The Cuhk Elderly Speech Recognition System for Neurocognitive Disorder Detection Using The Dementiabank Corpus Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents the development of a state-of-the-art automatic speech recognition (ASR) system built on the Dementia-Bank Pitt corpus for automatic NCD detection. |
Z. Ye; et al. |
1289 | Portable Photoglottography for Monitoring Vocal Fold Vibrations in Speech Production Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper is to realize a portable PGG (P-PGG) module with an audio interface to record glottal and speech waveforms simultaneously with ease. |
Y. Chi; K. Honda; J. Wei; |
1290 | Improving Ultrasound Tongue Contour Extraction Using U-Net and Shape Consistency-Based Regularizer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address the faint or missing contours in the sequence, we explore the shape consistency-based regularizer, which can take sequential information into account. |
M. Feng; Y. Wang; K. Xu; H. Wang; B. Ding; |
1291 | Impact of Speaking Rate on The Source Filter Interaction in Speech: A Study Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we examine how the drop in pitch alters when such a VCV sequence is spoken at three different speaking rates – slow, normal and fast. |
T. Purohit; A. R. MV; P. Kumar Ghosh; |
1292 | A Two-Stage Deep Modeling Approach to Articulatory Inversion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a two-stage deep feed-forward neural network (DNN) to tackle the acoustic-to-articulatory inversion (AAI) problem. |
A. S. Shahrebabaki; N. Olfati; A. S. Imran; M. Hallstein Johnsen; S. M. Siniscalchi; T. Svendsen; |
1293 | Acoustic-to-Articulatory Inversion for Dysarthric Speech By Using Cross-Corpus Acoustic-Articulatory Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we focus on estimating articulatory movements from acoustic features, known as acoustic-to-articulatory inversion (AAI), for dysarthric patients with amyotrophic lateral sclerosis (ALS). |
S. K. Maharana; et al. |
1294 | Speaking Rate and Tonal Realization in Mandarin Chinese: What Can We Learn From Large Speech Corpora? Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our analysis found two differences for slower speaking rates: (1) lower static tones and (2) more change for dynamic tones. |
J. Yuan; K. Church; |
1295 | Humanacgan: Conditional Generative Adversarial Network with Human-Based Auxiliary Classifier and Its Evaluation in Phoneme Perception Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a conditional generative adversarial network (GAN) incorporating humans? perceptual evaluations. |
Y. Ueda; K. Fujii; Y. Saito; S. Takamichi; Y. Baba; H. Saruwatari; |
1296 | Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a novel approach using a temporal convolutional attention network (TCAN) is proposed to tackle this problem. |
Q. Huang; T. Hain; |
1297 | Generative Speech Coding with Predictive Variance Regularization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce predictive-variance regularization to reduce the sensitivity to outliers, resulting in a significant increase in performance. |
W. B. Kleijn; et al. |
1298 | How to Make Text-to-Speech System Pronounce Voldemort: An Experimental Approach of Foreign Word Phonemization in Vietnamese Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Generating foreign words is one of the hardest tasks for any speech synthesis systems. This work deal with this problem in the case of Vietnamese, a low-resourced language, following an experimental approach. |
D. -K. MAC; V. -H. NGUYEN; D. -N. NGUYEN; K. -A. NGUYEN; |
1299 | How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers? Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel evaluation methodology using synthesized rakugo speech and real rakugo speech uttered by professional performers of three different ranks. |
S. Kato; Y. Yasuda; X. Wang; E. Cooper; J. Yamagishi; |
1300 | Dnsmos: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a multi-stage self-teaching based perceptual objective metric that is designed to evaluate noise suppressors. |
C. K. A. Reddy; V. Gopal; R. Cutler; |
1301 | A Causal Deep Learning Framework for Classifying Phonemes in Cochlear Implants Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a causal deep learning framework for classifying phonemes using features extracted at the time-frequency resolution of a CI processor. |
K. Chu; L. Collins; B. Mainsah; |
1302 | Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a speaker-attributed minimum Bayes risk (SA-MBR) training method where the parameters are trained to directly minimize the expected SA-WER over the training data. |
N. Kanda; et al. |
1303 | Mutually-Constrained Monotonic Multihead Attention for Online ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we remove the discrepancy between training and test phases by considering, in the training of MMA, the interactions across multiple heads that will occur in the test time. |
J. Song; H. Shim; E. Yang; |
1304 | The Use of Voice Source Features for Sung Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we ask whether vocal source features (pitch, shimmer, jitter, etc) can improve the performance of automatic sung speech recognition, arguing that conclusions previously drawn from spoken speech studies may not be valid in the sung speech domain. |
G. R. Dabike; J. Barker; |
1305 | A Parallelizable Lattice Rescoring Strategy with Neural Language Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a parallel computation strategy and a posterior-based lattice expansion algorithm for efficient lattice rescoring with neural language models (LMs) for automatic speech recognition. |
K. Li; D. Povey; S. Khudanpur; |
1306 | Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel decentralized feature extraction approach in federated learning to address privacy-preservation issues for speech recognition. |
C. -H. H. Yang; et al. |
1307 | Cif-Based Collaborative Decoding for End-to-End Contextual Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on incorporating contextual information into the continuous integrate-and-fire (CIF) based model that supports contextual biasing in a more controllable fashion. |
M. Han; L. Dong; S. Zhou; B. Xu; |
1308 | Hubert: How Much Can A Bad Teacher Benefit ASR Pre-Training? Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose the Hidden-Unit BERT (HUBERT) model which utilizes a cheap k-means clustering step to provide aligned target labels for pre-training of a BERT model. |
W. -N. Hsu; Y. -H. H. Tsai; B. Bolte; R. Salakhutdinov; A. Mohamed; |
1309 | A Further Study of Unsupervised Pretraining for Transformer Based Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we conduct a further study on MPC and focus on three important aspects: the effect of pretraining data speaking style, its extension on streaming model, and strategies for better transferring learned knowledge from pretraining stage to downstream tasks. |
D. Jiang; et al. |
1310 | Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a method to pre-train transformer-based encoder-decoder automatic speech recognition (ASR) models using sufficient target-domain text. |
C. Gao; G. Cheng; R. Yang; H. Zhu; P. Zhang; Y. Yan; |
1311 | Semi-Supervised Speech Recognition Via Graph-Based Temporal Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a generalized form of the connectionist temporal classification (CTC) objective that accepts a graph representation of the training labels. |
N. Moritz; T. Hori; J. L. Roux; |
1312 | Unsupervised Domain Adaptation for Speech Recognition Via Uncertainty Driven Self-Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we show that self-training (ST) combined with an uncertainty-based pseudo-label filtering approach can be effectively used for domain adaptation. |
S. Khurana; N. Moritz; T. Hori; J. L. Roux; |
1313 | Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel and effective learning method by leveraging a non-streaming ASR model as a teacher to generate transcripts on an arbitrarily large data set, which is then used to distill knowledge into streaming ASR models. |
T. Doutre; et al. |
1314 | Speech Bert Embedding for Improving Prosody in Neural TTS Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a speech BERT model to extract embedded prosody information in speech segments for improving the prosody of synthesized speech in neural text-to-speech (TTS). |
L. Chen; Y. Deng; X. Wang; F. K. Soong; L. He; |
1315 | Bi-Level Style and Prosody Decoupling Modeling for Personalized End-to-End Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a bi-level function decoupling framework to realise separate modeling and controlling for solving above problems. |
R. Fu; J. Tao; Z. Wen; J. Yi; T. Wang; C. Qiang; |
1316 | Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce Kathaka, a model trained with a novel two-stage training process for neural speech synthesis with contextually appropriate prosody. |
S. Karlapati; et al. |
1317 | Camp: A Two-Stage Approach to Modelling Prosody in Context Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: There are two major issues faced when modelling prosody: (1) prosody varies at a slower rate compared with other content in the acoustic signal (e.g. segmental information and background noise); (2) determining appropriate prosody without sufficient context is an ill-posed problem. In this paper, we propose solutions to both these issues. |
Z. Hodari; et al. |
1318 | Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an unsupervised multi-style speech synthesis method that can be trained with limited data. |
S. Liang; C. Miao; M. Chen; J. Ma; S. Wang; J. Xiao; |
1319 | Fastpitch: Parallel Text-to-Speech with Pitch Prediction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present FastPitch, a fully-parallel text-to-speech model based on FastSpeech, conditioned on fundamental frequency contours. |
A. Lancucki; |
1320 | Low-Resource Expressive Text-To-Speech Using Data Augmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a novel 3-step methodology to circumvent the costly operation of recording large amounts of target data in order to build expressive style voices with as little as 15 minutes of such recordings. |
G. Huybrechts; T. Merritt; G. Comini; B. Perz; R. Shah; J. Lorenzo-Trueba; |
1321 | TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a text-to-speech (TTS)-driven data augmentation method for improving the quality of a non-autoregressive (AR) TTS system. |
M. -J. Hwang; R. Yamamoto; E. Song; J. -M. Kim; |
1322 | A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, we propose the following method to successfully train an end-to-end TTS model with limited broadcast data. |
H. Bae; J. -S. Bae; Y. -S. Joo; Y. -I. Kim; H. -Y. Cho; |
1323 | Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a method for obtaining disentangled speaker and language representations via mutual information minimization and domain adaptation for cross-lingual text-to-speech (TTS) synthesis. |
D. Xin; T. Komatsu; S. Takamichi; H. Saruwatari; |
1324 | Adaspeech 2: Adaptive Text to Speech with Untranscribed Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop AdaSpeech 2, an adaptive TTS system that only leverages untranscribed speech data for adaptation. |
Y. Yan; et al. |
1325 | Investigation of Fast and Efficient Methods for Multi-Speaker Modeling and Speaker Adaptation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel method for fast and efficient few-shot TTS task, which is able to disentangle linguistic and speaker representations. |
Y. Zheng; X. Li; L. Lu; |
1326 | ICASSP 2021 Deep Noise Suppression Challenge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The Deep Noise Suppression (DNS) challenge is designed to foster innovation in the area of noise suppression to achieve superior perceptual speech quality. |
C. K. A. Reddy; et al. |
1327 | ICASSP 2021 Deep Noise Suppression Challenge: Decoupling Magnitude and Phase Optimization with A Two-Stage Deep Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a novel system for denoising in the complicated applications, which is mainly comprised of two pipelines, namely a two-stage network and a post-processing module. |
A. Li; W. Liu; X. Luo; C. Zheng; X. Li; |
1328 | Fullsubnet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a full-band and sub-band fusion model, named as FullSubNet, for single-channel real-time speech enhancement. |
X. Hao; X. Su; R. Horaud; X. Li; |
1329 | Densely Connected Multi-Stage Model with Channel Wise Subband Feature for Real-Time Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a computationally efficient real-time speech enhancement network with densely connected multi-stage structures, which progressively enhances the channel-wise subband speech. |
J. Li; et al. |
1330 | A Modulation-Domain Loss for Neural-Network-Based Real-Time Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe a modulation-domain loss function for deep-learning-based speech enhancement systems. |
T. Vuong; Y. Xia; R. M. Stern; |
1331 | Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a complex convolutional block attention module (CCBAM) to boost the representation power of the complex-valued convolutional layers by constructing more informative features. |
S. Zhao; T. H. Nguyen; B. Ma; |
1332 | Audio-Visual Speech Inpainting with Deep Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a deep-learning-based framework for audio-visual speech inpainting, i.e., the task of restoring the missing parts of an acoustic speech signal from reliable audio context and uncorrupted visual information. |
G. Morrone; D. Michelsanti; Z. -H. Tan; J. Jensen; |
1333 | Vset: A Multimodal Transformer for Visual Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we challenge this common belief and show that an audio-visual transformer can significantly improve AVSE performance, by learning the long-term dependency of both intra-modality and inter-modality. |
K. Ramesh; C. Xing; W. Wang; D. Wang; X. Chen; |
1334 | Switching Variational Auto-Encoders for Noise-Agnostic Audio-Visual Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to find the optimal combination of these two architectures through time. |
M. Sadeghi; X. Alameda-Pineda; |
1335 | Audio-Visual Speech Enhancement Method Conditioned in The Lip Motion and Speaker-Discriminative Embeddings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an audio-visual speech enhancement (AVSE) method conditioned both on the speaker?s lip motion and on speaker-discriminative embeddings. |
K. Ito; M. Yamamoto; K. Nagamatsu; |
1336 | Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an audio-visual speech separation learning method that considers the correspondence between the separated signals and the visual signals to reflect the speech characteristics during training. |
N. Makishima; M. Ihori; A. Takashima; T. Tanaka; S. Orihashi; R. Masumura; |
1337 | Muse: Multi-Modal Target Speaker Extraction with Visual Cues Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a multi-modal speaker extraction network, named MuSE, that is conditioned only on a lip image sequence. |
Z. Pan; R. Tao; C. Xu; H. Li; |
1338 | An Effective Deep Embedding Learning Method Based on Dense-Residual Networks for Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an effective end-to-end deep embedding learning method based on Dense-Residual networks, which combine the advantages of a densely connected convolutional network (DenseNet) and a residual network (ResNet), for speaker verification (SV). |
Y. Liu; Y. Song; I. McLoughlin; L. Liu; L. -r. Dai; |
1339 | Time-Domain Speaker Verification Using Temporal Convolutional Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a speaker verification system that takes the time-domain raw waveforms as inputs, which adopts a learnable encoder and temporal convolutional networks (TCNs) that have shown impressive performance in speech separation. |
S. Han; J. Byun; J. W. Shin; |
1340 | Towards Robust Speaker Verification with Target Speaker Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes the target speaker enhancement based speaker verification network (TASE-SVNet), an all neural model that couples target speaker enhancement and speaker embedding extraction for robust speaker verification (SV). |
C. Zhang; M. Yu; C. Weng; D. Yu; |
1341 | A Joint Training Framework of Multi-Look Separator and Speaker Embedding Extractor for Overlapped Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a joint training framework of the front-end multi-look speech separator and the back-end speaker embedding extractor is proposed for multi-channel overlapped speech. |
N. Zheng; et al. |
1342 | Cam: Context-Aware Masking for Robust Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose context- aware masking (CAM), a novel method to extract robust speaker embedding. |
Y. -Q. Yu; S. Zheng; H. Suo; Y. Lei; W. -J. Li; |
1343 | Short-Time Spectral Aggregation for Speaker Embedding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces short-time spectral pooling (STSP) for better aggregation of frame-level information. |
Y. Tu; M. -W. Mak; |
1344 | Contrastive Self-Supervised Learning for Text-Independent Speaker Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We proposed channel-invariant loss to prevent the network from encoding the undesired channel information into the speaker representation. |
H. Zhang; Y. Zou; H. Wang; |
1345 | Adversarial Defense for Automatic Speaker Verification By Cascaded Self-Supervised Learning Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Hence, with the goal of effective defense in ASV against adversarial attacks, we propose a standard and attack-agnostic method based on cascaded self-supervised learning models to purify the adversarial perturbations. |
H. Wu; X. Li; A. T. Liu; Z. Wu; H. Meng; H. -y. Lee; |
1346 | Self-Supervised Text-Independent Speaker Verification Using Prototypical Momentum Contrastive Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we investigate self-supervised representation learning for speaker verification (SV). |
W. Xia; C. Zhang; C. Weng; M. Yu; D. Yu; |
1347 | An Iterative Framework for Self-Supervised Deep Speaker Representation Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an iterative framework for self-supervised speaker representation learning based on a deep neural network (DNN). |
D. Cai; W. Wang; M. Li; |
1348 | Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Thus, in this paper, we evaluate speaker embeddings learned by training the spectrogram prediction network under unsupervised and semi-supervised scenarios. |
J. Cho; P. Zelasko; J. Villalba; N. Dehak; |
1349 | Speech Acoustic Modelling from Raw Phase Spectrum Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the possibility and efficacy of acoustic modelling using the raw short-time phase spectrum. |
E. Loweimi; Z. Cvetkovic; P. Bell; S. Renals; |
1350 | An Investigation of Using Hybrid Modeling Units for Improving End-to-End Speech Recognition System Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper uses a hybrid of the syllable, Chinese character, and subword as the modeling units for the end-to-end speech recognition system based on the CTC/attention multi-task learning. |
S. Chen; X. Hu; S. Li; X. Xu; |
1351 | Federated Acoustic Modeling for Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate federated acoustic modeling using data from multiple clients. |
X. Cui; S. Lu; B. Kingsbury; |
1352 | Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we propose an enhanced ASR-TTS (EAT) model that incorporates two main features: 1) The ASR?TTS direction is equipped with a language model reward to penalize the ASR hypotheses before forwarding it to TTS. |
M. K. Baskar; L. Burget; S. Watanabe; R. F. Astudillo; J. . Cernock�; |
1353 | Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a range of neural architecture search (NAS) techniques are used to automatically learn two types of hyper-parameters of state-of-the-art factored time delay neural networks (TDNNs): i) the left and right splicing context offsets; and ii) the dimensionality of the bottleneck linear projection at each hidden layer. |
S. Hu; et al. |
1354 | Hypothesis Stitcher for End-to-End Speaker-Attributed ASR on Long-Form Multi-Talker Recordings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we first apply a known decoding technique that was developed to perform single-speaker ASR for long-form audio to our E2E SA-ASR task. Then, we propose a novel method using a sequence-to-sequence model, called hypothesis stitcher. |
X. Chang; N. Kanda; Y. Gaur; X. Wang; Z. Meng; T. Yoshioka; |
1355 | Ensemble Combination Between Different Time Segmentations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes to generalise hypothesis-level combination, allowing the use of different audio segmentation times between the models, by splitting and re-joining the hypothesised N-best lists in time. |
J. H. M. Wong; et al. |
1356 | Streaming End-to-End Speech Recognition with Jointly Trained Neural Feature Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a streaming end-to-end speech recognition model based on Monotonic Chunkwise Attention (MoCha) jointly trained with enhancement layers. |
C. Kim; A. Garg; D. Gowda; S. Mun; C. Han; |
1357 | Transformer in Action: A Comparative Study of Transformer-Based Acoustic Models for Large Scale Speech Recognition Applications Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we summarize the application of transformer and its streamable variant, Emformer based acoustic model [1] for large scale speech recognition applications. |
Y. Wang; et al. |
1358 | Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an efficient memory transformer Emformer for low latency streaming speech recognition. |
Y. Shi; et al. |
1359 | Learned Transferable Architectures Can Surpass Hand-Designed Architectures for Large Scale Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore the neural architecture search (NAS) for automatic speech recognition (ASR) systems. |
L. He; D. Su; D. Yu; |
1360 | Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose novel multitask learning, joint optimization, and joint decoding methods for transformer-RNN-transducer systems. |
J. -J. Jeon; E. Kim; |
1361 | SEP-28k: A Dataset for Stuttering Event Detection from Podcasts with People Who Stutter Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce Stuttering Events in Podcasts (SEP-28k), a dataset containing over 28k clips labeled with five event types including blocks, prolongations, sound repetitions, word repetitions, and interjections. |
C. Lea; V. Mitra; A. Joshi; S. Kajarekar; J. P. Bigham; |
1362 | A Hybrid CNN-BiLSTM Voice Activity Detector Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a new hybrid architecture for voice activity detection (VAD) incorporating both convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM) layers trained in an end-to-end manner. |
N. Wilkinson; T. Niesler; |
1363 | Self-Attentive VAD: Context-Aware Detection of Voice from Noise Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To cope with this issue with the self-attention mechanism and achieve a simple, powerful, and environment-robust VAD, we first adopt the self-attention architecture in building up the modules for voice detection and boosted prediction. |
Y. R. Jo; Y. Ki Moon; W. I. Cho; G. Sik Jo; |
1364 | Preventing Early Endpointing for Online Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to address the early endpointing problem from the gradient perspective. |
Y. Zhao; C. Ni; C. -C. Leung; S. Joty; E. S. Chng; B. Ma; |
1365 | MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present MarbleNet, an end-to-end neural network for Voice Activity Detection (VAD). |
F. Jia; S. Majumdar; B. Ginsburg; |
1366 | Speech Enhancement Aided End-To-End Multi-Task Learning for Voice Activity Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, here we propose a speech enhancement aided end-to-end multi-task model for VAD. |
X. Tan; X. -L. Zhang; |
1367 | Robust Voice Activity Detection Using A Masked Auditory Encoder Based Convolutional Neural Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we propose a robust VAD approach using a masked auditory encoder based convolutional neural network (M-AECNN). |
N. Li; et al. |
1368 | A Stage Match for Query-by-Example Spoken Term Detection Based On Structure Information of Query Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a stage match strategy based on the structure information of the query, represented with the unvoiced-voiced attribute of the portions in itself. |
J. Zhan; Q. He; J. Su; Y. Li; |
1369 | Knowledge Transfer for Efficient On-Device False Trigger Mitigation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address the task of determining whether a given utterance is directed towards a voice-enabled smart-assistant device or not. |
D. Dighe; E. Marchi; S. Vishnubhotla; S. Kajarekar; D. Naik; |
1370 | Progressive Voice Trigger Detection: Accuracy Vs Latency Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The main idea in this work is to exploit information in words that immediately follow the trigger phrase. |
S. Sigtia; J. Bridle; H. Richards; P. Clark; E. Marchi; V. Garg; |
1371 | Dynamic Curriculum Learning Via Data Parameters for Noise Robust Keyword Spotting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Similarly, in this paper, we propose using this curriculum learning approach for acoustic modeling, and train an acoustic model on clean and noisy utterances with the data parameters. |
T. Higuchi; S. Saxena; M. Souden; T. D. Tran; M. Delfarah; C. Dhir; |
1372 | CNN-Based Spoken Term Detection and Localization Without Dynamic Programming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a spoken term detection algorithm for simultaneous prediction and localization of in-vocabulary and out-of-vocabulary terms within an audio segment. |
T. S. Fuchs; Y. Segal; J. Keshet; |
1373 | Query-By-Example Keyword Spotting System Using Multi-Head Attention and Soft-triple Loss Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a neural network architecture for tackling the query-by-example user-defined keyword spotting task. |
J. Huang; W. Gharbieh; H. S. Shim; E. Kim; |
1374 | A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the present work we further investigate this connection and examine the interplay between the two problems. |
O. Braga; O. Siohan; |
1375 | Generalized Knowledge Distillation from An Ensemble of Specialized Teachers Leveraging Unsupervised Neural Clustering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an improved generalized knowledge distillation framework with multiple dissimilar teacher networks, each of which is specialized for a specific domain, to make a deployable student network more robust to challenging acoustic environments. |
T. Fukuda; G. Kurata; |
1376 | Multistream CNN for Robust Acoustic Modeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes multistream CNN, a novel neural network architecture for robust acoustic modeling in speech recognition tasks. |
K. J. Han; J. Pan; V. K. N. Tadala; T. Ma; D. Povey; |
1377 | Improved Robustness to Disfluencies in Rnn-Transducer Based Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate data selection and preparation choices aiming for improved robustness of RNN-T ASR to speech disfluencies with a focus on partial words. |
V. Mendelev; T. Raissi; G. Camporese; M. Giollo; |
1378 | Representation Learning for Speech Recognition Using Feedback Based Relevance Weighting Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an acoustic embedding based approach for representation learning in speech recognition. |
P. Agrawal; S. Ganapathy; |
1379 | Towards Data Selection on TTS Data for Children�s Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we adopt text-to-speech data augmentation to improve the performance of children?s speech recognition system. |
W. Wang; Z. Zhou; Y. Lu; H. Wang; C. Du; Y. Qian; |
1380 | An Investigation of End-to-End Models for Robust Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We address this gap and present a detailed comparison of speech enhancement-based techniques and three different model-based adaptation techniques covering data augmentation, multi-task learning, and adversarial learning for robust ASR. |
A. Prasad; P. Jyothi; R. Velmurugan; |
1381 | End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we focus on the multichannel multi-speaker reverberant condition, and propose to extend our previous framework for end-to-end dereverberation, beamforming, and speech recognition with improved numerical stability and advanced frontend subnetworks including voice activity detection like masks. |
W. Zhang; et al. |
1382 | Streaming Multi-Speaker ASR with RNN-T Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate two approaches to multi-speaker model training of the RNN-T: deterministic output-target assignment and permutation invariant training. |
I. Sklyar; A. Piunova; Y. Liu; |
1383 | Improving RNN Transducer with Target Speaker Extraction and Neural Uncertainty Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents a joint framework that combines time-domain target-speaker speech extraction and Recurrent Neural Network Transducer (RNN-T). |
J. Shi; C. Zhang; C. Weng; S. Watanabe; M. Yu; D. Yu; |
1384 | A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a progressive learning-based adaptive noise and speech estimation (PL-ANSE) method for speech preprocessing in noisy speech recognition, leveraging upon a frame-level noise tracking capability of improved minima controlled recursive averaging (IMCRA) and an utterance-level deep progressive learning of nonlinear interactions between speech and noise. |
Z. Nian; Y. -H. Tu; J. Du; C. -H. Lee; |
1385 | The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The Accented English Speech Recognition Challenge (AESRC2020) is designed for providing a common testbed and promoting accent-related research. |
X. Shi; et al. |
1386 | Comparative Study of Different Epoch Extraction Methods for Speech Associated with Voice Disorders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, this study aimed to compare the various algorithms for detecting epoch locations from the speech associated with voice disorders. |
P. Barche; K. Gurugubelli; A. K. Vuppala; |
1387 | Have You Made A Decision? Where? A Pilot Study on Interpretability of Polarity Analysis Based on Advising Problem Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study the problem of adding interpretability to the overall polarity by predicting the meta-polarity at the same time. |
T. Li; et al. |
1388 | Transformer Based Unsupervised Pre-Training for Acoustic Representation Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To handle this problem, we propose an unsupervised pre-training method using Transformer based encoder to learn a general and robust high-level representation for all acoustic tasks. |
R. Zhang; H. Wu; W. Li; D. Jiang; W. Zou; X. Li; |
1389 | A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we continue to investigate the use of machine learning for the automatic detection of glottal closure instants (GCIs) from raw speech. |
J. Matou�ek; D. Tihelka; |
1390 | Encoder-Decoder Based Pitch Tracking and Joint Model Training for Mandarin Tone Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose RNN based Encoder-Decoder framework with gating mechanism which underlying models both the state cost estimation and Viterbi back-tracing pass implemented in the RAPT algorithm. |
H. Huang; K. Wang; Y. Hu; S. Li; |
1391 | Construction of A Large-Scale Japanese ASR Corpus on TV Recordings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a new large-scale Japanese speech corpus for training automatic speech recognition (ASR) systems. |
S. Ando; H. Fujihara; |
1392 | NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we attempt to overcome this limitation by developing a new dataset which has speech data from five different Indian languages along with English. |
S. B. Kalluri; D. Vijayasenan; S. Ganapathy; R. R. M; P. Krishnan; |
1393 | Multilingual Phonetic Dataset for Low Resource Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a large multilingual phonetic dataset, which is preprocessed and aligned from the UCLA phonetic dataset. |
X. Li; D. R. Mortensen; F. Metze; A. W. Black; |
1394 | Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle this problem, we construct a new audio-visual age corpus named AgeVoxCeleb by annotating age labels to VoxCeleb2 videos. AgeVoxCeleb is the first large-scale, balanced, and multi-modal age corpus that contains both video and speech of the same speakers from a wide age range. |
N. Tawara; A. Ogawa; Y. Kitagishi; H. Kamiyama; |
1395 | Didispeech: A Large Scale Mandarin Speech Corpus Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a new open-sourced Mandarin speech corpus, called DiDiSpeech. |
T. Guo; et al. |
1396 | The In-the-Wild Speech Medical Corpus Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present the in-the-Wild Speech Medical (WSM) Corpus, a collection of in-the-wild videos, featuring subjects potentially affected by a SA disease – specifically, depression or Parkinson’s disease. |
J. Correia; F. Teixeira; C. Botelho; I. Trancoso; B. Raj; |
1397 | Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an adaptation method for end-to-end speech recognition. |
C. -T. Do; R. Doddipatla; T. Hain; |
1398 | Vowel Non-Vowel Based Spectral Warping and Time Scale Modification for Improvement in Children�s ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed a linear prediction based spectral warping method by using the knowledge of vowel and non-vowel regions in speech signals to mitigate the formant frequencies differences between child and adult speakers. |
H. Kathania; A. Kumar; M. Kurimo; |
1399 | Extending Parrotron: An End-to-End, Speech Conversion and Speech Recognition Model for Atypical Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an extended Parrotron model: a single, end-to-end network that enables voice conversion and recognition simultaneously. |
R. Doshi; et al. |
1400 | Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study proposes a novel technique for child ASR using both feature normalization and data augmentation methods based on the relationship between formants and fundamental frequency (fo). |
G. Yeung; R. Fan; A. Alwan; |
1401 | Analysis of X-Vectors for Low-Resource Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The paper presents a study of usability of x-vectors for adaptation of automatic speech recognition (ASR) systems. |
M. Karafi�t; et al. |
1402 | Refining Automatic Speech Recognition System for Older Adults Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: With 12 hours of training data, we attempt to develop an ASR system for socially isolated seniors (80+ years old) with possible cognitive impairments. |
L. Chen; M. Asgari; |
1403 | MixSpeech: Data Augmentation for Low-Resource Automatic Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose MixSpeech, a simple yet effective data augmentation method based on mixup for automatic speech recognition (ASR). |
L. Meng; J. Xu; X. Tan; J. Wang; T. Qin; B. Xu; |
1404 | End-to-End Multilingual Automatic Speech Recognition for Less-Resourced Languages: The Case of Four Ethiopian Languages Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We have, therefore, conducted ML E2E ASR experiments for four less-resourced Ethiopian languages using different language and acoustic modelling units. |
S. T. Abate; M. Y. Tachbelie; T. Schultz; |
1405 | Improved Data Selection for Domain Adaptation in ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We address the problem of domain adaptation with semi-supervised training (SST). |
S. Wotherspoon; W. Hartmann; M. Snover; O. Kimball; |
1406 | Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-Training and Its Application to Children�s ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a bidirectional unsupervised model pre-training (UPT) method and apply it to children?s automatic speech recognition (ASR). |
R. Fan; A. Afshan; A. Alwan; |
1407 | Meta-Adapter: Efficient Cross-Lingual Adaptation With Meta-Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to combine the adapter module with meta-learning algorithms to achieve high recognition performance under low-resource settings and improve the parameter-efficiency of the model. |
W. Hou; Y. Wang; S. Gao; T. Shinozaki; |
1408 | Error-Driven Fixed-Budget ASR Personalization for Accented Speakers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Given a speaker and an ASR model, we propose a method of identifying sentences for which the speaker?s utterances are likely to be harder for the given ASR model to recognize. |
A. Awasthi; A. Kansal; S. Sarawagi; P. Jyothi; |
1409 | Context-Aware Prosody Correction for Text-Based Speech Editing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In our work, we propose a new context-aware method for more natural sounding text-based editing of speech. |
M. Morrison; L. Rencker; Z. Jin; N. J. Bryan; J. -P. Caceres; B. Pardo; |
1410 | Fast DCTTS: Efficient Deep Convolutional Text-to-Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an end-to-end speech synthesizer, Fast DCTTS, that synthesizes speech in real time on a single CPU thread. |
M. Kang; J. Lee; S. Kim; I. Kim; |
1411 | Speech Prediction in Silent Videos Using Variational Autoencoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a stochastic model for generating speech in a silent video. |
R. Yadav; A. Sardana; V. P. Namboodiri; R. M. Hegde; |
1412 | Learning Disentangled Phone and Speaker Representations in A Semi-Supervised VQ-VAE Paradigm Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a new approach to disentangle speaker voice and phone content by introducing new components to the VQ-VAE architecture for speech synthesis. |
J. Williams; Y. Zhao; E. Cooper; J. Yamagishi; |
1413 | High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a high-intelligibility speech synthesis method for persons with dysarthria caused by athetoid cerebral palsy. |
K. Matsubara; et al. |
1414 | Denoispeech: Denoising Text to Speech with Frame-Level Noise Modeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop DenoiSpeech, a TTS system that can synthesize clean speech for a speaker with noisy speech data. |
C. Zhang; et al. |
1415 | Non-Autoregressive Sequence-To-Sequence Voice Conversion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel voice conversion (VC) method based on non-autoregressive sequence-to-sequence (NAR-S2S) models. |
T. Hayashi; W. -C. Huang; K. Kobayashi; T. Toda; |
1416 | PPG-Based Singing Voice Conversion with Adversarial Representation Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: On top of recent voice conversion works, we propose a novel model to steadily convert songs while keeping their naturalness and intonation. |
Z. Li; et al. |
1417 | Litesing: Towards Fast, Lightweight and Expressive Singing Voice Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: LiteSing proposed in this paper is a high-quality singing voice synthesis (SVS) system, which is fast, lightweight and expressive. |
X. Zhuang; T. Jiang; S. -Y. Chou; B. Wu; P. Hu; S. Lui; |
1418 | Semi-Supervised Learning for Singing Synthesis Timbre Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a semi-supervised singing synthesizer, which is able to learn new voices from audio data only, without any annotations such as phonetic segmentation. |
J. Bonada; M. Blaauw; |
1419 | Recurrent Phase Reconstruction Using Estimated Phase Derivatives from Deep Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose three changes for such a two-stage phase reconstruction system. |
L. Thieling; D. Wilhelm; P. Jax; |
1420 | Stable Checkpoint Selection and Evaluation in Sequence to Sequence Speech Synthesis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we propose a novel stability metric designed for automatic checkpoint selection based on incomplete utterance counts within a validation set. |
S. Shechtman; D. Haws; R. Fernandez; |
1421 | TSTNN: Two-Stage Transformer Based Neural Network for Speech Enhancement in The Time Domain Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a transformer-based architecture, called two-stage transformer neural network (TSTNN) for end-to-end speech denoising in the time domain. |
K. Wang; B. He; W. -P. Zhu; |
1422 | Self-Attention Generative Adversarial Network for Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To remedy this issue, we propose a self-attention layer adapted from non-local attention, coupled with the convolutional and deconvolutional layers of a speech enhancement GAN (SEGAN) using raw signal input. |
H. Phan; et al. |
1423 | Neural Kalman Filtering for Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we extend the conventional Kalman filtering (KF) and propose a supervised-learning based neural Kalman filter (NKF) for speech enhancement. |
W. Xue; G. Quan; C. Zhang; G. Ding; X. He; B. Zhou; |
1424 | Neural Noise Embedding for End-To-End Speech Enhancement with Conditional Layer Normalization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, a new normalization method, termed conditional layer normalization (CLN), is introduced to improve the generalization of deep learning based speech enhancement approaches for unseen environments. |
Z. Zhang; X. Li; Y. Li; Y. Dong; D. Wang; S. Xiong; |
1425 | Perceptual Loss Based Speech Denoising with An Ensemble of Audio Pattern Recognition and Self-Supervised Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a generalized framework called Perceptual Ensemble Regularization Loss (PERL) built on the idea of perceptual losses. |
S. Kataria; J. Villalba; N. Dehak; |
1426 | Towards An ASR Approach Using Acoustic and Language Models for Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to modify the speech estimation process, by treating speech enhancement as a classification problem in an ASR-style manner. |
K. M. Nayem; D. S. Williamson; |
1427 | A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer and Large Scale Synthetic Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Since our goal is to recognize the input speech, we consider enhancements which improve word error rates (WERs) when the predicted speech signal is passed to an automatic speech recognition (ASR) model. |
N. Howard; A. Park; T. Z. Shabestary; A. Gruenstein; R. Prabhavalkar; |
1428 | Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On Percepnet Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a system combining a traditional acoustic echo canceller, and a low-complexity joint residual echo and noise suppressor based on a hybrid signal processing/deep neural network (DSP/DNN) approach. |
J. -M. Valin; S. Tenneti; K. Helwani; U. Isik; A. Krishnaswamy; |
1429 | Acoustic Echo Cancellation with The Dual-Signal Transformation LSTM Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper applies the dual-signal transformation LSTM network (DTLN) to the task of real-time acoustic echo cancellation (AEC). |
N. L. Westhausen; B. T. Meyer; |
1430 | High Fidelity Speech Regeneration with Application to Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a wav-to-wav generative model for speech that can generate 24khz speech in a real-time manner and which utilizes a compact speech representation, composed of ASR and identity features, to achieve a higher level of intelligibility. |
A. Polyak; L. Wolf; Y. Adi; O. Kabeli; Y. Taigman; |
1431 | A Time-Domain Convolutional Recurrent Network for Packet Loss Concealment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate a time-domain convolutional recurrent network (CRN) for online packet loss concealment. |
J. Lin; Y. Wang; K. Kalgaonkar; G. Keren; D. Zhang; C. Fuegen; |
1432 | Cascaded Time + Time-Frequency Unet For Speech Enhancement: Jointly Addressing Clipping, Codec Distortions, And Gaps Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we first systematically study and achieve state of the art results on each of these three distortions individually. Next, we demonstrate a neural network pipeline that cascades a time domain convolutional neural network with a time-frequency domain convolutional neural network to address all three distortions jointly. |
A. A. Nair; K. Koishida; |
1433 | Hidden Markov Model Diarisation with Speaker Location Information Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This report proposes to extend the Hidden Markov Model (HMM) clustering method, to enable the use of speaker location information. |
J. H. M. Wong; X. Xiao; Y. Gong; |
1434 | Compositional Embedding Models for Speaker Identification and Diarization with Simultaneous Speech From 2+ Speakers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new method for speaker diarization that can handle overlapping speech with 2+ people. |
Z. Li; J. Whitehill; |
1435 | Content-Aware Speaker Embeddings for Speaker Diarisation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, the content-aware speaker embeddings (CASE) approach is proposed, which extends the input of the speaker classifier to include not only acoustic features but also their corresponding speech content, via phone, character, and word embeddings. |
G. Sun; D. Liu; C. Zhang; P. C. Woodland; |
1436 | Multi-Scale Speaker Diarization with Neural Affinity Score Fusion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an unconventional method that tackles the trade-off between temporal resolution and the quality of the speaker representations. |
T. J. Park; M. Kumar; S. Narayanan; |
1437 | A Comparison Study on Infant-Parent Voice Diarization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We design a framework for studying prelinguistic child voice from 3 to 24 months based on state-of-the-art algorithms in diarization. |
J. Zhu; M. Hasegawa-Johnson; N. L. McElwain; |
1438 | End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an end-to-end deep network model that performs meeting diarization from single-channel audio recordings. |
S. Maiti; H. Erdogan; K. Wilson; S. Wisdom; S. Watanabe; J. R. Hershey; |
1439 | End-To-End Speaker Diarization As Post-Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To compensate for each other?s weakness, we propose to use a two-speaker end-to-end diarization method as post-processing of the results obtained by a clustering-based method. |
S. Horiguchi; P. Garc�a; Y. Fujita; S. Watanabe; K. Nagamatsu; |
1440 | BW-EDA-EEND: Streaming END-TO-END Neural Speaker Diarization for A Variable Number of Speakers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel online end-to-end neural diarization system, BW-EDA-EEND, that processes data incrementally for a variable number of speakers. |
E. Han; C. Lee; A. Stolcke; |
1441 | Integrating End-to-End Neural and Clustering-Based Diarization: Getting The Best of Both Worlds Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a simple but effective hybrid diarization framework that works with overlapped speech and for long recordings containing an arbitrary number of speakers. |
K. Kinoshita; M. Delcroix; N. Tawara; |
1442 | Siamese Capsule Network for End-to-End Speaker Recognition in The Wild Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an end-to-end deep model for speaker verification in the wild. |
A. Hajavi; A. Etemad; |
1443 | A Real-Time Speaker Diarization System Based on Spatial Spectrum Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we describe a speaker diarization system that enables localization and identification of all speakers present in a conversation or meeting. |
S. Zheng; W. Huang; X. Wang; H. Suo; J. Feng; Z. Yan; |
1444 | Unsupervised Neural Adaptation Model Based on Optimal Transport for Spoken Language Identification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an unsupervised neural adaptation model to deal with the distribution mismatch problem for SLID. |
X. Lu; P. Shen; Y. Tsao; H. Kawai; |
1445 | Joint ASR and Language Identification Using RNN-T: An Efficient Approach to Dynamic Language Switching Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Since this solution is neither scalable nor cost- and memory-efficient, especially for on-device applications, we propose end-to-end, streaming, joint ASR-LID architectures based on the recurrent neural network transducer framework. |
S. Punjabi; et al. |
1446 | Spoken Language Identification in Unseen Target Domain Using Within-Sample Similarity Loss Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, we propose an auxiliary within-sample similarity loss (WSSL) which encourages the network to suppress the channel-specific contents in the speech. |
M. H; S. Kapoor; D. A. Dinesh; P. Rajan; |
1447 | Exploring The Use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Since the visual rendering of these characters is different, in this paper, we explore the benefits of representing such similar target subword units (e.g., Byte Pair Encoded(BPE) units) through a Common Label Set (CLS). |
V. M. Shetty; S. Umesh; |
1448 | Phone Distribution Estimation for Low Resource Languages Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel approach to estimate phone distributions by only requiring raw audio datasets: We first estimate the phone ranks by combining language-independent recognition results and Learning to Rank results. |
X. Li; J. Li; J. Yao; A. W. Black; F. Metze; |
1449 | How Phonotactics Affect Multilingual and Zero-Shot ASR Performance Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We show that the gain from modeling crosslingual phonotactics is limited, and imposing a too strong model can hurt the zero-shot transfer. |
S. Feng; et al. |
1450 | Modelling Paralinguistic Properties in Conversational Speech to Detect Bipolar Disorder and Borderline Personality Disorder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we investigate the automatic detection of these two conditions by modelling both verbal and non-verbal cues in a set of interviews. |
B. Wang; Y. Wu; N. Vaci; M. Liakata; T. Lyons; K. E. A. Saunders; |
1451 | An Attention Model for Hypernasality Prediction in Children with Cleft Palate Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an attention-based bidirectional long-short memory (BLSTM) model that directly maps the frame-level features to utterance-level ratings by focusing only on specific speech frames carrying hyper-nasal cues. |
V. C. Mathad; N. Scherer; K. Chapman; J. Liss; V. Berisha; |
1452 | An End-to-End Speech Accent Recognition Method Based on Hybrid CTC/Attention Transformer ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel accent recognition system in the framework of a transformer-based end-to-end speech recognition system. |
Q. Gao; H. Wu; Y. Sun; Y. Duan; |
1453 | Multi-Task Estimation of Age and Cognitive Decline from Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, multi-task learning is applied for the joint estimation of age and the Mini-Mental Status Evaluation criteria (MMSE) commonly used to assess cognitive decline. |
Y. Pan; V. S. Nallanthighal; D. Blackburn; H. Christensen; A. H�rm�; |
1454 | Deepemocluster: A Semi-Supervised Framework for Latent Cluster Representation of Speech Emotions Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we introduce a new SSL framework, which we refer to as the DeepEmoCluster framework, for attribute-based SER tasks. |
W. -C. Lin; K. Sridhar; C. Busso; |
1455 | The Role of Task and Acoustic Similarity in Audio Transfer Learning: Insights from The Speech Emotion Recognition Case Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We perform a similar investigation for the case of speech emotion recognition (SER), and conclude that transfer learning for SER is influenced both by the choice of pre-training task and by the differences in acoustic conditions between the upstream and downstream data sets, with the former having a bigger impact. |
A. Triantafyllopoulos; B. W. Schuller; |
1456 | Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and A Large Corpus Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We explore a speech-based transfer learning approach that uses a lightweight encoder and that transfers only the encoder weights, enabling a simplified run-time model. |
A. Harati; E. Shriberg; T. Rutowski; P. Chlebek; Y. Lu; R. Oliveira; |
1457 | Estimating Severity of Depression From Acoustic Features and Embeddings of Natural Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we examine two novel approaches for improving depression severity estimation from short audio recordings of speech. |
S. H. Dumpala; S. Rempel; K. Dikaios; M. Sajjadian; R. Uher; S. Oore; |
1458 | Automatic Elicitation Compliance for Short-Duration Speech Based Depression Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this multi-corpus study of over 25,000 ?pataka? utterances, it was discovered that speech landmark- based features were sensitive to the number of ?pataka? utterances per recording. |
B. Stasak; Z. Huang; D. Joachim; J. Epps; |
1459 | Deep Neural Network Embeddings for The Estimation of The Degree of Sleepiness Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we employ the x-vector approach, currently the state-of-the-art in speaker recognition, as a neural network feature extractor to detect the level of sleepiness of a speaker. |
J. V. Egas-L�pez; G. Gosztolya; |
1460 | Pause-Encoded Language Models for Recognition of Alzheimer�s Disease and Emotion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose enhancing Transformer language models (BERT, RoBERTa) to take advantage of pauses. |
J. Yuan; X. Cai; K. Church; |
1461 | End-2-End Modeling of Speech and Gait from Patients with Parkinson�s Disease: Comparison Between High Quality Vs. Smartphone Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose the use of state-of-the-art deep learning techniques to evaluate the speech and gait symptoms of patients. |
J. C. Vasquez-Correa; T. Arias-Vergara; P. Klumpp; P. A. Perez-Toro; J. R. Orozco-Arroyave; E. N�th; |
1462 | A Sequential Contrastive Learning Framework for Robust Dysarthric Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a contrastive learning framework for robust dysarthric speech recognition (DSR) by capturing the dysarthric speech variability. |
L. Wu; D. Zong; S. Sun; J. Zhao; |
1463 | Automatic And Perceptual Discrimination Between Dysarthria, Apraxia of Speech, and Neurotypical Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate a three-class automatic technique and a set of handcrafted features for the discrimination of dysarthria, AoS and neurotypical speech. |
I. Kodrasi; M. Pernon; M. Laganaro; H. Bourlard; |
1464 | Effect of Noise and Model Complexity on Detection of Amyotrophic Lateral Sclerosis and Parkinson�s Disease Using Pitch and MFCC Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study aims to examine the robustness of these cues against background noise and model complexity, which has not been investigated before. |
T. Bhattacharjee; et al. |
1465 | Multi-Task Transformer with Input Feature Reconstruction for Dysarthric Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a multi-task Transformer with input feature reconstruction as an auxiliary task, where the main task of DSR and the auxiliary reconstruction task share the same encoder network. |
C. Ding; S. Sun; J. Zhao; |
1466 | Detecting Alzheimer�s Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a method of detecting Alzheimer?s disease (AD) from the spontaneous speech of subjects in a picture description task using neural networks. |
Z. Liu; Z. Guo; Z. Ling; Y. Li; |
1467 | Automatic Dysarthric Speech Detection Exploiting Pairwise Distance-Based Convolutional Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose a novel automatic dysarthric speech detection approach based on analyses of pairwise distance matrices using convolutional neural networks (CNNs). |
P. Janbakhshi; I. Kodrasi; H. Bourlard; |
1468 | Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose extensions to these techniques that allow RNN-T to exploit external NNLMs during both training and inference time, resulting in 13-18% relative Word Error Rate improvement on Librispeech compared to strong baselines. |
S. Kim; et al. |
1469 | Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To improve the ILME-based inference, we propose an internal LM training (ILMT) method to minimize an additional internal LM loss by updating only the E2E model components that affect the internal LM estimation. |
Z. Meng; et al. |
1470 | Speech Recognition By Simply Fine-Tuning Bert Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a simple method for automatic speech recognition (ASR) by fine-tuning BERT, which is a language model (LM) trained on large-scale unlabeled text data and can generate rich contextual representations. |
W. -C. Huang; C. -H. Wu; S. -B. Luo; K. -Y. Chen; H. -M. Wang; T. Toda; |
1471 | Personalization Strategies for End-to-End Speech Recognition Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we demonstrate how first- and second-pass rescoring strategies can be leveraged together to improve the recognition of such words. |
A. Gourav; et al. |
1472 | Improving Entity Recall in Automatic Speech Recognition with Neural Embeddings Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a method which uses learned text embeddings and nearest neighbor retrieval within a large database of entity embeddings to correct misrecognitions. |
C. Li; P. Rondon; D. Caseiro; L. Velikovich; X. Velez; P. Aleksic; |
1473 | Adaptable Multi-Domain Language Model for Transformer ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an adapter based multi-domain Transformer based language model (LM) for Transformer ASR. |
T. Lee; et al. |
1474 | Transformer Language Models with LSTM-Based Cross-Utterance Information Representation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To extract more powerful and robust cross-utterance representations for the Transformer LM (TLM), this paper proposes the R-TLM which uses hidden states in a long short-term memory (LSTM) LM. |
G. Sun; C. Zhang; P. C. Woodland; |
1475 | Large Margin Training Improves Language Models for ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a large margin language model (LMLM). |
J. Wang; J. Huang; K. W. Church; |
1476 | Domain-Aware Neural Language Models for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a domain-aware rescoring framework suitable for achieving domain-adaptation during second-pass rescoring in production settings. |
L. Liu; et al. |
1477 | Bayesian Transformer Language Models for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to address these issues, this paper proposes a full Bayesian learning framework for Transformer LM estimation. |
B. Xue; et al. |
1478 | Mixed Precision Quantization of Transformer Language Models for Speech Recognition Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, novel mixed precision DNN quantization methods are proposed in this paper. |
J. Xu; S. Hu; J. Yu; X. Liu; H. Meng; |
1479 | Federated Marginal Personalization for ASR Rescoring Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce federated marginal personalization (FMP), a novel method for continuously updating personalized neural network language models (NNLMs) on private devices using federated learning (FL). |
Z. Liu; F. Peng; |
1480 | Multi Path Training Framework for Data-Driven Open-Domain Conversation System Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel framework, Multi-Path Training (MPT), for training a robust dialogue response generation system. |
S. Wu; D. Zhang; Y. Li; Z. Wu; |
1481 | Action State Update Approach to Dialogue Management Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose the action state update approach (ASU) for utterance interpretation, featuring a statistically trained binary classifier used to detect dialogue state update actions in the text of a user utterance. |
S. Stoyanchev; S. Keizer; R. Doddipatla; |
1482 | Generating Empathetic Responses By Injecting Anticipated Emotion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel empathetic response generation method that incorporates the anticipated emotion into response generation by minimizing the divergence between distribution of responses? anticipated emotion and ground-truth emotion. |
Y. Liu; J. Du; X. Li; R. Xu; |
1483 | Towards Immediate Backchannel Generation Using Attention-Based Early Prediction Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To make accurate predictions on the basis of delayed ASR outputs, we propose early prediction for backchannel opportunity and backchannel category based on attention-based LSTM mechanisms. |
A. I. Adiba; T. Homma; T. Miyoshi; |
1484 | Error-Driven Pruning of Language Models for Virtual Assistants Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We customize entropy pruning by allowing for a keep list of infrequent n-grams that require a more relaxed pruning threshold, and propose three methods to construct the keep list. |
S. Gondala; L. Verwimp; E. Pusateri; M. Tsagkias; C. Van Gysel; |
1485 | Paragraph Level Multi-Perspective Context Modeling for Question Generation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this research, we proposed a multi-perspective paragraph context modeling mechanism, which firstly encodes the contextualized representation of input paragraph, and then utilize multi-head self-attention and Rezero network to further enhance paragraph-level feature extraction and context modeling. |
J. Bai; W. Rong; F. Xia; Y. Wang; Y. Ouyang; Z. Xiong; |
1486 | Improving Dialogue Response Generation Via Knowledge Graph Filter Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to leverage the contextual word representation of dialog post to filter out irrelevant knowledge with an attention-based triple filter network. |
Y. Wang; Y. Wang; X. Lou; W. Rong; Z. Hao; S. Wang; |
1487 | Topic-Aware Dialogue Generation with Two-Hop Based Graph Attention Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For this purpose, we propose an open-domain topic-aware dialogue generation model via joint learning. |
S. Zhou; W. Rong; J. Zhang; Y. Wang; L. Shi; Z. Xiong; |
1488 | HSAN: A Hierarchical Self-Attention Network for Multi-Turn Dialogue Generation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a hierarchical self-attention network, named HSAN, which attends to the important words and utterances in context simultaneously. |
Y. Kong; L. Zhang; C. Ma; C. Cao; |
1489 | Learning to Select Context in A Hierarchical and Global Perspective for Open-Domain Dialogue Generation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel model with hierarchical self-attention mechanism and distant supervision to not only detect relevant words and utterances in short and long distances, but also discern related information globally when decoding. |
L. Shen; H. Zhan; X. Shen; Y. Feng; |
1490 | Towards Efficiently Diversifying Dialogue Generation Via Embedding Augmentation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Unlike previous augmentation methods that mostly focus on token manipulation and ignore the essential variety within a single sample using hard labels, we propose to promote the generation diversity of the neural dialogue models via soft embedding augmentation along with soft labels in this paper. |
Y. Cao; L. Ding; Z. Tian; M. Fang; |
1491 | End2End Acoustic to Semantic Transduction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel end-to-end sequence-to-sequence spoken language understanding model using an attention mechanism. |
V. Pelloin; et al. |
1492 | Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: With the aim of aiding development of spoken dialog systems in low resourced languages, we propose a novel acoustics based intent recognition system that uses discovered phonetic units for intent classification. |
A. Gupta; X. Li; S. K. Rallabandi; A. W. Black; |
1493 | Speech-Language Pre-Training for End-to-End Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to unify a well-optimized E2E ASR encoder (speech) and a pre-trained language model encoder (language) into a transformer decoder. |
Y. Qian; et al. |
1494 | Two-Stage Textual Knowledge Distillation for End-to-End Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To utilize textual information more effectively, this work proposes a two-stage textual knowledge distillation method that matches utterancelevel representations and predicted logits of two modalities during pre-training and fine-tuning, sequentially. |
S. Kim; G. Kim; S. Shin; S. Lee; |
1495 | Semi-Supervised Spoken Language Understanding Via Self-Supervised Speech and Language Model Pretraining Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a clean and general framework to learn semantics directly from speech with semi-supervision from transcribed or untranscribed speech to address these issues. |
C. -I. Lai; Y. -S. Chuang; H. -Y. Lee; S. -W. Li; J. Glass; |
1496 | DO As I Mean, Not As I Say: Sequence Loss Training for Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose non-differentiable sequence losses based on SLU metrics as a proxy for semantic error and use the REINFORCE trick to train ASR and SLU models with this loss. |
M. Rao; et al. |
1497 | St-Bert: Cross-Modal Language Model Pre-Training for End-to-End Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this context, we introduce a cross-modal pre-trained language model, called Speech-Text BERT (ST-BERT), to tackle end-to-end spoken language understanding (E2E SLU) tasks. |
M. Kim; G. Kim; S. -W. Lee; J. -W. Ha; |
1498 | End-to-End Spoken Language Understanding Using Transformer Networks and Self-Supervised Pre-Trained Features Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we introduce a modular End-to-End (E2E) SLU transformer network based architecture which allows the use of self-supervised pre- trained acoustic features, pre-trained model initialization and multi-task training. |
E. Morais; H. -K. J. Kuo; S. Thomas; Z. T�ske; B. Kingsbury; |
1499 | Sentiment Injected Iteratively Co-Interactive Network for Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we argue that implicitly introducing sentimental features can promote SLU performance. |
Z. Huang; F. Liu; P. Zhou; Y. Zou; |
1500 | RNN Transducer Models for Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a comprehensive study on building and adapting RNN transducer (RNN-T) models for spoken language understanding (SLU). |
S. Thomas; et al. |
1501 | Leveraging Acoustic and Linguistic Embeddings from Pretrained Speech and Language Models for Intent Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a novel intent classification framework that employs acoustic features extracted from a pretrained speech recognition system and linguistic features learned from a pretrained language model. |
B. Sharma; M. Madhavi; H. Li; |
1502 | ORTHROS: Non-autoregressive End-to-end Speech Translation With Dual-decoder Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel NAR E2E-ST framework, Orthros, in which both NAR and autoregressive (AR) decoders are jointly trained on the shared speech encoder. |
H. Inaguma; Y. Higuchi; K. Duh; T. Kawahara; S. Watanabe; |
1503 | Cascaded Models with Cyclic Feedback for Direct Speech Translation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a technique that allows cascades of automatic speech recognition (ASR) and machine translation (MT) to exploit in-domain direct speech translation data in addition to out-of-domain MT and ASR data. |
T. K. Lam; S. Schamoni; S. Riezler; |
1504 | Jointly Trained Transformers Models for Spoken Language Translation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, degradation in performance is reduced by creating an End-to-End differentiable pipeline between the ASR and MT systems. |
H. K. Vydana; M. Karafi�t; K. Zmolikova; L. Burget; H. Cernock�; |
1505 | Efficient Use of End-to-End Data in Spoken Language Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work examines how the best use can be made of limited end-to-end training for sequence-to-sequence tasks. |
Y. Lu; Y. Wang; M. J. F. Gales; |
1506 | Streaming Simultaneous Speech Translation with Augmented Memory Transformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an end-to-end transformer-based sequence-to-sequence model, equipped with an augmented memory transformer encoder, which has shown great success on the streaming automatic speech recognition task with hybrid or transducer-based models. |
X. Ma; Y. Wang; M. J. Dousti; P. Koehn; J. Pino; |
1507 | An Empirical Study of End-To-End Simultaneous Speech Translation Decoding Strategies Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a decoding strategy for end-to-end simultaneous speech translation. |
H. Nguyen; Y. Est�ve; L. Besacier; |
1508 | Modeling Homophone Noise for Robust Neural Machine Translation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a robust neural machine translation (NMT) framework to deal with homophone errors. |
W. Qin; X. Li; Y. Sun; D. Xiong; J. Cui; B. Wang; |
1509 | Machine Translation Verbosity Control for Automatic Dubbing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the problem of controlling the verbosity of machine translation out-put, so that subsequent steps of our automatic dubbing pipeline can generate dubs of better quality. |
S. M. Lakew; et al. |
1510 | Improvements to Prosodic Alignment for Automatic Dubbing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we present improvements to the prosodic alignment component of our recently introduced dubbing architecture. |
Y. Virkar; M. Federico; R. Enyedi; R. Barra-Chicote; |
1511 | Image-Assisted Transformer in Zero-Resource Multi-Modal Translation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate how to use visual information as an auxiliary hint for a Transformer-based system in a zero-resource translation scenario. |
P. Huang; S. Sun; H. Yang; |
1512 | Sentence Boundary Augmentation for Neural Machine Translation Robustness Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Through in-depth error analysis, we show that sentence boundary segmentation has the largest impact on quality, and we develop a simple data augmentation strategy to improve segmentation robustness. |
D. Li; T. I; N. Arivazhagan; C. Cherry; D. Padfield; |
1513 | An Empirical Study on Task-Oriented Dialogue Translation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we systematically investigate advanced models on the task-oriented dialogue translation task, including sentence-level, document-level and non-autoregressive NMT models. |
S. Liu; |
1514 | MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a self-supervised learning method for pointer-generator networks to improve spoken-text normalization. |
M. Ihori; N. Makishima; T. Tanaka; A. Takashima; S. Orihashi; R. Masumura; |
1515 | Aligning The Training and Evaluation of Unsupervised Text Style Transfer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel training method based on the evaluation metrics to address the discrepancy issue. |
W. Qian; F. Zhu; J. Yang; J. Han; S. Hu; |
1516 | Neural Inverse Text Normalization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an efficient and robust neural solution for ITN leveraging transformer based seq2seq models and FST-based text normalization techniques for data preparation. |
M. Sunkara; C. Shivade; S. Bodapati; K. Kirchhoff; |
1517 | Generating Human Readable Transcript for Automatic Speech Recognition with Pre-Trained Language Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an ASR post-processing model that aims to transform the incorrect and noisy ASR output into a readable text for humans and downstream tasks. |
J. Liao; et al. |
1518 | Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we treat TN as a neural machine translation problem and present a pure data-driven TN system using Transformer framework. |
W. Jiang; J. Li; M. Chen; J. Ma; S. Wang; J. Xiao; |
1519 | Incorporating Syntactic and Phonetic Information Into Multimodal Word Embeddings Using Graph Convolutional Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This inspires us to propose a new multimodal word representation model, namely, GCNW, which uses the graph convolutional network to incorporate the phonetic and syntactic information into the word representation. |
W. ZHU; S. LIU; C. LIU; |
1520 | LIFI: Towards Linguistically Informed Frame Interpolation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we explore the problem of speech video interpolation. |
A. N. Mathur; D. Batra; Y. K. Singla; R. Ratn Shah; C. Chen; R. Zimmermann; |
1521 | Triple Sequence Generative Adversarial Nets for Unsupervised Image Captioning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel triple sequence generative adversarial net including an image generator, a discriminator, and a sentence generator. |
Y. Zhou; W. Tao; W. Zhang; |
1522 | Align or Attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We verify our theory by conducting retrieval and word discovery experiments on MSCOCO and Flickr8k, and empirically demonstrate that both neural MT with self-attention and statistical MT achieve word discovery scores that are superior to those of a state-of-the-art neural retrieval system, outperforming it by 2% and 5% alignment F1 scores respectively. |
L. Wang; X. Wang; M. Hasegawa-Johnson; O. Scharenborg; N. Dehak; |
1523 | Towards Practical Lipreading with Distilled and Efficient Models Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a series of innovations that significantly bridge that gap: first, we raise the state-of-the-art performance by a wide margin on LRW and LRW-1000 to 88.5 % and 46.6 %, respectively using self-distillation. |
P. Ma; B. Martinez; S. Petridis; M. Pantic; |
1524 | End-To-End Audio-Visual Speech Recognition with Conformers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a hybrid CTC/Attention model based on a ResNet-18 and Convolution-augmented transformer (Conformer), that can be trained in an end-to-end manner. |
P. Ma; S. Petridis; M. Pantic; |
1525 | ASR N-Best Fusion Nets Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a fusion network to jointly consider ASR n-best hypotheses for enhanced robustness to ASR errors. |
X. Liu; et al. |
1526 | Boosting Low-Resource Intent Detection with In-Scope Prototypical Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a universal In-scope Prototypical Networks for low-resource intent detection to be general to dialogue meta-train datasets lacking widely-varying domains, which focuses on the scope of episodic intent classes to construct meta-task dynamically. |
H. Lin; Y. Yan; G. Chen; |
1527 | Conversational Query Rewriting with Self-Supervised Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle these issues, we first propose to construct a large-scale CQR dataset automatically via self-supervised learning, which does not need human annotation. Then we introduce a novel CQR model Teresa based on Transformer, which is enhanced by self-attentive keywords detection and intent consistency constraint. |
H. Liu; M. Chen; Y. Wu; X. He; B. Zhou; |
1528 | Handling Class Imbalance in Low-Resource Dialogue Systems By Combining Few-Shot Classification and Interpolation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a new end-to-end pairwise learning framework that is designed specifically to tackle this phenomenon by inducing a few-shot classification capability in the utterance representations and augmenting data through an interpolation of utterance representations. |
V. Sunder; E. Fosler-Lussier; |
1529 | Improving Cross-Domain Slot Filling with Common Syntactic Structure Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we attempt to solve this task by exploiting the syntactic structures of user utterances, because these syntactic structures are actually accessible and can be shared between utterances from different domains. |
L. Liu; X. Lin; P. Zhang; B. Wang; |
1530 | Joint Intent Detection and Slot Filling Based on Continual Learning Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a Continual Learning Interrelated Model (CLIM) is proposed to consider semantic information with different characteristics and balance the accuracy between intent detection and slot filling effectively. |
Y. Hui; J. Wang; N. Cheng; F. Yu; T. Wu; J. Xiao; |
1531 | Knowledge-Based Chat Detection with False Mention Discrimination Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To deal with this issue, this paper proposes a new model for knowledge-based chat detection with false mention discrimination (FMD-KChat). |
W. Liu; P. Huang; D. Liang; Z. Zhou; |
1532 | Replacing Human Audio with Synthetic Audio for On-Device Unspoken Punctuation Prediction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a novel multi-modal unspoken punctuation prediction system for the English language which combines acoustic and text features. |
D. Soboleva; et al. |
1533 | Adversarial Generative Distance-Based Classifier for Robust Out-of-Domain Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an efficient adversarial attack mechanism to augment hard OOD samples and design a novel generative distance-based classifier to detect OOD samples instead of a traditional threshold-based discriminator classifier. |
Z. Zeng; et al. |
1534 | GAN-Based Out-of-Domain Detection Using Both In-Domain and Out-of-Domain Samples Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the situation where both in-domain (ID) and OOD samples are available, our goal is to take advantage of OOD samples under the GAN-based framework for OOD detection. |
C. Liang; P. Huang; W. Lai; Z. Ruan; |
1535 | Progressive Dialogue State Tracking for Multi-Domain Dialogue Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To model the two observations, we propose to divide the task into two successive procedures: progressive domain-slot tracking and shrunk value prediction. |
J. Wang; M. Liu; X. Quan; |
1536 | Multi-Step Spoken Language Understanding System Based on Adversarial Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a novel multi-step spoken language understanding system based on adversarial learning that can leverage the multiround user?s feedback to update slot values. |
Y. Wang; Y. Shen; H. Jin; |
1537 | Multi-Entity Collaborative Relation Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Instead of designing specific models for single relationship extraction tasks, this paper aims to propose a general framework to extract multiple relations among multiple entities in unstructured text by taking advantage of existing models. |
H. Liu; Z. Li; D. Sheng; H. -T. Zheng; Y. Shen; |
1538 | Multi-Granularity Heterogeneous Graph for Document-Level Relation Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel Multi-granularity Heterogeneous Graph (MHG) to tackle this challenge. |
H. Tang; Y. Cao; Z. Zhang; R. Jia; F. Fang; S. Wang; |
1539 | Improving Event Detection By Exploiting Label Hierarchy Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To fully utilize such information to improve the detection of fine-grained event types, we propose a three-layer label hierarchy and introduce the detection of two coarser-grained types as auxiliary classification tasks. |
X. Xi; et al. |
1540 | Improving NER in Social Media Via Entity Type-Compatible Unknown Word Substitution Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To alleviate the noisy expression in social media data, we present a novel word substitution strategy based on constructing an entity type-compatible (ETC) semantic space. |
J. Xie; K. Zhang; L. Sun; Y. Su; C. Xu; |
1541 | More: A Metric Learning Based Framework for Open-Domain Relation Extraction Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these problems, in this work, we propose a novel learning framework named MORE (Metric learning-based Open Relation Extraction). |
Y. Wang; R. Lou; K. Zhang; M. Y. Chen; Y. Yang; |
1542 | �You Should Probably Read This�: Hedge Detection in Text Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we apply a joint model that leverages words and part-of-speech tags to improve hedge detection in text and achieve a new top score on the CoNLL-2010 Wikipedia corpus. |
D. Katerenchuk; R. Levitan; |
1543 | Enhancing Model Robustness By Incorporating Adversarial Knowledge Into Semantic Representation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, we propose AdvGraph, a novel defense which enhances the robustness of Chinese-based NLP models by incorporating adversarial knowledge into the semantic representation of the input. |
J. Li; T. Du; X. Liu; R. Zhang; H. Xue; S. Ji; |
1544 | Elbert: Fast Albert with Confidence-Window Based Early Exit Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose the ELBERT, which significantly improves the average inference speed compared to ALBERT due to the proposed confidence-window based early exit mechanism, without introducing additional parameters or extra training overhead. |
K. Xie; S. Lu; M. Wang; Z. Wang; |
1545 | Dualformer: A Unified Bidirectional Sequence-to-Sequence Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a new dual domain mapping based on a unified bidirectional sequence-to-sequence (seq2seq) learning. |
J. -T. Chien; W. -H. Chang; |
1546 | Task Aware Multi-Task Learning for Speech to Text Tasks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a task modulation network which allows the model to learn task specific features, while learning the shared features simultaneously. |
S. Indurthi; et al. |
1547 | Label-Aware Text Representation for Multi-Label Text Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a label-aware network to obtain both the label correlation and text representation. |
H. Guo; X. Li; L. Zhang; J. Liu; W. Chen; |
1548 | Mixup Regularized Adversarial Networks for Multi-Domain Text Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose mixup regularized adversarial networks (MRANs) to address these two issues. |
Y. Wu; D. Inkpen; A. El-Roby; |
1549 | Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel approach to overcome this problem based on two principles: a) taking into account uncertainty in the automatic phoneme recognition step, b) accounting for the fact that there may be multiple valid pronunciations. |
D. Korzekwa; J. Lorenzo-Trueba; S. Zaporowski; S. Calamaro; T. Drugman; B. Kostek; |
1550 | Attention-Based Multi-Encoder Automatic Pronunciation Assessment Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose an end-to-end (E2E) pronunciation scoring network based on attention mechanism and multi-encoder consisting of audio and text encoders. |
B. Lin; L. Wang; |
1551 | Improving Pronunciation Assessment Via Ordinal Regression with Anchored Reference Samples Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose two new statistical features, average GOP (aGOP) and confusion GOP (cGOP) and use them to train a binary classifier in Ordinal Regression with Anchored Reference Samples (ORARS). |
B. Su; S. Mao; F. Soong; Y. Xia; J. Tien; Z. Wu; |
1552 | Analysing Bias in Spoken Language Assessment Using Concept Activation Vectors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work applies CAVs to assess bias in a spoken language assessment (SLA) system, a regression task. |
X. Wei; M. J. F. Gales; K. M. Knill; |
1553 | Senone-Aware Adversarial Multi-Task Training for Unsupervised Child to Adult Speech Adaptation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a feature adaptation approach by exploiting adversarial multi-task training to minimize acoustic mismatch at the senone (tied triphone states) level between adult and child speech and leverage large amounts of transcribed adult speech. |
R. Duan; N. F. Chen; |
1554 | Classifying Speech Intelligibility Levels of Children in Two Continuous Speech Styles Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper attempts to construct automatic systems that help detect children with severe speech problems at an early stage. |
Y. -S. Lin; S. -C. Tseng; |
1555 | Recent Advances in Arabic Syntactic Diacritics Restoration Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we implement a knowledge distillation technique where an ensemble of teachers/taggers is used to train a single student tagger. |
Y. Hifny; |
1556 | Making Punctuation Restoration Robust and Fast with Multi-Task Learning and Knowledge Distillation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address the former, we use a multi-task learning framework with ELECTRA, a recently proposed improvement on BERT, that has a generator-discriminator structure. |
M. Hentschel; E. Tsunoo; T. Okuda; |
1557 | Variational Dialogue Generation with Normalizing Flows Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents the dialogue flow VAE (DF-VAE) for variational dialogue generation. |
T. -C. Luo; J. -T. Chien; |
1558 | NN-KOG2P: A Novel Grapheme-to-Phoneme Model for Korean Language Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel Korean G2P model architecture, reflecting the characteristics of Korean pronunciation, called neural network-based Korean G2P (NN-KoG2P). |
H. -Y. Kim; J. -H. Kim; J. -M. Kim; |
1559 | Joint Alignment Learning-Attention Based Model for Grapheme-to-Phoneme Conversion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel approach to optimize the G2P conversion model directly alignment grapheme-phoneme sequence by using alignment learning (AL) as the loss function. |
Y. Wang; F. Bao; H. Zhang; G. Gao; |
1560 | Knowledge Distillation for Improved Accuracy in Spoken Question Answering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: However, the recent work shows that ASR systems generate highly noisy transcripts, which critically limit the capability of machine comprehension on the SQA task. To address the issue, we present a novel distillation framework. |
C. You; N. Chen; Y. Zou; |
1561 | Coarse-To-Careful: Seeking Semantic-Related Knowledge for Open-Domain Commonsense Question Answering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Towards the issue of introducing related knowledge, we propose a semantic-driven knowledge-aware QA framework, which controls the knowledge injection in a coarse-to-careful fashion. |
L. Xing; Y. Hu; J. Yu; Y. Xie; W. Peng; |
1562 | Language Model Is All You Need: Natural Language Understanding As Question Answering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we study the use of a certain family of transfer learning, where the target domain is mapped to the source domain. |
M. Namazifar; A. Papangelis; G. Tur; D. Hakkani-T�r; |
1563 | Integrating Subgraph-Aware Relation and Direction Reasoning for Question Answering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address these challenges, we propose a novel neural model, Relation-updated Direction-guided Answer Selector (RDAS), which converts relations in each subgraph to additional nodes to learn structure information. |
X. Wang; et al. |
1564 | Role Aware Multi-Party Dialogue Question Answering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In a novel aspect, this paper proposes the Role Aware Multi-Party Network (RAMPNet), a model utilizing the information of speaker and role to present who is speaking and who is mentioned, making role awareness an available message for our model. |
J. -H. Hsu; P. -W. Shen; H. -T. Su; C. -H. Chang; J. -F. Yeh; W. H. Hsu; |
1565 | MCR-NET: A Multi-Step Co-Interactive Relation Network for Unanswerable Questions on Machine Reading Comprehension Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle this problem, we propose a Multi-Step Co-Interactive Relation Network (MCR-Net) to explicitly model the mutual interaction and locate key clues from coarse to fine by introducing a co-interactive relation module. |
W. Peng; et al. |
1566 | Hierarchical Speaker-Aware Sequence-to-Sequence Model for Dialogue Summarization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a hierarchical transformer-based model for dialogue summarization. |
Y. Lei; Y. Yan; Z. Zeng; K. He; X. Zhang; W. Xu?; |
1567 | A Large-Scale Chinese Long-Text Extractive Summarization Corpus Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we publish a large-scale Chinese Long-text Extractive Summarization corpus named CLES. |
K. Chen; G. Fu; Q. Chen; B. Hu; |
1568 | Adaptive Bi-Directional Attention: Exploring Multi-Granularity Representations for Machine Reading Comprehension Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we propose a novel approach called Adaptive Bidirectional Attention, which adaptively exploits the source representations of different levels to the predictor. |
N. Chen; F. Liu; C. You; P. Zhou; Y. Zou; |
1569 | Graph Attention and Interaction Network With Multi-Task Learning for Fact Verification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a graph attention and interaction network (GAIN) for claim verification. |
R. Yang; R. Wang; Z. -H. Ling; |
1570 | Enhancing Deep Paraphrase Identification Via Leveraging Word Alignment Information Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Apart from directly encoding WAI into fixed-size embeddings, we propose a novel auxiliary task so that the baselines can be pre-trained using a large amount of unlabeled in-domain data. |
B. Li; T. Liu; B. Wang; L. Wang; |
1571 | An End-To-End Actor-Critic-Based Neural Coreference Resolution System Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an actor-critic-based neural coreference resolution system, which can achieve both mention detection and mention clustering by leveraging an actor-critic deep reinforcement learning technique and a joint training algorithm. |
Y. Wang; Y. Shen; H. Jin; |
1572 | Reduced-Complexity Modular Polynomial Multiplication for R-LWE Cryptosystems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a new method is proposed to integrate the modular reduction into the Karatsuba polynomial multiplication. |
X. Zhang; K. K. Parhi; |
1573 | Seizure Detection Using Power Spectral Density Via Hyperdimensional Computing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper explores seizure detection from electroencephalogram (EEG) from subjects with epilepsy using HD computing based on power spectral density (PSD) features. |
L. Ge; K. K. Parhi; |
1574 | FPGA Hardware Design for Plenoptic 3D Image Processing Algorithm Targeting A Mobile Application Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The idea presented in this paper is to use the FPGA based hardware design to improve the performance of a 3D depth estimation algorithm by utilizing the advantage of concurrent execution. |
F. Bhatti; T. Greiner; |
1575 | SLAP: A Split Latency Adaptive VLIW Pipeline Architecture Which Enables On-The-Fly Variable SIMD Vector-Length Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe the Split Latency Adaptive Pipeline (SLAP) VLIW architecture, a cache performance improvement technology that requires zero change to object code, while removing smart DMAs and their overhead. |
A. Shrivastava; A. Gatherer; T. Sun; S. Wokhlu; A. Chandra; |
1576 | Unsupervised Clustering of Time Series Signals Using Neuromorphic Energy-Efficient Temporal Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a neuromorphic approach to unsupervised time series clustering based on Temporal Neural Networks that is capable of ultra low-power, continuous online learning. |
S. Chaudhari; H. Nair; J. M. F. Moura; J. Paul Shen; |
1577 | Angle�of�Arrival (AoA) Factorization in Multipath Channels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper considers the problem of estimating K angle of arrivals (AoA) using an array of M > K microphones. |
Y. -L. Wei; R. R. Choudhury; |
1578 | Scaled Fast Nested Key Equation Solver for Generalized Integrated Interleaved BCH Decoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes novel reformulations of the nested BCH KES to enable scalar pre-computation. |
Z. Xie; X. Zhang; |
1579 | Joint Optimization for Full-Duplex Cellular Communications Via Intelligent Reflecting Surface Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Specifically, we propose a low-complexity minorization-maximization (MM) algorithm for solving the subproblems of designing the precoding matrix and the reflection coefficients, respectively. |
Z. Peng; C. Pan; Z. Zhang; X. Chen; L. Li; A. L. Swindlehurst; |
1580 | A Color Doppler Processing Engine with An Adaptive Clutter Filter for Portable Ultrasound Imaging Devices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an optimized color Doppler processing engine for portable ultrasound devices. |
Y. -L. Lo; C. -H. Yang; |
1581 | Convolutional Neural Network-Aided Bit-Flipping for Belief Propagation Decoding of Polar Codes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a convolutional neural network-aided bit-flipping (CNN-BF) mechanism to further enhance BP decoding. |
C. -F. Teng; A. K. -S. Ho; C. -H. D. Wu; S. -S. Wong; A. -Y. A. Wu; |
1582 | Taming Voting Algorithms on Gpus for An Efficient Connected Component Analysis Algorithm Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper explores multiple ways to reduce those conflicts for voting algorithms and especially for Connected Component Analysis. |
F. Lemaitre; A. Hennequin; L. Lacassagne; |
1583 | Positnn: Training Deep Neural Networks with Mixed Low-Precision Posit Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The presented research aims to evaluate the feasibility to train deep convolutional neural networks using posits. |
G. Raposo; P. Tom�s; N. Roma; |
1584 | Bluetooth Low Energy and CNN-Based Angle of Arrival Localization in Presence of Rayleigh Fading Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The paper proposes an efficient Convolutional Neural Network (CNN)-based indoor localization framework to tackle these issues specific to BLE-based settings. |
Z. HajiAkhondi-Meybodi; M. Salimibeni; A. Mohammadi; K. N. Plataniotis; |
1585 | Robust Device-Free Proximity Detection Using Wifi Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose two robust and responsive features in the frequency dimension, which are sensitive to the distance of motion, and establish the connection between the underlying radio propagation properties and the features. |
Y. Hu; M. Z. Ozturk; F. Zhang; B. Wang; K. J. Ray Liu; |
1586 | Online Dynamic Window (ODW) Assisted 2-Stage LSTM Indoor Localization for Smart Phones Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this regard, the paper takes one step forward to transfer offline IMU-based models to online positioning frameworks. |
M. Atashi; A. Mohammadi; |
1587 | Optimal TOA Localization for Moving Sensor in Asymmetric Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We develop an optimal localization method based-on maximum likelihood (ML) estimator, namely ML-LOC, utilizing information on the SN velocity and clock drift, to determine the position of a moving SN. |
S. Zhao; X. -P. Zhang; X. Cui; M. Lu; |
1588 | Low Complexity SLM for OFDMA System with Implicit Side Information Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel SLM algorithm for the OFDMA system that requires no side information transmission. |
S. Hu; M. Yang; K. Kang; H. Qian; |
1589 | Reduced-Complexity Channel Estimation By Hierarchical Interpolation Exploiting Sparsity for Massive MIMO Systems with Uniform Rectangular Array Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To save the complexity, we propose hierarchical channel interpolation algorithm by exploiting the channel sparsity in the millimeter wave frequency band. |
C. -S. Wang; P. -Y. Tsai; |
1590 | Traffic Speed Forecasting Via Spatio-Temporal Attentive Graph Isomorphism Network Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes an end-to-end framework to capture spatial dependencies through graph isomorphism network, while explicitly taking network topologic similarities into account and leveraging symmetric traffic for learning the traffic conditions. |
Q. Yang; T. Zhong; F. Zhou; |
1591 | Inferring High-Resolutional Urban Flow With Internet Of Mobile Things Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work introduces a new method for inferring fine-grained urban flow with the internet of mobile things such as taxis and bikes. |
F. Zhou; X. Jing; L. Li; T. Zhong; |
1592 | Transfer Learning for Input Estimation of Vehicle Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study proposes a learning-based method with domain adaptability for input estimation of vehicle suspension systems. |
L. M. Cronin; S. S. Eshkevari; D. Sen; S. N. Pakzad; |
1593 | Identification of Deep Breath While Moving Forward Based on Multiple Body Regions and Graph Signal Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an unobtrusive solution that can automatically identify deep breath when a person is walking past the global depth camera. |
Y. Wang; et al. |
1594 | Multi-Object Tracking Using Poisson Multi-Bernoulli Mixture Filtering For Autonomous Vehicles Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we developed an RFS-based MOT framework for 3D LiDAR data. |
S. Pang; H. Radha; |
1595 | Adaptive RF Fingerprint Decomposition in Micro UAV Detection Based on Machine Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a method using empirical mode decomposition (EMD) and ensemble empirical mode decomposition (EEMD) on extracting the communication channel characteristics of intruding UAVs. |
C. Xu; F. He; B. Chen; Y. Jiang; H. Song; |
1596 | Depression Detection By Analysing Eye Movements on Emotional Images Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To achieve an objective and efficient depression detection system, we propose a cognitive psychology experimental paradigm based on the attentional bias theory and eye movements in this paper. |
R. Shen; Q. Zhan; Y. Wang; H. Ma; |
1597 | Weakly Supervised Patch Label Inference Network with Image Pyramid for Pavement Diseases Recognition in The Wild Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present an end-to-end deep learning approach named Weakly Super-vised Patch Label Inference Network with Image Pyramid (WSPLIN-IP) for recognizing various types of pavement diseases that are not just limited to the specific ones, such as crack and pothole. |
G. Huang; S. Huang; L. Huangfu; D. Yang; |
1598 | A Wireless Reference Active Noise Control Headphone Using Coherence Based Selection Technique Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Hence, we adopt a wireless reference microphone to pick up the reference signals around the noise sources. |
X. Shen; D. Shi; W. -S. Gan; |
1599 | How to Use Time Information Effectively? Combining with Time Shift Module for Lipreading Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The proposed method verified on two challenging word-level lipreading datasets LRW and LRW-1000 and achieved new state-of-the-art performance. |
M. Hao; M. Mamut; N. Yadikar; A. Aysa; K. Ubul; |
1600 | Exploring The Application of Synthetic Audio in Training Keyword Spotters Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper details some initial exploration into the application of Text-To-Speech (TTS) audio as a helper tool for training keyword spotters in these low-resource scenarios. |
A. Werchniak; et al. |
1601 | Graph Enhanced Query Rewriting for Spoken Language Understanding System Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we construct a user interaction graph from their queries using data mined from a Markov Chain Model [1], and introduce a self-supervised pre-training process for learning query embeddings by leveraging the recent developments in Graph Representation Learning (GRL). |
S. Yuan; S. Gupta; X. Fan; D. Liu; Y. Liu; C. Guo; |
1602 | Deep Neural Network Based Cough Detection Using Bed-Mounted Accelerometer Measurements Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We have performed cough detection based on measurements from an accelerometer attached to the patient?s bed. |
M. Pahar; I. Miranda; A. Diacon; T. Niesler; |
1603 | Radio Frequency Based Heart Rate Variability Monitoring Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present mmHRV, a contact-free HRV monitoring system using commercial millimeter-wave (mmWave) radio. |
F. Wang; X. Zeng; C. Wu; B. Wang; K. J. Ray Liu; |
1604 | Discrete Cosine Transform Based Causal Convolutional Neural Network for Drift Compensation in Chemical Sensors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop a causal convolutional neural network (CNN) with a Discrete Cosine Transform (DCT) layer to estimate the drift signal. |
D. Badawi; A. Agambayev; S. Ozev; A. Enis �etin; |
1605 | Wifi-Based Device-Free Gesture Recognition Through-the-Wall Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose such a gesture recognition system that can recover information about the actual trajectory of the hand movement allowing an expandable set of gestures. |
S. D. Regani; B. Wang; K. J. Ray Liu; |
1606 | Sound Recovery From Radio Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we model the vibration on object surfaces due to sound for mmWave devices. |
M. Z. Ozturk; C. Wu; B. Wang; K. J. Ray Liu; |
1607 | Fully-Neural Approach to Vehicle Weighing and Strain Prediction on Bridges Using Wireless Accelerometers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new BWIM approach based on a deep neural network using accelerometers, which are easier to install than strain sensors, thus helping the advancement of low-cost BWIM systems. |
T. Kawakatsu; K. Aihara; A. Takasu; J. Adachi; H. Wang; T. Nagayama; |
1608 | End To End Learning For Convolutive Multi-Channel Wiener Filtering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a dereverberation and speech source separation method based on deep neural network (DNN). |
M. Togami; |
1609 | Makf-Sr: Multi-Agent Adaptive Kalman Filtering-Based Successor Representations Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The proposed framework can adapt quickly to the changes in a multi-agent environment faster than the MF methods and with a lower computational cost compared to MB algorithms. |
M. Salimibeni; P. Malekzadeh; A. Mohammadi; P. Spachos; K. N. Plataniotis; |
1610 | Variation-Stable Fusion for PPG-Based Biometric System Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates the employment of photoplethysmography (PPG) for user authentication systems. |
D. Y. Hwang; B. Taha; D. Hatzinakos; |
1611 | Improving Stability of Adversarial Li-ion Cell Usage Data Generation Using Generative Latent Space Modelling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this treatise, some robust loss-functions for the TimeGAN architecture are explored for generating realistic Li-ion CUD. |
S. Chattoraj; S. Pratiher; S. Pratiher; H. Konik; |
1612 | SQWA: Stochastic Quantized Weight Averaging For Improving The Generalization Capability Of Low-Precision Deep Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a new quantized neural network optimization approach, stochastic quantized weight averaging (SQWA), to design low-precision DNNs with good generalization capability using model averaging. |
S. Shin; Y. Boo; W. Sung; |
1613 | A Quantitative Analysis Of The Robustness Of Neural Networks For Tabular Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a quantitative approach to demonstrate the robustness of neural networks for tabular data. |
K. Gupta; B. Pesquet-Popescu; F. Kaakai; J. -C. Pesquet; |
1614 | Spatial Equalization Before Reception: Reconfigurable Intelligent Surfaces for Multi-Path Mitigation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to use an RIS as a spatial equalizer to address the well-known multi-path fading phenomenon. |
H. Zhang; L. Song; Z. Han; H. V. Poor; |
1615 | Interference Analysis in Reconfigurable Intelligent Surface-Assisted Multiple-Input Multiple-Output Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider a multiple-input multiple-output network where each base station serves a user equipment with the aid of an RIS equipped with N reconfigurable elements. |
J. Liu; X. Qian; M. Di Renzo; |
1616 | Codebook Design for Dual-Polarized Ultra-Massive Mimo Communications at Millimeter Wave and Terahertz Bands Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we analyze the dual-polarized intelligent surface channel and present an efficient precoding solution based on the array-of-subarray architecture that aims to maximize the spectral efficiency. |
S. Nie; I. F. Akyildiz; |
1617 | Performance Analysis of Spatial and Frequency Domain Index-Modulated Reconfigurable Intelligent Metasurfaces Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose novel electromagnetics-compliant designs of reconfigurable intelligent surface (RIS) apertures for realizing IM in 6G transceivers. |
J. A. Hodge; K. V. Mishra; B. M. Sadler; A. I. Zaghloul; |
1618 | Meta-Learning for 6G Communication Networks with Reconfigurable Intelligent Surfaces Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a practical channel acquisition and passive beamforming technique is proposed using a limited number of pilot symbols in an RIS-assisted cellular network. |
M. Jung; W. Saad; |
1619 | Model-Inspired Deep Learning for Light-Field Microscopy with Application to Neuron Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a model-inspired deep learning approach to perform fast and robust 3D localization of sources using light-field microscopy images. |
P. Song; H. V. Jadan; C. L. Howe; P. Quicke; A. J. Foust; P. Luigi Dragotti; |
1620 | Time-Varying Graph Signal Inpainting Via Unrolling Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an interpretable graph neural network based on algorithm unrolling to reconstruct a time-varying graph signal from partial measurements. |
S. Chen; Y. C. Eldar; |
1621 | Deep Learning for Linear Inverse Problems Using The Plug-and-Play Priors Framework Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this overview paper, we present the combination of the DL and the Plug-and-Play priors (PPP) framework, showcasing how it allows solving various inverse problems by leveraging the impressive capabilities of existing DL based denoising algorithms. |
W. Chen; D. Wipf; M. Rodrigues; |
1622 | A Plug-and-Play Deep Image Prior Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To mitigate this effect, this work incorporates a plug-and-play prior scheme which can accommodate additional regularization steps within a DIP framework. |
Z. Sun; F. Latorre; T. Sanchez; V. Cevher; |
1623 | MRI Image Recovery Using Damped Denoising Vector AMP Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by image recovery in magnetic resonance imaging (MRI), we propose a new approach to solving linear inverse problems based on iteratively calling a deep neural-network, sometimes referred to as plug-and-play recovery. |
S. Sarkar; R. Ahmad; P. Schniter; |
1624 | Overcoming Measurement Inconsistency In Deep Learning For Linear Inverse Problems: Applications In Medical Imaging Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We show that such inconsistency, which can be critical in domains like medical imaging or defense, is intimately related to the generalization error. We then propose a framework that post-processes the output of DNNs with an optimization algorithm that enforces measurement consistency. |
M. Vella; J. F. C. Mota; |
1625 | Scalable Reinforcement Learning For Routing In Ad-Hoc Networks Based On Physical-Layer Attributes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes a novel and scalable reinforcement learning approach for routing in ad-hoc wireless networks. |
W. Cui; W. Yu; |
1626 | Blind Carbon Copy on Dirty Paper: Seamless Spectrum Underlay Via Canonical Correlation Analysis Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes a practical data-driven approach that allows a pair of secondary users to reliably communicate in underlay mode while keeping the interference at the primary receiver close to its noise floor. |
M. S. Ibrahim; N. D. Sidiropoulos; |
1627 | An Actor-Critic Reinforcement Learning Approach to Minimum Age of Information Scheduling in Energy Harvesting Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider the user scheduling problem over a communication session. |
S. Leng; A. Yener; |
1628 | Moving Object Classification with A Sub-6 GHz Massive MIMO Array Using Real Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we analyze classification of moving objects by employing machine learning on real data from a massive multi-input-multi-output (MIMO) system in an indoor environment. |
B. R. Manoj; G. Tian; S. Gunnarsson; F. Tufvesson; E. G. Larsson; |
1629 | Optimizing Coverage and Capacity in Cellular Networks Using Machine Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we develop and compare two approaches for maximizing coverage and minimizing interference by jointly optimizing the transmit power and downtilt (elevation tilt) settings across sectors. |
R. M. Dreifuerst; et al. |
1630 | Unsupervised Learning for Asynchronous Resource Allocation In Ad-Hoc Wireless Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider optimal resource allocation problems under asynchronous wireless network setting. |
Z. Wang; M. Eisen; A. Ribeiro; |
1631 | Two-Stage Adaptive Pooling with RT-QPCR for Covid-19 Screening Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose two-stage adaptive pooling schemes, 2-STAP and 2-STAMP, for detecting COVID-19 using real-time reverse transcription quantitative polymerase chain reaction (RT-qPCR) test kits. |
A. Heidarzadeh; K. Narayanan; |
1632 | Point of Care Image Analysis for COVID-19 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we train deep neural networks to significantly enhance the capability to detect, grade and monitor COVID-19 patients using CXRs and LUS. |
D. Yaron; et al. |
1633 | An Improved Data Driven Dynamic SIRD Model for Predictive Monitoring of COVID-19 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we carry out the functional modeling of COVID-19 infection trends using two models: the Gaussian mixture model (GMM) and the composite logistic growth model (CLGM). |
P. Singh; A. Singhal; B. Fatimah; A. Gupta; |
1634 | Leveraging A Multiple-Strain Model with Mutations in Analyzing The Spread of Covid-19 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we discuss how our recent work on a multiple-strain spreading model with mutations can help address some key questions concerning the spread of COVID-19. |
A. Sridhar; O. Yagan; R. Eletreby; S. A. Levin; J. B. Plotkin; H. V. Poor; |
1635 | Contact Tracing Enhances The Efficiency of Covid-19 Group Testing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we use side information (SI) collected from contact tracing (CT) within nonadaptive/single-stage group testing algorithms. |
R. Goenka; S. -J. Cao; C. -W. Wong; A. Rajwade; D. Baron; |
1636 | Optimal Questionnaires for Screening of Strategic Agents Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate the problem of questioning travellers to classify them for further testing when the travellers are strategic or are unwilling to reveal their travel histories. |
A. S. Vora; A. A. Kulkarni; |
1637 | Exploring Visual-Audio Composition Alignment Network for Quality Fashion Retrieval in Video Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel Visual-Audio Composition Alignment Network (VACANet) to deal with quality fashion retrieval in video. |
Y. Zhang; et al. |
1638 | A Secure Searchable Image Retrieval Scheme with Correct Retrieval Identity Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Aiming at the issue of user privacy, we proposed a Secure searchable image retrieval scheme with correct retrieval identity. |
L. Wang; H. Yu; |
1639 | Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we improve Chinese spoken language understanding (SLU) by injecting word information. |
D. Teng; L. Qin; W. Che; S. Zhao; T. Liu; |
1640 | A Co-Interactive Transformer for Joint Slot Filling and Intent Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a Co-Interactive Transformer which considers the cross-impact between the two tasks. |
L. Qin; T. Liu; W. Che; B. Kang; S. Zhao; T. Liu; |
1641 | Dual Metric Discriminator for Open Set Video Domain Adaptation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a seminal framework, which involves spatial and temporal information to address OSVDA problem. |
Y. Wang; X. Song; Y. Wang; P. Xu; R. Hu; H. Chai; |
1642 | Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we propose a novel model for cross-domain sentiment classification – CLIM – Contrastive Learning with mutual Information Maximization, to explore the potential of contrastive learning for learning domain-invariant and task-discriminative features. |
T. Li; X. Chen; S. Zhang; Z. Dong; K. Keutzer; |
1643 | Low-Complexity Parameter Learning for OTFS Modulation Based Automotive Radar Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we consider an OTFS modulation based automotive joint radar-communication system and focus on the design of low-complexity parameter estimation algorithm for radar targets. |
C. Liu; S. Liu; Z. Mao; Y. Huang; H. Wang; |
1644 | Federated Dropout Learning for Hybrid Beamforming with Spatial Path Index Modulation in Multi-User Mmwave-Mimo Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce model-based and model-free frameworks for beamformer design in multi-user SPIM-MIMO systems. |
A. M. Elbir; S. Coleri; K. V. Mishra; |
1645 | Information Decoding and SDR Implementation of DFRC Systems Without Training Signals Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel method for decoding information at the communication receiver without using training data, which is implemented using a software-defined radio (SDR). |
D. M. Wong; B. K. Chalise; J. Metcalf; M. Amin; |
1646 | A Low-Complexity MIMO Dual Function Radar Communication System Via One-Bit Sampling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to implement a low-complexity multiple input multiple output DFRC (MIMO-DFRC) system relying on the generalized spatial modulation (GSM) and the low-resolution sampling. |
S. Zhu; F. Xi; S. Chen; A. Nehorai; |
1647 | Learning to Select for Mimo Radar Based on Hybrid Analog-Digital Beamforming Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose an energy-efficient radar beampattern design framework for Millimeter Wave (mmWave) massive multi-input multi-output (mMIMO) systems, equipped with a hybrid analog-digital (HAD) beamforming structure. |
Z. Xu; F. Liu; K. Diamantaras; C. Masouros; A. Petropulu; |
1648 | Word-Level ASL Recognition and Trigger Sign Detection with RF Sensors Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Thus, this paper investigates RF sensing as an alternative sensing modality for ASL recognition to facilitate interactive devices and smart environments for the deaf and hard-of-hearing. |
M. M. Rahman; et al. |
1649 | Hybrid Beamforming for Wideband OFDM Dual Function Radar Communications Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper considers the hybrid beamforming design for wideband OFDM-DFRC system serving multiple users (MUs). |
Z. Cheng; J. He; S. Shi; Z. He; B. Liao; |
1650 | Bit Constrained Communication Receivers In Joint Radar Communications Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we design bit constrained communication receivers in dual-function systems, by considering hybrid analog/digital architectures and treating their operation as task-based quantization. |
D. Ma; N. Shlezinger; T. Huang; Y. Liu; Y. C. Eldar; |
1651 | ICI-Aware Parameter Estimation for Mimo-Ofdm Radar Via Apes Spatial Filtering Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel three-stage delay-Doppler-angle estimation algorithm for a MIMO-OFDM radar in the presence of inter-carrier interference (ICI). |
M. F. Keskin; H. Wymeersch; V. Koivunen; |
1652 | Joint Communications with FH-MIMO Radar Systems: An Extended Signaling Strategy Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the signaling strategy of communications embedding in frequency-hopping (FH) multiple input multiple output (MIMO) radar. |
X. Wang; J. Xu; A. Hassanien; E. Aboutanios; |
1653 | Full-Duplex Multifunction Transceiver with Joint Constant Envelope Transmission and Wideband Reception Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces and justifies a novel system concept that consists of full-duplex transceivers and uses a multifunction signal for simultaneous two-way communication, jamming and sensing tasks. |
J. Marin; M. Bernhardt; T. Riihonen; |
1654 | Waveform Design for The Joint MIMO Radar and Communications with Low Integrated Sidelobe Levels and Accurate Information Embedding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the multiple-waveform design for the joint multiple-input multiple-output radar and communications system, which aims to simultaneously attain low integrated sidelobe level (ISL) of waveforms and accurate fast-time modulation for information embedding (IE). |
Y. Li; X. Wu; R. Tao; |
1655 | Ordered Reliability Bits Guessing Random Additive Noise Decoding Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To that end, here we introduce a soft-detection variant of Guessing Random Additive Noise Decoding (GRAND) called Ordered Reliability Bits GRAND that can decode any moderate redundancy block-code. |
K. R. Duffy; |
1656 | Learned Decimation for Neural Belief Propagation Decoders : Invited Paper Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a two-stage decimation process to improve the performance of neural belief propagation (NBP), recently introduced by Nachmani et al., for short low-density parity-check (LDPC) codes. |
A. Buchberger; C. H�ger; H. D. Pfister; L. Schmalen; A. G. i. Amat; |
1657 | ADMM-Based ML Decoding: from Theory to Practice Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate this approach with respect to its algorithmic and implementation-specific challenges. |
K. Kraft; N. Wehn; |
1658 | Towards Practical Near-Maximum-Likelihood Decoding of Error-Correcting Codes: An Overview Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This overview paper surveys recent progress in this emerging field by reviewing the GRAND algorithm, linear programming decoding, machine-learning aided decoding and the recursive projection-aggregation decoding algorithm. |
T. Tonnellier; M. Hashemipour; N. Doan; W. J. Gross; A. Balatsoukas-Stimming; |
1659 | High-Throughput VLSI Architecture for Soft-Decision Decoding with ORBGRAND Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work reports the first-ever hardware architecture for ORBGRAND, which achieves an average throughput of up to 42.5 Gbps for a code length of 128 at an SNR of 10 dB. |
S. M. Abbas; T. Tonnellier; F. Ercan; M. Jalaleddine; W. J. Gross; |
1660 | Hardware Implementation of Iterative Projection-Aggregation Decoding of Reed-Muller Codes Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a simplification and a corresponding hardware architecture for hard-decision recursive projection-aggregation (RPA) decoding of Reed-Muller (RM) codes. |
M. Hashemipour-Nazari; K. Goossens; A. Balatsoukas-Stimming; |
1661 | M-Activity: Accurate and Real-Time Human Activity Recognition Via Millimeter Wave Radar Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose m-Activity, which can realize HAR while reducing noise caused by environmental multi-path effects, and operate fluently at runtime. |
Y. Wang; H. Liu; K. Cui; A. Zhou; W. Li; H. Ma; |
1662 | Pushing The Limit of Phase Offset for Contactless Sensing Using Commodity Wifi Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose BreathTrack2.0, a contactless breath tracking system based on commodity WiFi devices. |
D. Zhang; X. Li; Y. Chen; |
1663 | Noncontact Heartbeat Detection By Viterbi Algorithm with Fusion of Beat-Beat Interval and Deep Learning-Driven Branch Metrics Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, to detect heartbeat with a high accuracy, we propose a Doppler radar-based heartbeat detection method by the Viterbi algorithm with a fusion of Beat-Beat Interval (BBI) and deep learning-driven Branch Metrics (BM). |
K. Yamamoto; T. Ohtsuki; |
1664 | Typingwristband: A Human Slight Motion Sensing System Based on Vibration Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the problem of how to detect the human?s typing motion, and designed a new system, named as Typing Wristband, to obtain the vibration of wrist using piezoelectric transducer (PZT). |
S. Cheng; J. Yan; J. Li; J. Liu; |
1665 | Movement Detection Using A Reciprocal Received Signal Strength Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A novel reciprocal signal strength model is presented, and an energy detector is developed. |
O. Kaltiokallio; H. Yigitler; |
1666 | Deep Convolutional Gaussian Processes for Mmwave Outdoor Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a deep convolutional Gaussian process (DCGP) based regression approach to achieve high robustness for fingerprinting-based mmWave outdoor localization, which exploits the convolutional structure for deep Gaussian process to allow uncertainty estimation on location predictions. |
X. Wang; M. Patil; C. Yang; S. Mao; P. A. Patel; |
1667 | Exploring Automatic COVID-19 Diagnosis Via Voice and Symptoms from Crowdsourced Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a voice-based framework to automatically detect individuals who have tested positive for COVID-19. |
J. Han; et al. |
1668 | Coughwatch: Real-World Cough Detection Using Smartwatches Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work we propose CoughWatch, a lightweight cough detector for smartwatches that uses audio and movement data for in-the-wild cough detection. |
D. Liaqat; et al. |
1669 | Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer�s Disease Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This study proposes the use of acoustic and linguistic methods to extract features from speech recordings and their transcriptions to discriminate people with conditions related to the Paisa mutation. |
P. A. P�rez-Toro; et al. |
1670 | A Noise-Robust Signal Processing Strategy for Cochlear Implants Using Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a noise-robust signal processing strategy to deal with this problem. |
N. Zheng; Y. Shi; Y. Kang; Q. Meng; |
1671 | Context-Aware Speech Stress Detection in Hospital Workers Using Bi-LSTM Classifiers Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a context-aware speech-based system for stress detection. |
A. Gaballah; A. Tiwari; S. Narayanan; T. H. Falk; |
1672 | Unsupervised Heart Abnormality Detection Based on Phonocardiogram Analysis with Beta Variational Auto-Encoders Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a method of unsupervised PCG analysis that uses beta variational auto-encoder (? ? VAE) to model the normal PCG signals. |
S. Li; K. Tian; R. Wang; |
1673 | Compressing Deep Neural Networks for Efficient Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In order to address this problem, we propose a model compression pipeline to reduce DNN size for speech enhancement, which is based on three kinds of techniques: sparse regularization, iterative pruning and clustering-based quantization. |
K. Tan; D. Wang; |
1674 | Improved Mask-CTC for Non-Autoregressive End-to-End ASR Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To boost the performance of Mask-CTC, we first propose to enhance the encoder network architecture by employing a recently proposed architecture called Conformer. Next, we propose new training and decoding methods by introducing auxiliary objective to predict the length of a partial target sequence, which allows the model to delete or insert tokens during inference. |
Y. Higuchi; H. Inaguma; S. Watanabe; T. Ogawa; T. Kobayashi; |
1675 | Memory-Efficient Speech Recognition on Smart Devices Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We address transducer model’s memory access concerns by optimizing their model architecture and designing novel recurrent cell designs. |
G. Venkatesh; et al. |
1676 | Expediting Discovery in Neural Architecture Search By Combining Learning with Planning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Time to discover optimal structures is a key concern in many AML solutions including NASIL. Here, we proposed an extended version called GNASIL to speed up the process. |
F. S. Fard; V. Singh Tomar; |
1677 | Specialized Embedding Approximation for Edge Intelligence: A Case Study in Urban Sound Classification Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Hence, we introduce Specialized Embedding Approximation (SEA) to train a student featurizer to approximate the teacher?s embedding manifold for a given target domain. |
S. Srivastava; D. Roy; M. Cartwright; J. P. Bello; A. Arora; |
1678 | Light-TTS: Lightweight Multi-Speaker Multi-Lingual Text-to-Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a new lightweight multi-speaker multi-lingual speech synthesis system, named LightTTS, which can quickly synthesize the Chinese, English or code-switch speech of multiple speakers in a non-autoregressive generation manner using only one model. |
S. Li; B. Ouyang; L. Li; Q. Hong; |
1679 | Efficient Long Periodic Binary Sequence Designs for Automotive Radar Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider an FFT-based algorithm for the efficient designs of long periodic binary sequences with arbitrary period lengths and ample diversity. |
Y. Chen; R. Lin; J. Li; |
1680 | Joint Localization and Predictive Beamforming in Vehicular Networks: Power Allocation Beyond Water-Filling Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper explores tailored power allocation (PA) for dual functional radar-communication (DFRC) in the vehicle-to-infrastructure (V2I) network, where a road side unit (RSU) provides both localization and communication services to multiple vehicles. |
F. Liu; C. Masouros; |
1681 | A New Automotive Radar 4D Point Clouds Detector By Using Deep Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, in this paper, we propose a new automotive radar detector based on deep learning using the spatial distribution feature of the real targets, in order to improve the performance of automotive radar detector in the real-world driving scene. |
Y. Cheng; J. Su; H. Chen; Y. Liu; |
1682 | Enhanced Automotive Target Detection Through Radar and Communications Sensor Fusion Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: A sensor fusion algorithm is proposed to benefit from the information from radar and communication to improve the final range estimates. |
S. H. Dokhanchi; B. Shankar Mysore; K. V. Mishra; B. Ottersten; |
1683 | Extended Object Tracking With Automotive Radar Using B-Spline Chained Ellipses Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces a B-spline chained ellipses model representation for extended object tracking (EOT) using high-resolution automotive radar measurements. |
G. Yao; P. Wang; K. Berntorp; H. Mansour; P. Boufounos; P. V. Orlik; |
1684 | Four-Dimensional High-Resolution Automotive Radar Imaging Exploiting Joint Sparse-Frequency and Sparse-Array Design Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel automotive radar imaging technique to provide high-resolution information in four dimensions, i.e., range, Doppler, azimuth, and elevation, by exploiting a joint sparsity design in frequency spectrum and array configurations. |
S. Sun; Y. D. Zhang; |
1685 | An Empirical Study of Visual Features for DNN Based Audio-Visual Speech Enhancement in Multi-Talker Environments Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we perform an empirical study of the most commonly used visual features for DNN based AVSE, the pre-processing requirements for each of these features, and investigate their influence on the performance. |
S. S. Shetu; S. Chakrabarty; E. A. P. Habets; |
1686 | On The Role of Visual Cues in Audiovisual Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an introspection of an audiovisual speech enhancement model. |
Z. Aldeneh; et al. |
1687 | Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: For the objective we propose to use a convolutive transfer function invariant Signal-to-Distortion Ratio (CI-SDR) based loss. |
C. Boeddeker; et al. |
1688 | Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a new paradigm for handling far-field multi-speaker data in an end-to-end (E2E) neural network manner, called directional automatic speech recognition (D-ASR), which explicitly models source speaker locations. |
A. S. Subramanian; et al. |
1689 | Communication-Cost Aware Microphone Selection for Neural Speech Enhancement with Ad-Hoc Microphone Arrays Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a method for jointly-learning a microphone selection mechanism and a speech enhancement network for multi-channel speech enhancement with an ad-hoc microphone array. |
J. Casebeer; J. Kaikaus; P. Smaragdis; |
1690 | Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Aiming at merging the speech enhancement potential of the MFMVDR filter and the estimation capability of temporal convolutional networks (TCNs), in this paper we propose to embed the MFMVDR filter within a deep learning framework. |
M. Tammen; S. Doclo; |
1691 | Compressive Wideband Spectrum Sensing and Carrier Frequency Estimation with Unknown Mimo Channels Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider the problem of joint wideband spectrum sensing and carrier frequency estimation in a sub-Nyquist sampling framework. |
H. Wang; J. Wang; J. Fang; H. Li; |
1692 | Joint Optimization of Spectrally Co-Existing Multi-Carrier Radar and Communication Systems in Cluttered Environments Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a non-alternating approach to jointly optimize the radar and communication transmission power allocated to each sub-carrier. |
F. Wang; H. Li; B. Himed; |
1693 | Target Detection in Frequency Hopping MIMO Dual-Function Radar-Communication Systems Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider a multiple-input multiple-output (MIMO) dual function radar communication (DFRC) system employing frequency hopping (FH) radar waveforms. |
I. P. Eedara; M. G. Amin; G. A. Fabrizio; |
1694 | Asymptotic Distribution of Generalized Likelihood Ratio Test Under Model Misspecification With Application to Cooperative Radar-Communications Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The goal of this paper is to develop an expression for the asymptotic distribution of the generalized likelihood ratio test (GLRT) statistic under model misspecification, that is when the assumed data model is different from the true model. |
A. S. Bondre; C. D. Richmond; |
1695 | Online Antenna Selection for Enhanced DOA Estimation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We formulate the changing DOA estimation problem as a game in the context of online convex optimization, and employ a gradient-based technique that makes a move at each step in order to minimize the total loss after T steps. |
E. Aboutanios; H. Nosrati; X. Wang; |
1696 | Designing Random FM Radar Waveforms with Compact Spectrum Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we examine the impact of using the family of super-Gaussian spectra to serve as alternative design templates. |
C. A. Mohr; S. D. Blunt; |
1697 | Collaborative Inference Via Ensembles on The Edge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here we propose a framework for facilitating the application of DNNs on the edge in a manner which allows multiple users to collaborate during inference in order to improve their prediction accuracy. |
N. Shlezinger; E. Farhan; H. Morgenstern; Y. C. Eldar; |
1698 | Allocating DNN Layers Computation Between Front-End Devices and The Cloud Server for Video Big Data Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a computation allocation algorithm of DNN between the front-end devices and the cloud server. |
P. Xing; X. Liu; P. Peng; T. Huang; Y. Tian; |
1699 | Branchy-GNN: A Device-Edge Co-Inference Framework for Efficient Point Cloud Processing Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Built upon the emerging paradigm of device-edge co-inference, where an edge device extracts and transmits the intermediate feature to an edge server for further processing, we propose Branchy-GNN for efficient graph neural network (GNN) based point cloud processing by leveraging edge computing platforms. |
J. Shao; H. Zhang; Y. Mao; J. Zhang; |
1700 | Collaborative Intelligence: Challenges and Opportunities Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents an overview of the emerging area of collaborative intelligence (CI). |
I. V. Bajic; W. Lin; Y. Tian; |
1701 | Latent Space Motion Analysis for Collaborative Intelligence Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: By analyzing the effect of common DNN operations on optical flow, we show that the motion present in each channel of a feature tensor is approximately equal to the scaled version of the input motion. |
M. Ulhaq; I. V. Bajic; |
1702 | Teacher-Student Learning With Multi-Granularity Constraint Towards Compact Facial Feature Representation Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel end-to-end feature compression scheme by leveraging the representation and learning capability of deep neural networks, towards intelligent front-end equipped analysis with promising accuracy and efficiency. |
S. Wang; S. Wang; W. Yang; X. Zhang; S. Wang; S. Ma; |
1703 | Discriminability of Single-Layer Graph Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We focus on the property of discriminability and establish conditions under which the inclusion of pointwise nonlinearities to a stable graph filter bank leads to an increased discriminative capacity for high-eigenvalue content. |
S. Pfrommer; A. Ribeiro; F. Gama; |
1704 | On The Stability of Graph Convolutional Neural Networks Under Edge Rewiring Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we develop an interpretable upper bound elucidating that graph neural networks are stable to rewiring between high degree nodes. |
H. Kenlay; D. Thano; X. Dong; |
1705 | Geometric Scattering Attention Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we introduce a new attention-based architecture to produce adaptive task-driven node representations by implicitly learning node-wise weights for combining multiple scattering and GCN channels in the network. |
Y. Min; F. Wenkel; G. Wolf; |
1706 | Ego-GNNs: Exploiting Ego Structures in Graph Neural Networks Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to augment the GNN message-passing operations with information de-fined on ego graphs (i.e., the induced subgraph surrounding each node). |
D. Sandfelder; P. Vijayan; W. L. Hamilton; |
1707 | Learning The Relevant Substructures for Tasks on Graph Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we demonstrate that LRP models can be used on challenging graph classification tasks to provide both state-of-the-art performance and interpretability, through the detection of the relevant substructures used by the network to make its decisions. |
L. Chen; Z. Chen; J. Bruna; |
1708 | A Short Tutorial on The Weisfeiler-Lehman Test And Its Variants Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: The goal of this short note is pedagogical and practical: We explain the differences between the WL and folklore-WL formulations, with pointers to existing discussions in the literature. |
N. T. Huang; S. Villar; |
1709 | Hybrid Model for Network Anomaly Detection with Gradient Boosting Decision Trees and Tabtransformer Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this pa-per, we present our solution for the ICASSP 2021 Network Anomaly Detection Challenge (NAD) challenge. |
X. Xu; X. Zheng; |
1710 | Voting-Based Ensemble Model for Network Anomaly Detection Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a machine learning framework based on XGBoost and deep neural networks to classify normal traffic and anomalous traffic. |
T. -H. Yang; Y. -T. Lin; C. -L. Wu; C. -Y. Wang; |
1711 | An Accuracy Network Anomaly Detection Method Based on Ensemble Model Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces an ensemble model, which is a powerful technique to increase accuracy on network anomaly detection. |
F. Liu; X. Li; W. Xiong; H. Jiang; G. Xie; |
1712 | Fden: Mining Effective Information of Features in Detecting Network Anomalies Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel method to effectively leverage the features in detecting network anomalies, named FDEn, consisting of flow-based Feature Derivation (FD) and prior knowledge incorporated Ensemble models (Enpk). |
B. Li; et al. |
1713 | Multi-Scale Residual Network for Covid-19 Diagnosis Using Ct-Scans Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a three-level approach to separate the cases of COVID-19, pneumonia from normal patients using chest CT scans. |
P. Garg; R. Ranjan; K. Upadhyay; M. Agrawal; D. Deepak; |
1714 | Diagnosing Covid-19 from CT Images Based on An Ensemble Learning Framework Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel ensemble learning framework to solve this problem. |
B. Li; Q. Zhang; Y. Song; Z. Zhao; Z. Meng; F. Su; |
1715 | CNR-IEMN: A Deep Learning Based Approach to Recognise Covid-19 from CT-Scan Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we proposed deep learning based approach to recognize the Covid-19 infection from the CT-scans. |
F. Bougourzi; R. Contino; C. Distante; A. Taleb-Ahmed; |
1716 | Covid-19 Diagnostic Using 3d Deep Transfer Learning for Classification of Volumetric Computerised Tomography Chest Scans Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes COVID-19 diagnosis based on analysis of Computerised tomography (CT) chest scans. |
S. Xue; C. Abhayaratne; |
1717 | A Multi-Stage Progressive Learning Strategy for Covid-19 Diagnosis Using Chest Computed Tomography with Imbalanced Data Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, a multi-stage progressive learning strategy is investigated to train classifiers for COVID-19 Diagnosis using imbalanced Chest Computed Tomography Data acquired from patients infected with COVID-19 Pneumonia, Community Acquired Pneumonia (CAP) and from normal healthy subjects. |
Z. Yang; Y. Hou; Z. Chen; L. Zhang; J. Chen; |
1718 | Detecting Covid-19 and Community Acquired Pneumonia Using Chest CT Scan Images With Deep Learning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a two-stage Convolutional Neural Network (CNN) based classification framework for detecting COVID-19 and Community Acquired Pneumonia (CAP) using the chest Computed Tomography (CT) scan images. |
S. Chaudhary; S. Sadbhawna; V. Jakhetiya; B. N. Subudhi; U. Baid; S. C. Guntuku; |
1719 | Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we investigate different speaker representations and proposed to integrate pretrained and learnable speaker representations. |
C. -M. Chien; J. -H. Lin; C. -y. Huang; P. -c. Hsu; H. -y. Lee; |
1720 | The Thinkit System for Icassp2021 M2voc Challenge Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce the low resource text-to-speech system from the ThinkIT team submitted to Multi-Speaker Multi-Style Voice Cloning Challenge (M2VoC). |
Z. Shang; H. Zhang; Z. Chen; B. Zhou; P. Zhang; |
1721 | Dian: Duration Informed Auto-Regressive Network for Voice Cloning Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel end-to-end speech synthesis approach, Duration Informed Auto-regressive Network (DIAN), which consists of an acoustic model and a separate duration model. |
W. Song; et al. |
1722 | Prosody and Voice Factorization for Few-Shot Speaker Adaptation in The Challenge M2voc 2021 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: To prevent the model from overfitting, this paper proposes a novel speaker adaptation framework that decomposes the prosody and voice characteristics in the end-to-end model. |
T. Wang; et al. |
1723 | The Huya Multi-Speaker and Multi-Style Speech Synthesis System for M2voc Challenge 2020 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose the Huya multi-speaker and multi-style speech synthesis system which is based on DurIAN and HiFi-GAN to generate high-fidelity speech even under low-resource condition. |
J. Wang; et al. |
1724 | The Multi-Speaker Multi-Style Voice Cloning Challenge 2021 Related Papers Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a detailed explanation on the tasks and data used in the challenge, followed by a summary of submitted systems and evaluation results. |
Q. Xie; et al. |