SPEECH/AUDIO

FastFit: Towards Real-Time Iterative Neural Vocoder by Replacing U-Net Encoder With Multiple STFTs

INTERSPEECH 2023. 08

REINFORCEMENT LEARNING

On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning

ICML 2023. 07

COMPUTER VISION

Local 3D Editing via 3D Distillation of CLIP Knowledge

CVPR 2023. 06

COMPUTER VISION

Revisiting the Importance of Amplifying Bias for Debiasing

AAAI 2023. 02

COMPUTER VISION

Efficient Skeleton-Based Action Recognition via Joint-Mapping strategies

WACV 2023. 01

NLP

Normalizing Mutual Information for Robust Adaptive Training for Translation

EMNLP Short Paper 2022. 12

NLP

LittleBird: Efficient Faster & Longer Transformer for Question Answering

EMNLP Long Paper 2022. 12

NLP

APEACH: Attacking Pejorative Expressions with Analysis on Crowd-Generated Hate Speech Evaluation Datasets

Findings of EMNLP 2022. 12

NLP

Persona-Knowledge Dialogue Multi-Context Retrieval and Enhanced Decoding Methods

Customized Chat Grounding Persona and Knowledge Workshop at COLING 2022. 10

SPEECH/AUDIO

Generalizing RNN-Transducer to Out-Domain Audio via Sparse Self-Attention Layers

INTERSPEECH 2022. 09

SPEECH/AUDIO

Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning

INTERSPEECH 2022. 09

NLP

The Emotion is Not One-hot Encoding: Learning with Grayscale Label for Emotion Recognition in Conversation

INTERSPEECH 2022. 09

SPEECH/AUDIO

JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech

INTERSPEECH 2022. 09

COMPUTER VISION

Proxyless Neural Architecture Adaptation at Once

IEEE Access 2022. 09

COMPUTER VISION

Efficient Two-Stream Network for Online Video Action Segmentation

IEEE Access 2022. 08

COMPUTER VISION

Classification-based Multi-task Learning for Efficient Pose Estimation Network

ICPR 2022. 08

KNOWLEDGE GRAPH

ComDensE: Combined Dense Embedding of Relation-aware and Common Features for Knowledge Graph Completion

ICPR 2022. 08

KNOWLEDGE GRAPH

OASYS: Domain-Agnostic Automated System for Constructing Knowledge Base from Unstructured Text

Mining and Learning on Graphs Workshop at SIGKDD 2022. 08

MACHINE LEARNING

Connecting a Low Loss Subspace for Personalized Federated Learning

SIGKDD Research Track 2022. 08

NLP

Paraphrasing via Ranking Many Candidates

INLG 2022. 07

COMPUTER VISION

A Statistical Manifold Framework for Point Cloud Data

ICML 2022. 07

REINFORCEMENT LEARNING

Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems

SIGIR (Best Short Paper Honorable Mention) 2022. 07

NLP

CoMPM: Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation

NAACL 2022. 07

COMPUTER VISION

Contrastive Regularization for Semi-Supervised Learning

L3D-IVU Workshop at CVPR 2022. 06

COMPUTER VISION

X-ViT: High Performance Linear Vision Transformer without Softmax

Transformers for Vision Workshop at CVPR 2022. 06

NLP

Vacillating Human Correlation of SacreBLEU in Unprotected Languages

Human Evaluation of NLP Systems Workshop at ACL 2022. 05

NLP

Multimodal Interactions Using Pretrained Unimodal Models for SIMMC 2.0

DSTC10 Workshop at AAAI 2022. 02

COMPUTER VISION

Proxyless Neural Architecture Adaptation for Supervised Learning and Self-Supervised Learning

Learning Network Architecture during Training Workshop at AAAI 2022. 02

COMPUTER VISION

SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

NeurIPS 2021. 12

COMPUTER VISION

Learning Debiased Representation via Disentangled Feature Augmentation

NeurIPS Oral 2021. 12

NLP

Kakao Enterprise’s WMT21 Machine Translation using Terminologies Task Submission

WMT 2021 System Papers 2021. 11

NLP

Capturing Speaker Incorrectness: Speaker-Focused Post-Correction for Abstractive Dialogue Summarization

NewSum workshop, EMNLP 2021. 11

NLP

An Evaluation Dataset and Strategy for Building Robust Multi-turn Response Selection Model

EMNLP 2021. 11

NLP

AligNART: Non-autoregressive Neural Machine Translation by Jointly Learning to Estimate Alignment and Translate

EMNLP 2021. 11

COMPUTER VISION

Distilling Global and Local Logits with Densely Connected Relations

ICCV 2021. 10

SPEECH/AUDIO

Improving End-to-End Contextual Speech Recognition via a Word-Matching Algorithm with Backward Search

IEEE Signal Processing Letters 2021. 10

SPEECH/AUDIO

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

INTERSPEECH 2021. 08

NLP

Auxiliary Sequence Labeling Tasks for Disfluency Detection

INTERSPEECH 2021. 08

SPEECH/AUDIO

SE-Conformer: Time-Domain Speech Enhancement using Conformer

INTERSPEECH 2021. 08

NLP

Learning to Walk across Time for Interpretable Temporal Knowledge Graph Completion

SIGKDD Research Track Long Paper 2021. 08

NLP

Adaptive Batch Scheduling for Open-Domain Question Answering

IEEE Access 2021. 08

NLP

Deep Context- and Relation-Aware Learning for Aspect-based Sentiment Analysis

ACL-IJCNLP Main Conference 2021. 08

NLP

OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack

ACL-IJCNLP Findings of ACL 2021. 08

COMPUTER VISION

ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision

ICML Long Talk 2021. 07

SPEECH/AUDIO

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

ICML 2021. 07

SPEECH/AUDIO

U-Convolution Based Residual Echo Suppression With Multiple Encoders

ICASSP 2021. 06

SPEECH/AUDIO

Multitask Learning and Joint Optimization For Transformer-RNN-Transducer Speech Recognition

ICASSP 2021. 06

NLP

Korean Erroneous Sentence Classification with Integrated Eojeol Embedding

IEEE Access 2021. 06

COMPUTER VISION

Suppressing Spoof-irrelevant Factors for Domain-agnostic Face Anti-spoofing

IEEE Access 2021. 05

NLP

RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases

Proposes 'RYANSQL', a Text-to-SQL algorithm that applies the SPC technique to generate nested SELECT statements more accurately

Computational Linguistics 2021. 03

COMPUTER VISION

A Plug-in Method for Representation Factorization in Connectionist Models

Proposes 'FDEN', a technique that decomposes embedding vectors extracted from deep learning models into independent factors

IEEE Transactions on Neural Networks and Learning Systems 2021. 02

COMPUTER VISION

Multi-level Distance Regularization for Deep Metric Learning

Proposes 'MDR', a new regularization technique suited to deep metric learning

AAAI 2021. 02

NLP

Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection

Proposes 'UMS', a response selection technique that chooses utterances that fit the dialogue context while also having high semantic similarity

AAAI 2021. 02

SPEECH/AUDIO

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Proposes 'HiFi-GAN', a TTS model that quickly synthesizes high-quality speech audio

NeurIPS 2020. 12

SPEECH/AUDIO

Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Proposes 'Glow-TTS', a TTS model that uses a flow-based generative model and dynamic programming

NeurIPS Oral 2020. 12

NLP

Stable Style Transformer: Delete and Generate Approach with Encoder-Decoder for Text Style Transfer

Proposes 'SST', a new text style transfer model that uses non-parallel datasets

INLG 2020. 12

NLP

Reference and Document Aware Semantic Evaluation Methods for Korean Language Summarization

Proposes 'RDASS', a new metric for evaluating text summarization models

COLING 2020. 12

COMPUTER VISION

Face Video Retrieval Based on the Deep CNN With RBF Loss

IEEE Transactions on Image Processing 2020. 12

NLP

Sparse and Decorrelated Representations for Stable Zero-shot NMT

Proposes adopting the regularization technique 'SLNI' for robust zero-shot translation models

EMNLP Findings of ACL 2020. 11

NLP

Revisiting modularized multilingual NMT to meet industrial demands

Rediscovers 'M2NMT', a multilingual translation model architecture

EMNLP 2020. 11

NLP

AttnIO: Knowledge Graph Exploration with In-and-Out Attention Flow for Knowledge-Grounded Dialogue

Proposes 'AttnIO', a model that explores knowledge graph paths according to the dialogue context

EMNLP 2020. 11

SPEECH/AUDIO

Accelerating RNN Transducer Inference via Adaptive Expansion Search

Proposes 'AES', an adaptive search technique for accelerating E2E speech recognition

IEEE Signal Processing Letters 2020. 11

COMPUTER VISION

Learning Discriminative Part Features Through Attentions For Effective And Scalable Person Search

Proposes a technique that performs both person detection and person search in a single deep learning inference pass

ICIP 2020. 10

SPEECH/AUDIO

JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment

Proposes 'JDI-T', an architecture that jointly trains a speech synthesis model and a phoneme-to-audio alignment model

INTERSPEECH 2020. 10

COMPUTER VISION

Diversified Mutual Learning for Deep Metric Learning

Proposes a technique that improves image retrieval performance through mutual learning among multiple models

ECCV workshop on TASK-CV 2020. 09

COMPUTER VISION

BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition

Proposes 'BroadFace', a technique for effectively learning from the faces of a very large number of people

ECCV 2020. 08

COMPUTER VISION

Deep Metric Learning with Multi-Objective Functions

Proposes a new metric learning technique for efficient fashion image retrieval

CVPR workshop on CVFAD 2020. 06

COMPUTER VISION

GroupFace: Learning Latent Groups and Constructing Group-based Representations for Face Recognition

Proposes 'GroupFace', a new architecture specialized for face recognition

CVPR 2020. 06

SPEECH/AUDIO

Affective Latent Representation of Acoustic and Lexical Features for Emotion Recognition

Proposes an architecture that recognizes human emotion effectively by jointly reflecting the acoustic and lexical features of speech

SENSORS 2020. 05

SPEECH/AUDIO

Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks

Proposes a technique that uses CycleGAN to convert voice styles among multiple speakers

ICASSP 2020. 02

COMPUTER VISION

Attentional Feature-Pair Relation Networks for Accurate Face Recognition

ICCV 2019. 10

COMPUTER VISION

BASN: Enriching Feature Representation Using Bipartite Auxiliary Supervision for Face Anti-Spoofing

ICCV Workshop on DFW 2019. 10

NLP

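Web-Based Open-Domain Korean Question Answering Using Machine Reading Comprehension

Annual Conference on Human and Cognitive Language Technology (HCLT) 2019. 10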

NLP

Eojeol-Level Embedding for Error-Robust Korean Sentence Classification in Korean Chatbots

Annual Conference on Human and Cognitive Language Technology (HCLT) 2019. 10

NLP

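A Study on Building an Annotated Corpus for Automatic Classification of Open-Domain Questions

Annual Conference on Human and Cognitive Language Technology (HCLT) 2019. 10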

NLP

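Deep Learning-Based Answer Type Classifier Considering Topicality in Korean Question Answering

Annual Conference on Human and Cognitive Language Technology (HCLT) 2019. 10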

SPEECH/AUDIO

Speech Enhancement Using a Two-Stage Network for an Efficient Boosting Strategy

ICASSP 2019. 05

NLP

DNN-based Emotion Recognition based on Bottleneck Acoustic Features and Lexical Features