| 1 |
EACL 2024 |
Autoregressive Score Generation for Multi-trait Essay Scoring |
| 2 |
ICLR |
Learning Energy Decompositions for Partial Inference of GFlowNets |
| 3 |
ICLR |
A Simple and Scalable Representation for Graph Generation |
| 4 |
ICLR |
Graph Generation with K2-trees |
| 5 |
WACV |
Interactive Network Perturbation between Teacher and Students for Semi-Supervised Semantic Segmentation |
| 6 |
ISBI |
MODALITY-AGNOSTIC STYLE TRANSFER FOR HOLISTIC FEATURE IMPUTATION |
| 7 |
AAAI |
Active learning guided by efficient surrogate learners |
| 8 |
AAAI |
Robust distributed gradient aggregation using projections onto gradient manifolds |
| 9 |
LREC-COLING |
Denoising Table-Text Retrieval for Open-Domain Question Answering |
| 10 |
LREC-COLING |
Explainable Multi-hop Question Generation: An End-to-End Approach without Intermediate Question Labeling |
| 11 |
WACV |
Efficient Semantic Matching with Hypercolumn Correlation |
| 12 |
CVPR |
Contrastive Mean-Shift Learning for Generalized Category Discovery |
| 13 |
CVPR |
Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences |
| 14 |
CVPR |
Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform |
| 15 |
ICAIIC |
Empirical Investigation of Adversarial Attacks for Semi-Supervised Object Detection |
| 16 |
WWW'24 |
Improving Retrieval in Theme-specific Applications using a Corpus Topical Taxonomy |
| 17 |
WWW'24 |
Top-Personalized-K Recommendation |
| 18 |
WWW'24 |
Doubly Calibrated Estimator for Recommendation on Data Missing Not At Random |
| 19 |
LREC-COLING |
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark |
| 20 |
AAAI |
FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields |
| 21 |
ICLR |
Pac Prediction Sets under Label Shift |
| 22 |
ICML |
Hybrid Neural Representations for Spherical Data |
| 23 |
ICML |
Gaussian Plane-Wave Neural Operator For Electron Density Estimation |
| 24 |
NAACL |
TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction |
| 25 |
CVPR |
MedBN: Robust Test Time Adaptation against Malicious Test Samples |
| 26 |
CVPR |
CLIPtone: Unsupervised Learning for Text-based Image Tone Adjustment |
| 27 |
ACL |
Multi-Dimensional Optimization for Text Summarization via Reinforcement Learning |
| 28 |
S&P |
Few-shot Unlearning |
| 29 |
ICML |
Active Label Correction for Semantic Segmentation with Foundation Models |
| 30 |
ICML |
Breadth-First Exploration in Adaptive Grid-based Reinforcement Learning |
| 31 |
ICML |
Mitigating Oversmoothing Through Reverse Process of GNNs for Heterophilic Graphs |
| 32 |
IJCAI |
EPIC: Graph Augmentation with Edit Path Interpolation via Learnable Cost |
| 33 |
NAACL |
Extending CLIP's Image-Text Alignment to Referring Image Segmentation |
| 34 |
ICML |
Improving Robustness to Multiple Spurious Correlations by Multi-Objective Optimization |
| 35 |
AIED |
Aspect-based Semantic Textual Similarity for Educational Test Items |
| 36 |
NAACL |
Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents |
| 37 |
ICLR |
Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations |
| 38 |
ICML |
Neurodegenerative Brain Network Classification via Adaptive Diffusion with Temporal Regularization |
| 39 |
CVPR |
Burst Image Super-Resolution with Base Frame Selection |
| 40 |
ICML |
3D Geometric Shape Assembly via Efficient Point Cloud Matching |
| 41 |
LATIN |
Minimum-Width Double-Slabs and Widest Empty Slabs in High Dimensions |
| 42 |
CCCG |
Guarding Points on a Terrain by Watchtowers |
| 43 |
CVPR |
Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection |
| 44 |
NAACL |
Exploring Language Model’s Code Generation Ability with Auxiliary Functions |
| 45 |
NAACL |
Rectifying Demonstration Shortcut in In-Context Learning |
| 46 |
KDD |
Continual Collaborative Distillation for Recommender System |
| 47 |
CVPR |
ParamISP: Learned Forward and Inverse ISPs using Camera Parameters |
| 48 |
CVPR |
Generalizable Novel-View Synthesis using a Stereo Camera |
| 49 |
SIGGRAPH |
Deep Hybrid Camera Deblurring for Smartphone Cameras |
| 50 |
AAAI |
Learning to Approximate Adaptive Kernel Convolution on Graphs |
| 51 |
ICUFN |
A Word-axis Speaker Embedding Trained with Multi-Speaker Analysis Task |
| 52 |
NeurIPS |
3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction |
| 53 |
NeurIPS |
ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation |
| 54 |
NeurIPS |
Active Learning for Semantic Segmentation with Multi-class Label Query |
| 55 |
ECCV |
BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models |
| 56 |
NeurIPS |
Bootstrapping Top-down Information for Self-modulating Slot Attention |
| 57 |
ECCV |
Classification Matters: Improving Video Action Detection with Class-Specific Attention |
| 58 |
ACCV |
CNG-SFDA: Clean-and-Noisy Region Guided Online-Offline Source-Free Domain Adaptation |
| 59 |
ACCV |
Diffusion Model Compression for Image-to-Image Translation |
| 60 |
ECCV |
Distilling Diffusion Models into Conditional GANs |
| 61 |
ECCV |
Distributed active client selection with noisy clients using model association scores |
| 62 |
ECCV |
Efficient and Versatile Robust Fine-Tuning of Zero-shot Models |
| 63 |
EMNLP |
Eliciting Instruction-tuned Code Language Models' Capabilities to Utilize Auxiliary Function for Code Generation |
| 64 |
Interspeech |
Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert |
| 65 |
AAAI |
Feature Unlearning for Pre-trained GANs and VAEs |
| 66 |
ECCV |
FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models |
| 67 |
EMNLP 2024 |
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards |
| 68 |
ECCV |
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation |
| 69 |
CVPR |
Learning Correlation Structures for Vision Transformers |
| 70 |
ECCV |
Learning-based Axial Video Motion Magnification |
| 71 |
BMVC |
MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation |
| 72 |
MICCAI |
Multi-Modal Graph Neural Network with Transformer-Guided Adaptive Diffusion for Preclinical Alzheimer Classification |
| 73 |
MICCAI |
Multi-order Simplex-based Graph Neural Network for Brain Network Analysis |
| 74 |
Interspeech |
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset |
| 75 |
ECCV |
NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image |
| 76 |
MICCAI |
OCL: Ordinal Contrastive Learning for Imputating Features with Progressive Labels |
| 77 |
ECCV |
Online Temporal Action Localization with Memory-Augmented Transformer |
| 78 |
ECCV |
PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery |
| 79 |
EMNLP |
Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization |
| 80 |
ACCV |
RNA: Video Editing with ROI-based Neural Atlas |
| 81 |
NeurIPS |
Selective Generation for Controllable Language Models |
| 82 |
EMNLP |
Taxonomy-guided Semantic Indexing for Academic Paper Search |
| 83 |
ECCV |
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers |
| 84 |
ECCV |
Towards More Practical Group Activity Detection: A New Benchmark and Model |
| 85 |
WACV |
UGPNet: Universal Generative Prior for Image Restoration |
| 86 |
MICCAI |
Uncertainty-aware Diffusion-based Adversarial Attack for Realistic Colonoscopy Image Synthesis |
| 87 |
AAAI |
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations |
| 88 |
Interspeech |
Key-Element-Informed sLLM Tuning for Document Summarization |
| 89 |
Pacific Graphics |
High-Quality Geometry and Texture Editing of Neural Radiance Field |
| 90 |
Pacific Graphics |
Inverse Rendering of Translucent Objects with Shape-Adaptive Importance Sampling |
| 91 |
ECCV |
Deep Cost Ray Fusion for Sparse Depth Video Completion |
| 92 |
ACM SIGGRAPH |
Toonify3D: StyleGAN-based 3D Stylized Face Generator |
| 93 |
CVPR |
Discontinuity-preserving Normal Integration with Auxiliary Edges |
| 94 |
CVPR |
ParamISP: Learned Forward and Inverse ISPs using Camera Parameters |
| 95 |
ECCV |
MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory |
| 96 |
ECCV |
FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions |
| 97 |
WACV |
Learning Unified Distance Metric Across Diverse Data Distributions with Parameter-Efficient Transfer Learning |
| 98 |
WACV |
Boosting Semi-supervised Video Action Detection with Temporal Context |
| 99 |
ICLR |
CAS: A Probability-Based Approach for Universal Condition Alignment Score |
| 100 |
AIED |
Examining the Impact of Flipped Learning for Developing Young Job Seekers’ AI Literacy |
| 101 |
ICML |
Exploring the Enigma of Neural Dynamics Through A Scattering-Transform Mixer Landscape for Riemannian Manifold |
| 102 |
ICLR |
Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions |
| 103 |
WACV |
LaughTalk: Expressive 3D Talking Head Generation with Laughter |
| 104 |
CVPR |
MoReVQA: Exploring Modular Reasoning Models for Video Questiong Answering |
| 105 |
ICLR |
Noise Map Guidance: Inversion with Spatial Context for Real Image Editing |
| 106 |
CVPR |
Object-Centric Domain Randomization |
| 107 |
WACV |
Optical Flow Domain Adaptation via Target Style Transfer |
| 108 |
AAAI |
Owq: Outlier-aware weight quantization for efficient fine-tuning and inference of large language models |
| 109 |
CVPR |
Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering |
| 110 |
EMNLP |
QEFT: Quantization for Efficient Fine-Tuning of LLMs |
| 111 |
WACV |
Self-supervised Learning of Semantic Correspondence Using Web Videos |
| 112 |
NAACL |
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models |
| 113 |
MICCAI |
Uncovering Cortical Pathways of Prion-like Pathology Spreading in Alzheimer’s Disease by Neural Optimal Mass Transport |
| 114 |
CVPR |
Differentiable Display Photometric Stereo |
| 115 |
SIGDIAL |
An Investigation Into Explainable Audio Hate Speech Detection |
| 116 |
EACL |
Multi-Level Attention Aggregation for Language-Agnostic Speaker Replication |
| 117 |
ICASSP |
Leveraging Effective Language and Speaker Conditioning in Indic TTS for LIMMITS 2024 Challenge |
| 118 |
LREC-COLING |
Leveraging the Interplay Between Syntatic and Acoustic Cues for Optimizing Korean TTS Pause Formation |
| 119 |
Interspeech |
Acoustic Feature Mixup for Balanced Multi-aspect Pronunciation Assessment |
| 120 |
SIGDIAL |
DiagESC: Dialogue Synthesis for Integrating Depression Diagnosis into Emotional Support Conversation |
| 121 |
SIGDIAL |
Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning |
| 122 |
EMNLP |
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages |
| 123 |
EMNLP |
Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing |
| 124 |
EMNLP |
Audio-Based Linguistic Feature Extraction for Enhancing Multi-lingual and Low-Resource Text-to-Speech |