All 175 CVPR 26 ICCV 8 ECCV 12 NeurIPS 15 ICLR 2 ICML 4 ACMMM 15 ACL 3 AAAI 17 IJCAI 10 TPAMI 13 IJCV 18 TIP 7 arXiv 24 Others 1
No publications match your search.

2026

UHR-Micro: Diagnosing and Mitigating the Resolution Illusion
arXiv
UHR-Micro: Diagnosing and Mitigating the Resolution Illusion in Earth Observation VLMs
arXiv, 2026
Seirenes: Adversarial Self-play with Evolving Distractions f
arXiv
Seirenes: Adversarial Self-play with Evolving Distractions for LLM Reasoning
arXiv, 2026
DocScope: Benchmarking Verifiable Reasoning for Trustworthy
arXiv
DocScope: Benchmarking Verifiable Reasoning for Trustworthy Long-document Understanding
arXiv, 2026
Any2Any: Unified Arbitrary Modality Translation for Remote S
ICML
Any2Any: Unified Arbitrary Modality Translation for Remote Sensing
ICML, 2026
Text Before Vision: Staged Knowledge Injection Matters for A
ICML
Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-high-resolution Remote Sensing Understanding
ICML, 2026
Degradation-aware Metric Prompting for Hyperspectral Image R
ICML
Degradation-aware Metric Prompting for Hyperspectral Image Restoration
ICML, 2026
SAMe: A Semantic Anatomy Mapping Engine for Robotic Ultrasou
arXiv
SAMe: A Semantic Anatomy Mapping Engine for Robotic Ultrasound
arXiv, 2026
Omni-I2C: A Holistic Benchmark for High-fidelity Image-to-co
ACL
Omni-I2C: A Holistic Benchmark for High-fidelity Image-to-code Generation
ACL, 2026
Event-based Simultaneous Localization and Mapping: A Compreh
IJCV
Event-based Simultaneous Localization and Mapping: A Comprehensive Survey
IJCV, 2026
Universal Pansharpening Foundation Model
arXiv
Universal Pansharpening Foundation Model
arXiv, 2026
Seeing Clearly without Training: Mitigating Hallucinations i
arXiv
Seeing Clearly without Training: Mitigating Hallucinations in Multimodal LLMs for Remote Sensing
arXiv, 2026
Heuristic-inspired Reasoning Priors Facilitate Data-efficien
CVPR
Heuristic-inspired Reasoning Priors Facilitate Data-efficient Referring Object Detection
CVPR, 2026
GeoBridge: A Semantic-anchored Multi-view Foundation Model B
CVPR
GeoBridge: A Semantic-anchored Multi-view Foundation Model Bridging Images and Text for Geo-localization
CVPR, 2026
UniGeoSeg: Towards Unified Open-world Segmentation for Geosp
CVPR
UniGeoSeg: Towards Unified Open-world Segmentation for Geospatial Scenes
CVPR, 2026
SARMAE: Masked Autoencoder for SAR Representation Learning
CVPR
SARMAE: Masked Autoencoder for SAR Representation Learning
CVPR, 2026
Perceptual-evidence Anchored Reinforced Learning for Multimo
CVPR
Perceptual-evidence Anchored Reinforced Learning for Multimodal Reasoning
CVPR, 2026
Residual Diffusion Bridge Model for Image Restoration
CVPRHighlight
Residual Diffusion Bridge Model for Image Restoration
CVPR, 2026
DeepSketcher: Internalizing Visual Manipulation for Multimod
CVPR
DeepSketcher: Internalizing Visual Manipulation for Multimodal Reasoning
CVPR Findings, 2026
GeoEyes: On-demand Visual Focusing for Evidence-grounded Und
arXiv
GeoEyes: On-demand Visual Focusing for Evidence-grounded Understanding of Ultra-high-resolution Remote Sensing Imagery
arXiv, 2026
VLRS-Bench: A Vision-language Reasoning Benchmark for Remote
arXiv
VLRS-Bench: A Vision-language Reasoning Benchmark for Remote Sensing
arXiv, 2026
JOintGS: Joint Optimization of Cameras, Bodies and 3D Gaussi
arXiv
JOintGS: Joint Optimization of Cameras, Bodies and 3D Gaussians for In-the-wild Monocular Reconstruction
arXiv, 2026
AnesSuite: A Comprehensive Benchmark and Dataset Suite for A
ICLR
AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs
ICLR, 2026

2025

CrossEarth: Geospatial Vision Foundation Model for Domain Ge
TPAMI
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation
IEEE TPAMI, 2025
GeoZero: Incentivizing Reasoning from Scratch on Geospatial
arXiv
GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes
arXiv, 2025
S5: Scalable Semi-supervised Semantic Segmentation in Remote
AAAIOral
S5: Scalable Semi-supervised Semantic Segmentation in Remote Sensing
AAAI, 2025
RoMA: Scaling up Mamba-based Foundation Models for Remote Se
NeurIPS
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing
NeurIPS, 2025
GeoLLaVA-8K: Scaling Remote-sensing Multimodal Large Languag
NeurIPSSpotlight
GeoLLaVA-8K: Scaling Remote-sensing Multimodal Large Language Models to 8K Resolution
NeurIPS, 2025
DGSolver: Diffusion Generalist Solver with Universal Posteri
NeurIPS
DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration
NeurIPS, 2025
REX-RAG: Reasoning Exploration with Policy Correction in Ret
arXiv
REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-augmented Generation
arXiv, 2025
Synergistic Prompting for Robust Visual Recognition with Mis
ICCV
Synergistic Prompting for Robust Visual Recognition with Missing Modalities
ICCV, 2025
Rethink Sparse Signals for Pose-guided Text-to-image Generat
ICCV
Rethink Sparse Signals for Pose-guided Text-to-image Generation
ICCV, 2025
Harnessing Massive Satellite Imagery with Efficient Masked I
ICCV
Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling
ICCV, 2025
High-quality Pseudo-labeling for Point Cloud Segmentation wi
TPAMI
High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation
IEEE TPAMI, 2025
๐Ÿ“„
arXiv
OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-spheres Interactions with Multimodal Observational Earth Data
arXiv, 2025
LogicOCR: Do Your Large Multimodal Models Excel at Logical R
arXiv
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-rich Images?
arXiv, 2025
Advances in Radiance Field for Dynamic Scene: From Neural Fi
arXiv
Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field
arXiv, 2025
Dynamic Parallel Tree Search for Efficient LLM Reasoning
ACL
Dynamic Parallel Tree Search for Efficient LLM Reasoning
ACL, 2025
MapNav: A Novel Memory Representation via Annotated Semantic
ACL
MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-language Navigation
ACL, 2025
TiMo: Spatiotemporal Foundation Model for Satellite Image Ti
arXiv
TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series
arXiv, 2025
SafeMap: Robust HD Map Construction from Incomplete Observat
ICML
SafeMap: Robust HD Map Construction from Incomplete Observations
ICML, 2025
Human-imperceptible, Machine-recognizable Images
IJCAI
Human-imperceptible, Machine-recognizable Images
IJCAI, 2025
๐Ÿ“„
IJCAI
DDPA-3DVG: Vision-language Dual-decoupling and Progressive Alignment for 3D Visual Grounding
IJCAI, 2025
๐Ÿ“„
IJCAI
BEVTrack: A Simple and Strong Baseline for 3D Single Object Tracking in Bird's-Eye View
IJCAI, 2025
๐Ÿ“„
IJCAI
Open-vocabulary Fine-grained Hand Action Detection
IJCAI, 2025
Unified Domain Adaptive Semantic Segmentation
TPAMI
Unified Domain Adaptive Semantic Segmentation
IEEE TPAMI, 2025
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundat
TPAMI
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
IEEE TPAMI, 2025
InstructVEdit: A Holistic Approach for Instructional Video E
arXiv
InstructVEdit: A Holistic Approach for Instructional Video Editing
arXiv, 2025
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely
CVPRHighlight
XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-high-resolution Remote Sensing Imagery?
CVPR, 2025
๐Ÿ“„
CVPR
SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-image Pretraining
CVPR, 2025

2024

General Class-balanced Multicentric Dynamic Prototype Pseudo
IJCV
General Class-balanced Multicentric Dynamic Prototype Pseudo-labeling for Source-free Domain Adaptation
IJCV, 2024
๐Ÿ“„
AAAI
UAWTrack: Universal 3D Single Object Tracking in Adverse Weather
AAAI, 2024
๐Ÿ“„
AAAI
Semi-supervised Infrared Small Target Detection with Thermodynamic-inspired Uneven Perturbation and Confidence Adaptation
AAAI, 2024
๐Ÿ“„
AAAI
MOCID: Motion Context and Displacement Information Learning for Moving Infrared Small Target Detection
AAAI, 2024
Hi-SAM: Marrying Segment Anything Model for Hierarchical Tex
TPAMI
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
IEEE TPAMI, 2024
Is Your HD Map Constructor Reliable under Sensor Corruptions
NeurIPS
Is Your HD Map Constructor Reliable under Sensor Corruptions?
NeurIPS, 2024
GoMatching: A Simple Baseline for Video Text Spotting via Lo
NeurIPS
GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
NeurIPS, 2024
๐Ÿ“„
IJCV
Learning General and Specific Embedding with Transformer for Few-shot Object Detection
IJCV, 2024
HandRefiner: Refining Malformed Hands in Generated Images by
ACMMM
HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
ACM MM, 2024
Multi-granularity Hand Action Detection
ACMMM
Multi-granularity Hand Action Detection
ACM MM, 2024
๐Ÿ“„
ACMMM
SAR-SLAM: Self-attentive Rendering-based SLAM with Neural Point Cloud Encoding
ACM MM, 2024
Unleashing the Power of Generic Segmentation Model: A Simple
ACMMM
Unleashing the Power of Generic Segmentation Model: A Simple Baseline for Infrared Small Target Detection
ACM MM, 2024
IRSAM: Advancing Segment Anything Model for Infrared Small T
ECCV
IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection
ECCV, 2024
MapDistill: Boosting Efficient Camera-based HD Map Construct
ECCV
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation
ECCV, 2024
ESceme: Vision-and-language Navigation with Episodic Scene M
IJCV
ESceme: Vision-and-language Navigation with Episodic Scene Memory
IJCV, 2024
PoseBench: Benchmarking the Robustness of Pose Estimation Mo
arXiv
PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions
arXiv, 2024
A Survey on Self-supervised Learning: Algorithms, Applicatio
TPAMI
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
IEEE TPAMI, 2024
๐Ÿ“„
TIP
Expanding and Refining Hybrid Compressors for Efficient Object Re-identification
IEEE TIP, 2024
LeMeViT: Efficient Vision Transformer with Learnable Meta To
IJCAI
LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation
IJCAI, 2024
UniMix: Towards Domain Adaptive and Generalizable LiDAR Sema
CVPR
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
CVPR, 2024
A Semi-supervised Nighttime Dehazing Baseline with Spatial-f
CVPR
A Semi-supervised Nighttime Dehazing Baseline with Spatial-frequency Aware and Realistic Brightness Constraint
CVPR, 2024
๐Ÿ“„
Workshop
From Pixels to Preservation: The Power of Large Vision Models in Heritage Content Understanding
SUMAC @ ACM Multimedia, 2024

2023

Pruning Self-attentions into Convolutional Layers in Single
TPAMI
Pruning Self-attentions into Convolutional Layers in Single Path
IEEE TPAMI, 2023
APTv2: Benchmarking Animal Pose Estimation and Tracking with
arXiv
APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond
arXiv, 2023
SurgicalPart-SAM: Part-to-whole Collaborative Prompting for
arXiv
SurgicalPart-SAM: Part-to-whole Collaborative Prompting for Surgical Instrument Segmentation
arXiv, 2023
Vision Transformer with Quadrangle Attention
TPAMI
Vision Transformer with Quadrangle Attention
IEEE TPAMI, 2023
Decomposing Semantic Shifts for Composed Image Retrieval
AAAI
Decomposing Semantic Shifts for Composed Image Retrieval
AAAI, 2023
๐Ÿ“„
AAAI
IRPruneDet: Efficient Infrared Small Target Detection via Wavelet Structure-regularized Soft Channel Pruning
AAAI, 2023
SurgicalSAM: Efficient Class Promptable Surgical Instrument
AAAI
SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation
AAAI, 2023
SimDistill: Simulated Multi-modal Distillation for BEV 3D Ob
AAAI
SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection
AAAI, 2023
Grounded Affordance from Exocentric View
IJCV
Grounded Affordance from Exocentric View
IJCV, 2023
ViTPose++: Vision Transformer for Generic Body Pose Estimati
TPAMI
ViTPose++: Vision Transformer for Generic Body Pose Estimation
IEEE TPAMI, 2023
๐Ÿ“„
TPAMI
On Exploring Multiplicity of Primitives and Attributes for Texture Recognition in the Wild
IEEE TPAMI, 2023
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with S
NeurIPS
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model
NeurIPS, 2023
End-to-end One-shot Human Parsing
TPAMI
End-to-end One-shot Human Parsing
IEEE TPAMI, 2023
GraMMaR: Ground-aware Motion Model for 3D Human Motion Recon
ACMMM
GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction
ACM MM, 2023
AniPixel: Towards Animatable Pixel-aligned Human Avatar
ACMMM
AniPixel: Towards Animatable Pixel-aligned Human Avatar
ACM MM, 2023
Unifying Flow, Stereo and Depth Estimation
TPAMI
Unifying Flow, Stereo and Depth Estimation
IEEE TPAMI, 2023
๐Ÿ“„
ICCV
Domain Specified Optimization for Deployment Authorization
ICCV, 2023
ESSAformer: Efficient Transformer for Hyperspectral Image Su
ICCV
ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution
ICCV, 2023
Sensitivity-aware Visual Parameter-efficient Fine-tuning
ICCV
Sensitivity-aware Visual Parameter-efficient Fine-tuning
ICCV, 2023
๐Ÿ“„
IJCV
Deep Corner
IJCV, 2023
Transformer-based Context Condensation for Boosting Feature
IJCV
Transformer-based Context Condensation for Boosting Feature Pyramids in Object Detection
IJCV, 2023
Learning to Purification for Unsupervised Person Re-identifi
TIP
Learning to Purification for Unsupervised Person Re-identification
IEEE TIP, 2023
Scalable Mask Annotation for Video Text Spotting
arXiv
Scalable Mask Annotation for Video Text Spotting
arXiv, 2023
๐Ÿ“„
IJCV
VNAS: Variational Neural Architecture Search
IJCV, 2023
OSP2B: One-stage Point-to-box Network for 3D Siamese Trackin
IJCAI
OSP2B: One-stage Point-to-box Network for 3D Siamese Tracking
IJCAI, 2023
DCN-T: Dual Context Network with Transformer for Hyperspectr
TIP
DCN-T: Dual Context Network with Transformer for Hyperspectral Image Classification
IEEE TIP, 2023
Deep Image Matting: A Comprehensive Survey
arXiv
Deep Image Matting: A Comprehensive Survey
arXiv, 2023
Rethinking Portrait Matting with Privacy Preserving
IJCV
Rethinking Portrait Matting with Privacy Preserving
IJCV, 2023
Deep Learning for Camera Calibration and Beyond: A Survey
arXiv
Deep Learning for Camera Calibration and Beyond: A Survey
arXiv, 2023
๐Ÿ“„
CVPR
Leverage Interactive Affinity for Affordance Learning
CVPR, 2023
DeepSolo: Let Transformer Decoder with Explicit Points Solo
CVPR
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
CVPR, 2023
CLAMP: Prompt-based Contrastive Learning for Connecting Lang
CVPR
CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose
CVPR, 2023
Referring Image Matting
CVPR
Referring Image Matting
CVPR, 2023
Dynamic Focus-aware Positional Queries for Semantic Segmenta
CVPR
Dynamic Focus-aware Positional Queries for Semantic Segmentation
CVPR, 2023

2022

๐Ÿ“„
TPAMI
IC9600: A Benchmark Dataset for Automatic Image Complexity Assessment
IEEE TPAMI, 2022
GLT-T: Global-local Transformer Voting for 3D Single Object
AAAI
GLT-T: Global-local Transformer Voting for 3D Single Object Tracking in Point Clouds
AAAI, 2022
Learning to Learn Better for Video Object Segmentation
AAAI
Learning to Learn Better for Video Object Segmentation
AAAI, 2022
DPText-DETR: Towards Better Scene Text Detection with Dynami
AAAI
DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
AAAI, 2022
APT-36K: A Large-scale Benchmark for Animal Pose Estimation
NeurIPSSpotlight
APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
NeurIPS, 2022
๐Ÿ“„
NeurIPS
Exploring Figure-ground Assignment Mechanism in Perceptual Organization
NeurIPS, 2022
Watermarking for Out-of-distribution Detection
NeurIPSSpotlight
Watermarking for Out-of-distribution Detection
NeurIPS, 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose
NeurIPS
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
NeurIPS, 2022
Information-theoretic Odometry Learning
IJCV
Information-theoretic Odometry Learning
IJCV, 2022
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
ECCV
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
ECCV, 2022
JPerceiver: Joint Perception Network for Depth, Pose and Lay
ECCV
JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes
ECCV, 2022
FakeCLR: Exploring Contrastive Learning for Solving Latent D
ECCV
FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-efficient GANs
ECCV, 2022
ReAct: Temporal Action Detection with Relational Queries
ECCV
ReAct: Temporal Action Detection with Relational Queries
ECCV, 2022
Towards Scale-aware, Robust, and Generalizable Unsupervised
ECCV
Towards Scale-aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics
ECCV, 2022
VSA: Learning Varied-size Window Attention in Vision Transfo
ECCV
VSA: Learning Varied-size Window Attention in Vision Transformers
ECCV, 2022
BMD: A General Class-balanced Multicentric Dynamic Prototype
ECCV
BMD: A General Class-balanced Multicentric Dynamic Prototype Strategy for Source-free Domain Adaptation
ECCV, 2022
Towards Data-efficient Detection Transformers
ECCV
Towards Data-efficient Detection Transformers
ECCV, 2022
PolyphonicFormer: Unified Query Learning for Depth-aware Vid
ECCV
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
ECCV, 2022
RegionCL: Exploring Contrastive Region Pairs for Self-superv
ECCV
RegionCL: Exploring Contrastive Region Pairs for Self-supervised Representation Learning
ECCV, 2022
๐Ÿ“„
ACMMM
GT-MUST: Gated Try-on by Learning the Mannequin-specific Transformation
ACM MM, 2022
๐Ÿ“„
ACMMM
Exploring Feature Compensation and Cross-level Correlation for Infrared Small Target Detection
ACM MM, 2022
๐Ÿ“„
ACMMM
RKformer: Runge-kutta Transformer with Random-connection Attention for Infrared Small Target Detection
ACM MM, 2022
One-shot Object Affordance Detection in the Wild
IJCV
One-shot Object Affordance Detection in the Wild
IJCV, 2022
Toward real-world single image deraining: A new benchmark an
arXiv
Toward real-world single image deraining: A new benchmark and beyond
arXiv, 2022
DUT: Learning video stabilization by simply watching unstabl
TIP
DUT: Learning video stabilization by simply watching unstable videos
IEEE TIP, 2022
๐Ÿ“„
IJCAI
SAR-to-Optical Image Translation via Neural Partial Differential Equations
IJCAI, 2022
A Comprehensive Survey on Data-efficient GANs in Image Gener
arXiv
A Comprehensive Survey on Data-efficient GANs in Image Generation
arXiv, 2022
I3CL: Intra-and Inter-instance Collaborative Learning for Ar
IJCV
I3CL: Intra-and Inter-instance Collaborative Learning for Arbitrary-shaped Scene Text Detection
IJCV, 2022
DearKD: Data-efficient Early Knowledge Distillation for Visi
CVPR
DearKD: Data-efficient Early Knowledge Distillation for Vision Transformers
CVPR, 2022
๐Ÿ“„
CVPR
ISNet: Shape Matters for Infrared Small Target Detection
CVPR, 2022
RU-Net: Regularized Unrolling Network for Scene Graph Genera
CVPR
RU-Net: Regularized Unrolling Network for Scene Graph Generation
CVPR, 2022
Learning Affordance Grounding from Exocentric Images
CVPR
Learning Affordance Grounding from Exocentric Images
CVPR, 2022
FIBA: Frequency-injection based Backdoor Attack in Medical I
CVPR
FIBA: Frequency-injection based Backdoor Attack in Medical Image Analysis
CVPR, 2022
Recurrent Glimpse-based Decoder for Detection with Transform
CVPROral
Recurrent Glimpse-based Decoder for Detection with Transformer
CVPR, 2022
GMFlow: Learning Optical Flow via Global Matching
CVPROral
GMFlow: Learning Optical Flow via Global Matching
CVPR, 2022
ViTAEv2: Vision Transformer Advanced by Exploring Inductive
IJCV
ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond
IJCV, 2022
๐Ÿ“„
ICLR
FP-DETR: Detection Transformer Advanced by Fully Pre-training
ICLR, 2022

2021

๐Ÿ“„
TIP
Robust Object Detection via Adversarial Novel Style Exploration
IEEE TIP, 2021
SASA: Semantics-augmented Set Abstraction for Point-based 3D
AAAI
SASA: Semantics-augmented Set Abstraction for Point-based 3D Object Detection
AAAI, 2021
Visual Semantics Allow for Textual Reasoning Better in Scene
AAAI
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
AAAI, 2021
Siamese Network with Interactive Transformer for Video Objec
AAAI
Siamese Network with Interactive Transformer for Video Object Segmentation
AAAI, 2021
Wide-angle Image Rectification: A Survey
IJCV
Wide-angle Image Rectification: A Survey
IJCV, 2021
๐Ÿ“„
IJCV
CODON: On orchestrating cross-domain attentions for depth super-resolution
IJCV, 2021
Bridging Composite and Real: Towards End-to-end Deep Image M
IJCV
Bridging Composite and Real: Towards End-to-end Deep Image Matting
IJCV, 2021
AP-10K: A Benchmark for Animal Pose Estimation in the Wild
NeurIPS
AP-10K: A Benchmark for Animal Pose Estimation in the Wild
NeurIPS, 2021
ViTAE: Vision Transformer Advanced by Exploring Intrinsic In
NeurIPS
ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias
NeurIPS, 2021
Towards High Performance Human Keypoint Detection
IJCV
Towards High Performance Human Keypoint Detection
IJCV, 2021
Deep Automatic Natural Image Matting
IJCAI
Deep Automatic Natural Image Matting
IJCAI, 2021
One-shot Affordance Detection
IJCAI
One-shot Affordance Detection
IJCAI, 2021
A Comprehensive Survey on Image Dehazing Based on Deep Learn
IJCAI
A Comprehensive Survey on Image Dehazing Based on Deep Learning
IJCAI, 2021
Out-of-boundary View Synthesis Towards Full-frame Video Stab
ICCV
Out-of-boundary View Synthesis Towards Full-frame Video Stabilization
ICCV, 2021
Exploring Sequence Feature Alignment for Domain Adaptive Det
ACMMM
Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers
ACM MM, 2021
DSP: Dual Soft-paste for Unsupervised Domain Adaptive Semant
ACMMM
DSP: Dual Soft-paste for Unsupervised Domain Adaptive Semantic Segmentation
ACM MM, 2021
Privacy-preserving Portrait Matting
ACMMM
Privacy-preserving Portrait Matting
ACM MM, 2021

2020

Progressive One-shot Human Parsing
AAAI
Progressive One-shot Human Parsing
AAAI, 2020
SIR: Self-supervised image rectification via seeing the same
TIP
SIR: Self-supervised image rectification via seeing the same scene from multiple different lenses
IEEE TIP, 2020
๐Ÿ“„
NeurIPS
Auto Learning Attention
NeurIPS, 2020
Nighttime Dehazing with a Synthetic Benchmark
ACMMM
Nighttime Dehazing with a Synthetic Benchmark
ACM MM, 2020
๐Ÿ“„
IJCV
Recursive Context Routing for Object Detection
IJCV, 2020
๐Ÿ“„
CVPROral
Deep Degradation Prior for Low-quality Image Classification
CVPR, 2020

2019

๐Ÿ“„
NeurIPS
Learn, Imagine and Create: Text-to-image Generation from Prior Knowledge
NeurIPS, 2019
Category Anchor-guided Unsupervised Domain Adaptation for Se
NeurIPS
Category Anchor-guided Unsupervised Domain Adaptation for Semantic Segmentation
NeurIPS, 2019
Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Hu
AAAIOral
Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing
AAAI, 2019
Progressive Retinex: Mutually Reinforced Illumination-noise
ACMMM
Progressive Retinex: Mutually Reinforced Illumination-noise Perception Network for Low-light Image Enhancement
ACM MM, 2019
๐Ÿ“„
ICCV
Deep Multiple-attribute-perceived Network for Real-world Texture Recognition
ICCV, 2019
Multi-level Deep Cascade Trees for Conversion Rate Predictio
AAAI
Multi-level Deep Cascade Trees for Conversion Rate Prediction in Recommendation System
AAAI, 2019
MirrorGAN: Learning Text-to-image Generation by Redescriptio
CVPR
MirrorGAN: Learning Text-to-image Generation by Redescription
CVPR, 2019
FAMED-Net: A Fast and Accurate Multi-scale End-to-end Dehazi
TIP
FAMED-Net: A Fast and Accurate Multi-scale End-to-end Dehazing Network
IEEE TIP, 2019

2018

Fully Point-wise Convolutional Neural Network for Modeling S
ACMMM
Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images
ACM MM, 2018

2017

๐Ÿ“„
CVPR
Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior
CVPR, 2017