Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2
Published in arXiv, 2022
This paper proposes a novel Adaptive Implicit Representation Mapping (AIRM) approach for ultra-high-resolution image segmentation, addressing limitations in current CNN-based IRM methods. Our method includes an Affinity Empowered Encoder (AEE) with transformer architecture to capture long-distance semantic information and an Adaptive Implicit Representation Mapping Function (AIRMF) that dynamically translates pixel-wise features while preserving global context.
Published in ACM Multimedia, 2022
This paper introduces a cross-modal few-shot approach for 3D point cloud segmentation that uses labeled 2D images instead of 3D annotations. By converting 2D images to 3D format and employing a co-embedding network, the method achieves effective segmentation through prototype-based cosine similarity, performing competitively on benchmarks with minimal labeled 2D support.
Published in AAAI, 2023
This paper presents a multi-layer transformer network for few-shot 3D point cloud semantic segmentation, addressing limitations in computational complexity and fine-grained relationship learning in existing methods. By aggregating query point cloud features with class-specific support features at multiple scales and avoiding pooling, our approach fully utilizes pixel-level support features.
Published in ACM Multimedia, 2024
This paper introduces a cross-modal few-shot approach for 3D point cloud segmentation, using multi-view synthesis with color and depth inpainting to address occlusions and reduce reliance on 3D annotations. A Co-embedding Network aligns features between synthesized views and original 3D data, while a weighted prototype network enhances segmentation performance.
Published in CVPR, 2025
This paper presents DPSeg, a dual-prompt framework for open-vocabulary semantic segmentation that integrates both visual and textual prompts to generate spatial-semantic cost volumes. A multi-scale cost volume-guided decoder and a semantic-guided prompt refinement strategy are introduced to enhance spatial detail and alignment. The method significantly improves segmentation accuracy across diverse benchmarks by effectively mitigating the domain gap between image and text embeddings.
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.