Publications

(*=equal contribution)

Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking

Yun Liu*, Bowen Yang*, Licheng Zhong*, He Wang, Li Yi

arxiv preprint
CVPR Workshop on Humanoid Agents, 2025, Spotlight
Project Page / arXiv

A benchmark of Humanoid Scene Interaction. The first comprehensive benchmark for generalizable humanoid-scene interaction learning via human mimicking. Integrated a large-scale diverse human skill reference dataset with both synthetic and real-world human-scene interactions. Developed a general skill-learning paradigm and provide support for both pipeline-wise and modular evaluations.

Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians

Licheng Zhong, Hong-Xing "Koven" Yu, Jiajun Wu, Yunzhu Li

European Conference on Computer Vision (ECCV), 2024
Project Page / arXiv / Data / Code

Spring-Gaus reconstructs the appearance, geometry, and physical dynamics properties of elastic objects from video observations. Spring-Gaus enables future prediction and simulation under different initial states and environmental parameters.

Color-NeuS: Reconstructing Neural Implicit Surfaces with Color

Licheng Zhong*, Lixin Yang*, Kailin Li, Haoyu Zhen, Mei Han, Cewu Lu

International Conference on 3D Vision (3DV), 2024
Project Page / Paper / arXiv / Data / Code

Color-NeuS focuses on mesh reconstruction with color. We remove view-dependent color while using a relighting network to maintain volume rendering performance. Mesh is extracted from the SDF network, and vertex color is derived from the global color network. We conceived a in hand object scanning task and gathered several videos for it to evaluate Color-NeuS.

Multi-view Hand Reconstruction with a Point-Embedded Transformer

Lixin Yang, Licheng Zhong, Pengxiang Zhu, Xinyu Zhan, Junxiao Kong, Jian Xu, Cewu Lu

arxiv preprint
arXiv / Code

POEMv2 is a generalizable multi-view hand mesh reconstruction model which embeds a static basis point within the multi-view stereo space.

CHORD: Category-level in-Hand Object Reconstruction via Shape Deformation

Kailin Li*, Lixin Yang*, Haoyu Zhen, Zenan Lin, Xinyu Zhan, Licheng Zhong, Jian Xu, Kejian Wu, Cewu Lu

International Conference on Computer Vision (ICCV), 2023
Project Page / Paper / arXiv / PyBlend

We proposed a new method CHORD which exploits the categorical shape prior for reconstructing the shape of intra-class objects. In addition, we constructed a new dataset, COMIC, of category-level hand-object interaction. COMIC encompasses a diverse collection of object instances, materials, hand interactions, and viewing directions, as illustrated.

POEM: Reconstructing Hand in a Point Embedded Multi-view Stereo

Lixin Yang, Jian Xu, Licheng Zhong, Xinyu Zhan, Zhicheng Wang, Kejian Wu, Cewu Lu

Computer Vision and Pattern Recognition (CVPR), 2023
Paper / arXiv / Code

POEM (Point Embedded Multi-view) focuses on reconstructing an articulation body with "true scale" and "accurate pose" from a series of sparsely arranged camera views. In practice, we used the example of hand. POEM explores the power of points, using a cluster of (x, y, z) coordinates with natural positional encoding to find associations in multi-view stereo.

Centralized and Decentralized Methods for Multi-robot Safe Navigation

Xinbei Wang*, Zexuan Yan*, Licheng Zhong*

Machine Learning and Intelligent Systems Engineering (MLISE), 2022
Paper

We simulated two centralized methods and two decentralized methods for multi-robot safe navigation in their respective environments.

Licheng Zhong 钟李骋