Ruixiang Xue薛瑞翔

Research Area

Spatial Intelligence

Spatial
Intelligence
Intelligent point cloud compression3D Gaussian splatting compressionImplicit neural representationsStreet generative novel-view synthesisAI-assisted photography

Projects

Intelligent Point Cloud Compression

Research and development of deep learning based point cloud compression methods for MPEG AI-PCC.

  • Designed Transformer models for point cloud representation and efficient spatial feature extraction.
  • Used density changes during point cloud downsampling to guide reconstruction mode selection.
  • Achieved 39% geometry compression gain and 34% attribute compression gain over the baseline under public test conditions.
Standardization proposals
  1. MPEG m70061 · Nov. 2024 AI-PCC CfP Response Proposal from Nanjing University and OPPO (Track1) T. Chen, R. Xue, J. Zhang, J. Wang, Z. Ma, S. Xia, R. Xue, Z. Sun, C. Ma, Y. Yu, H. Yu, D. Wang
  2. MPEG m70062 · Nov. 2024 AI-PCC CfP Response Proposal from Nanjing University and OPPO (Track2) T. Chen, R. Xue, J. Zhang, J. Wang, Z. Ma, S. Xia, R. Xue, Z. Sun, C. Ma, Y. Yu, H. Yu, D. Wang
  3. MPEG m70395 · Nov. 2024 [AI-GC][CfP-related] Improved Cross-Platform Reproducibility for AI-Based Point Cloud Compression J. Zhang, T. Chen, R. Xue, Z. Ma, S. Xia, Z. Sun, C. Ma, Y. Yu, H. Yu, D. Wang
  4. MPEG m70396 · Nov. 2024 [AI-GC][CfP-related] On Model Quantization for AI-Based Point Cloud Compression J. Zhang, T. Chen, R. Xue, Z. Ma, S. Xia, Z. Sun, C. Ma, Y. Yu, H. Yu, D. Wang
  5. MPEG m64417 · Jul. 2023 [AI-3DGC EE 5.4] Update On the Training Datasets for Attribute Compression J. Wang, R. Xue, J. Li, Z. Ma, H. Wei, Y. Yu, V. Zakharchenko, D. Wang
  6. MPEG m64418 · Jul. 2023 [AI-3DGC EE 5.1] Performance Comparison of Dynamic Point Cloud Geometry Compression for LiDAR Point Clouds R. Xue, J. Wang, J. Li, Z. Ma, H. Wei, Y. Yu, V. Zakharchenko, D. Wang
  7. MPEG m62176 · Jan. 2023 [AI-3DGC] On the Training Datasets for Attribute Compression J. Wang, R. Xue, J. Li, Z. Ma, H. Wei, Y. Yu, V. Zakharchenko, D. Wang
  8. MPEG m62177 · Jan. 2023 [AI-3DGC][EE5.1-related][EE5.3-related] Dynamic Point Cloud Geometry Compression for LiDAR Point Cloud with Ego-Motion Compensation R. Xue, J. Wang, J. Li, Z. Ma, H. Wei, Y. Yu, V. Zakharchenko, D. Wang
  9. MPEG m60353 · Jul. 2022 SparsePCGCv2: Multihead Neighborhood Point Attention for Sparse Point Cloud Ruixiang Xue, Jianqiang Wang, Zhan Ma, Honglian Wei, Yue Yu, Vladyslav Zakharchenko, Dong Wang
  10. MPEG m59552 · Apr. 2022 SparsePCGCv2: Improved SparsePCGC with attention mechanism Jianqiang Wang, Ruixiang Xue, Zhan Ma, Honglian Wei, Yue Yu, Vladyslav Zakharchenko, Dong Wang
Patent applications
  1. 2024 编解码方法、编码器、解码器以及存储介质 马展,薛瑞翔,魏红莲
  2. 2023 基于隐式神经网络和深度图投影的激光雷达点云序列表征方法 薛瑞翔,李嘉欣,马展
  3. 2023 基于神经网络的动态激光雷达点云多尺度几何无损压缩方法 马展,薛瑞翔
  4. 2022 基于神经网络的点云几何压缩后处理方法 马展,薛瑞翔
  5. 2022 基于注意力机制和稀疏卷积的点云几何无损压缩方法 薛瑞翔,王剑强,马展 also filed as 点云几何信息的压缩、解压缩及点云视频编解码方法、装置
3D Gaussian Splatting Coding

Compact and progressive compression for 3D Gaussian splatting scenes.

  • Investigated compactification, compression, and reconstruction methods for 3D Gaussian splatting.
  • Proposed RecastGS to reorganize pretrained 3DGS into a region-aware layered hierarchy with prompt-driven object extraction and progressive distillation.
  • Proposed LayeredCGS, a feed-forward layered 3DGS compressor with cross-layer context modeling and truncatable bitstreams.
AI-assisted photography

A virtual photography system that turns single photos into controllable 3D shooting spaces, then repairs raw 3DGS renders with generative neural rendering.

Virtual studio: Marble 3DGS, SplatTransform, UE import, immersive capture.
Human-scene compositing: scene splats plus a separate human 3DGS asset pipeline.
Spatial reframing: Apple-like nearby-view recomposition from a single photo.
Virtual Studio Pipeline From one reference image to an Unreal-based virtual shooting stage and final neural-rendered photo.
Reference image single photo Marble 3DGS world asset SplatTransform preprocess Unreal Engine virtual stage Camera capture navigation Human 3DGS composite Neural render photo polish
Spatial Reframing Pipeline Apple-like nearby-view recomposition under constrained viewpoint changes.
Input photo user image Single-image 3DGS geometry prior Subject cues mask + depth Reframe camera small motion Confidence map holes + alpha Generative repair low-confidence only Reframed photo final view

Education

Nanjing University

Ph.D. Student, Information and Communication Engineering

2021.09 - 2027.06

  • NJU Vision Lab, advised by Prof. Zhan Ma and Researcher Tong Chen
  • Research fields: intelligent point cloud compression, 3D Gaussian splatting compression, implicit neural representations, and AI-assisted photography
  • Excellent result in doctoral mid-term assessment

Hangzhou Dianzi University

B.Eng. in Electronic Information Engineering

2017.09 - 2021.06

Work Experience

Geely

Artificial Intelligence Center · Algorithm Intern

2026.05 - Present

Research on street-scene novel-view extrapolation with mutual enhancement between feed-forward reconstruction and generative models.

  • Used feed-forward street-scene reconstruction to improve the geometric consistency of generative novel-view synthesis models.
  • Leveraged generative priors to improve the rendering-quality loss for feed-forward street-scene reconstruction under large-baseline novel-view extrapolation.

OPPO

Research Institute · Algorithm Intern

2024.02 - 2024.11

Worked on intelligent point cloud compression research and MPEG AI-PCC standardization.

  • Developed and evaluated intelligent point cloud compression algorithms around MPEG AI-PCC standardization.
  • Implemented point cloud codec software and evaluated performance across standard test sequences.
  • Participated in multiple MPEG meetings, submitted 10 standardization proposals, and filed 5 invention patents.

Publications

  1. 3D Gaussian Splatting Compression with Object Scalability
    ECCV 2026 · European Conference on Computer Vision
    Ruixiang Xue, Tong Chen, Zhan Ma
    CCF-B3D Gaussian splatting compression
    PaperCode
    Abstract

    We introduce a framework towards scalable, finer-grained object-level 3DGS compression. First, a post-training method named RecastGS is proposed to reorganize pretrained 3DGS into a layered representation and progressively distills cumulative submodels to improve rate-distortion efficiency. Leveraging multi-view SAM predictions from user click prompts, Gaussians are further partitioned into user-defined regions of interest (ROI), enabling region-adaptive quality control without retraining. Second, built upon this reorganized region-aware layered hierarchy, a feed-forward 3DGS compression method named LayeredCGS is proposed to compress position using a lightweight point cloud codec and attributes with a layer-wise context model to exploit cross-layer correlations. Extensive experiments show that LayeredCGS achieves 35% BD-Rate gain over the existing feed-forward method FCGS. With progressive distillation in RecastGS enabled, our method further outperforms most per-scene optimization methods. Moreover, the proposed method supports ROI-aware compression and flexible bitstream truncation, achieving up to 2 dB higher ROI PSNR at comparable bitrates compared with the uniform quality allocation baseline while enabling low-latency preview and progressive quality refinement. The code will be released at https://github.com/RuixiangXue/ScalableGSC.

    BibTeX
    @inproceedings{xue2026objectscalable3dgs,
      title={3D Gaussian Splatting Compression with Object Scalability},
      author={Xue, Ruixiang and Chen, Tong and Ma, Zhan},
      booktitle={European Conference on Computer Vision (ECCV)},
      year={2026}
    }
  2. A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding – Part I: Geometry
    TPAMI · IEEE Transactions on Pattern Analysis and Machine Intelligence
    Vol. 47, No. 1, pp. 269-287, Jan. 2025 · DOI: 10.1109/TPAMI.2024.3462938
    Jianqiang Wang, Ruixiang Xue, Jiaxin Li, Dandan Ding, Yi Lin, Zhan Ma
    SCI Q1CCF-AIF 18.6Co-first authorIntelligent point cloud compression
    Abstract

    A universal multiscale conditional coding framework, Unicorn, is proposed to compress the geometry and attribute of any given point cloud. Geometry compression is addressed in Part I of this paper, while attribute compression is discussed in Part II. We construct the multiscale sparse tensors of each voxelized point cloud frame and properly leverage lower-scale priors in the current and (previously processed) temporal reference frames to improve the conditional probability approximation or content-aware predictive reconstruction of geometry occupancy in compression. Unicorn is a versatile, learning-based solution capable of compressing static and dynamic point clouds with diverse source characteristics in both lossy and lossless modes. Following the same evaluation criteria, Unicorn significantly outperforms standard-compliant approaches like MPEG G-PCC, V-PCC, and other learning-based solutions, yielding state-of-the-art compression efficiency while presenting affordable complexity for practical implementations.

    BibTeX
    @article{wang2025unicorngeometry,
      title={A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding -- Part I: Geometry},
      author={Wang, Jianqiang and Xue, Ruixiang and Li, Jiaxin and Ding, Dandan and Yi, Lin and Ma, Zhan},
      journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
      volume={47},
      number={1},
      pages={269--287},
      year={2025},
      doi={10.1109/TPAMI.2024.3462938}
    }
  3. NeRI: Implicit Neural Representation of LiDAR Point Cloud Using Range Image Sequence
    ICASSP 2024 · IEEE International Conference on Acoustics, Speech, and Signal Processing
    ICASSP 2024, pp. 8020-8024 · DOI: 10.1109/ICASSP48485.2024.10446596
    Ruixiang Xue, Jiaxin Li, Tong Chen, Dandan Ding, Xun Cao, Zhan Ma
    CCF-BFirst authorImplicit neural representationsIntelligent point cloud compression
    PaperCode
    Abstract

    This paper proposes the NeRI, an implicit neural representation (INR) based LiDAR point cloud compressor. In NeRI, we first transform a sequence of 3D LiDAR frames into a 2D range image sequence through range image projection over time. Then, we employ a neural network conditioned on the temporal frame index and associated LiDAR sensor pose to fit input range images as closely as possible. The optimized network parameters, which implicitly represent the input LiDAR data, are later lossily compressed. NeRI decoder is then initialized using decoded parameters to generate range images for reconstructing the 3D LiDAR sequence accordingly. Extensive experimental results demonstrate the significant superiority of NeRI regarding the compression efficiency and decoding speed compared to state-of-the-art 2D and 3D compressors for LiDAR point cloud.

    BibTeX
    @inproceedings{xue2024neri,
      title={NeRI: Implicit Neural Representation of LiDAR Point Cloud Using Range Image Sequence},
      author={Xue, Ruixiang and Li, Jiaxin and Chen, Tong and Ding, Dandan and Cao, Xun and Ma, Zhan},
      booktitle={ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
      pages={8020--8024},
      year={2024},
      doi={10.1109/ICASSP48485.2024.10446596}
    }
    Poster
    NeRI: Implicit Neural Representation of LiDAR Point Cloud Using Range Image Sequence poster
  4. A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding – Part II: Attribute
    TPAMI · IEEE Transactions on Pattern Analysis and Machine Intelligence
    Vol. 47, No. 1, pp. 252-268, Jan. 2025 · DOI: 10.1109/TPAMI.2024.3462945
    Jianqiang Wang, Ruixiang Xue, Jiaxin Li, Dandan Ding, Yi Lin, Zhan Ma
    SCI Q1CCF-AIF 18.6Second authorIntelligent point cloud compression
    Abstract

    A universal multiscale conditional coding framework, Unicorn, is proposed to code the geometry and attribute of any given point cloud. Attribute compression is discussed in Part II of this paper, while geometry compression is given in Part I of this paper. We first construct the multiscale sparse tensors of each voxelized point cloud attribute frame. Since attribute components exhibit very different intrinsic characteristics from the geometry element, e.g., 8-bit RGB color versus 1-bit occupancy, we process the attribute residual between lower-scale reconstruction and current-scale data. Similarly, we leverage spatially lower-scale priors in the current frame and (previously processed) temporal reference frame to improve the probability estimation of attribute intensity through conditional residual prediction in lossless mode or enhance the attribute reconstruction through progressive residual refinement in lossy mode for better performance. The proposed Unicorn is a versatile, learning-based solution capable of compressing a great variety of static and dynamic point clouds in both lossy and lossless modes. Following the same evaluation criteria, Unicorn significantly outperforms standard-compliant approaches like MPEG G-PCC, V-PCC, and other learning-based solutions, yielding state-of-the-art compression efficiency with affordable encoding/decoding runtime.

    BibTeX
    @article{wang2025unicornattribute,
      title={A Versatile Point Cloud Compressor Using Universal Multiscale Conditional Coding -- Part II: Attribute},
      author={Wang, Jianqiang and Xue, Ruixiang and Li, Jiaxin and Ding, Dandan and Yi, Lin and Ma, Zhan},
      journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
      volume={47},
      number={1},
      pages={252--268},
      year={2025},
      doi={10.1109/TPAMI.2024.3462945}
    }
  5. GRNet: Geometry Restoration for G-PCC Compressed Point Clouds Using Auxiliary Density Signaling
    TVCG · IEEE Transactions on Visualization and Computer Graphics
    Vol. 30, No. 10, pp. 6740-6753, Oct. 2024 · DOI: 10.1109/TVCG.2023.3336936
    Gexin Liu, Ruixiang Xue, Jiaxin Li, Dandan Ding, Zhan Ma
    SCI Q1CCF-AIF 6.5Second authorIntelligent point cloud compression
    Paper
    Abstract

    The lossy Geometry-based Point Cloud Compression (G-PCC) inevitably impairs the geometry information of point clouds, which deteriorates the quality of experience (QoE) in reconstruction and/or misleads decisions in tasks such as classification. To tackle it, this work proposes GRNet for the geometry restoration of G-PCC compressed large-scale point clouds. By analyzing the content characteristics of original and G-PCC compressed point clouds, we attribute the G-PCC distortion to two key factors: point vanishing and point displacement. Visible impairments on a point cloud are usually dominated by an individual factor or superimposed by both factors, which are determined by the density of the original point cloud. To this end, we employ two different models for coordinate reconstruction, termed Coordinate Expansion and Coordinate Refinement, to attack the point vanishing and displacement, respectively. In addition, 4-byte auxiliary density information is signaled in the bitstream to assist the selection of Coordinate Expansion, Coordinate Refinement, or their combination. Before being fed into the coordinate reconstruction module, the G-PCC compressed point cloud is first processed by a Feature Analysis Module for multiscale information fusion, in which kNN-based Transformer is leveraged at each scale to adaptively characterize neighborhood geometric dynamics for effective restoration. Following the common test conditions recommended in the MPEG standardization committee, GRNet significantly improves the G-PCC anchor and remarkably outperforms state-of-the-art methods on a great variety of point clouds (e.g., solid, dense, and sparse samples) both quantitatively and qualitatively. Meanwhile, GRNet runs fairly fast and uses a smaller-size model when compared with existing learning-based approaches, making it attractive to industry practitioners.

    BibTeX
    @article{liu2024grnet,
      title={GRNet: Geometry Restoration for G-PCC Compressed Point Clouds Using Auxiliary Density Signaling},
      author={Liu, Gexin and Xue, Ruixiang and Li, Jiaxin and Ding, Dandan and Ma, Zhan},
      journal={IEEE Transactions on Visualization and Computer Graphics},
      volume={30},
      number={10},
      pages={6740--6753},
      year={2024},
      doi={10.1109/TVCG.2023.3336936}
    }

Awards

*
First Prize Scholarship, Nanjing University
2021 - 2025
*
Special Prize, 12th Zhejiang College Student Entrepreneurship Plan Competition
2020