CVPR 2026

ReLaGS: Relational Language Gaussian Splatting

Yaxu Xie1,2*  ·  Abdalla Arafa1,2*  ·  Alireza Javanmardi1  ·  Christen Millerdurai1  ·  Jia Cheng Hu3  ·  Shaoxiang Wang1,2  ·  Alain Pagani1  ·  Didier Stricker1,2

*Equal contribution

1 German Research Center for Artificial Intelligence (DFKI)   2 RPTU University Kaiserslautern-Landau   3 University of Modena and Reggio Emilia

ReLaGS Teaser

Relational Language Gaussian Splatting. We build a platform with multi-hierarchical language Gaussian field and open-vocabulary 3D scene graph, to support various tasks such as object selection via click, open vocabulary 3D object segmentation across semantic granularity, spatial relationship reasoning between objects and querying object with relation-guidance.

Abstract

Achieving unified 3D perception and reasoning across tasks such as segmentation, retrieval, and relation understanding remains challenging, as existing methods are either object-centric or rely on costly training for inter-object reasoning. We present a novel framework that constructs a hierarchical language-distilled Gaussian scene and its 3D semantic scene graph without scene-specific training. A Gaussian pruning mechanism refines scene geometry, while a robust multi-view language alignment strategy aggregates noisy 2D features into accurate 3D object embeddings. On top of this hierarchy, we build an open-vocabulary 3D scene graph with Vision Language-derived annotations and Graph Neural Network-based relational reasoning. Our approach enables efficient and scalable open-vocabulary 3D reasoning by jointly modeling hierarchical semantics and inter/intra-object relationships, validated across tasks including open-vocabulary segmentation, scene graph generation, and relation-guided retrieval.


Citation

@inproceedings{xiearafa2026relags,
  title     = {ReLaGS: Relational Language Gaussian Splatting},
  author    = {Xie, Yaxu and Arafa, Abdalla and Javanmardi, Alireza and
               Millerdurai, Christen and Hu, Jia Cheng and Wang, Shaoxiang and
               Pagani, Alain and Stricker, Didier},
  booktitle = {CVPR},
  year      = {2026}
}

Acknowledgements

This work has been partially funded by the EU projects dAIEDGE (GA Nr 101120726) and LUMINOUS (GA Nr 101135724).