https://github.com/ActiveVisionLab/Awesome-LLM-3D

https://aicarrier.feishu.cn/wiki/GvibwrWumiYVxYk3Ik5c7Yd3nSb

pipeline整理:

  1. 大部分论文的pipeline:Encoder → Projector → LLM
    1. Encoder:对point clouds提取特征
    2. Projector:使用projector将特征投影到LLM所需的point token
    3. LLM:使用LLM预测token
  2. Encoder → LLM
    1. An Embodied Generalist Agent in 3D World
  3. Building object set → LLM
    1. ConceptGraphs

Point Encoder的variants