Jindong is a research scientist in the Learning and Perception Research (LPR) team of NVIDIA Research. Prior to joining NVIDIA, Jindong was a PhD student at Rutgers University under the supervision of Prof. Sungjin Ahn. His research interests lie at the intersection of representation learning and visual reasoning, with a strong interests in developing novel architectures that can improve agent's visual reasoning capabilities. The long-term objective of his research is to develop artificial intelligence agents capable of human-like reasoning. This involves designing systems that can uncover latent structure of the physical world, predict future scenarios based on current states, infer the causality or correlation between events, and engage in logical planning to accomplish goals. Currently, he is focusing on Multimodal LLMs, Vision Foundation Models, and their synergy.