Family name:
Given name:
Quoted from a Chinese idiom 一馬平川, my full name means the flat ground that one can ride straight across and thus implies enjoying a smooth life.
眾峰來自天目山,勢若駿馬奔平川。
There are lots of mountains from Tianmu Mountain, which have the might of fine horses galloping across flat ground.
Currently Working: Improving vision-language models (VLMs) for visual grounding and developing a multi-agent framework to transcribe live sports, emulating the style of human commentators.
Worked with multimodals and Vision-Language Alignment, fine-tuning SDXL on custom data, built framework to retrieve visually semantic video
@INPROCEEDINGS{10651096,
author={Sharma, Karun and Vats, Vidushee and Singh, Abhinendra and Sahani, Rahul and Rai, Deepak and Sharma, Ashok},
booktitle={2024 International Joint Conference on Neural Networks (IJCNN)},
title={LLaVA-PlantDiag: Integrating Large-scale Vision-Language Abilities for Conversational Plant Pathology Diagnosis},
year={2024},
volume={},
number={},
pages={1-7},
keywords={Pathology;Visualization;Plant diseases;Accuracy;Generative AI;Convolution;Neural networks;Multimodal;LLM;LLaVA;Phytopathological Multimodal Data},
doi={10.1109/IJCNN60899.2024.10651096}}
}