Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 338 Bytes

File metadata and controls

2 lines (2 loc) · 338 Bytes

Vision-Language-Models-for-Activity-Recognition-and-Abnormality-Detection-for-Elderly

VLM PrismerZ model for recognition of emergency and non-emergneyc situations via vision and language transformers. PrismerZ is directed on understanding the contextual information and completing image captioning and vision question answering tasks.