Gaurvi Goyal

Gaurvi Goyal
Istituto Italiano di Tecnologia | IIT · iCub Facility

Doctor of Philosophy

About

9
Publications
450
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
22
Citations
Citations since 2017
8 Research Items
22 Citations
20172018201920202021202220230246810
20172018201920202021202220230246810
20172018201920202021202220230246810
20172018201920202021202220230246810
Introduction
My work in Computer Vision has been on designing Deep Learning methods that can be trained on minimal data for Action recognition applications from RGB videos with focus on viewpoint generalization. More recently, I'm working on mental well being estimation from RGB(+D) video data of human interaction.

Publications

Publications (9)
Article
Cross-view action recognition refers to the task of recognizing actions observed from view-points that are unfamiliar to the system. To address the complexity of the problem, state of the art methods often rely on large-scale datasets, where the variability of viewpoints is appropriately represented. However, this comes to a significant price, in t...
Article
Full-text available
MoCA is a bi-modal dataset in which we collect Motion Capture data and video sequences acquired from multiple views, including an ego-like viewpoint, of upper body actions in a cooking scenario. It has been collected with the specific purpose of investigating view-invariant action properties in both biological and artificial systems. Besides that,...
Chapter
In this work we discuss the action classification performance obtained with a baseline assessment of the MoCA dataset: a multimodal, synchronised dataset including Motion Capture data and multi-view video sequences of upper body actions in a cooking scenario. To this purpose, we setup a classification pipeline to manipulate the two data type. For t...
Preprint
Full-text available
Analysis and interpretation of egocentric video data is becoming more and more important with the increasing availability and use of wearable cameras. Exploring and fully understanding affinities and differences between ego and allo (or third-person) vision is paramount for the design of effective methods to process, analyse and interpret egocentri...
Conference Paper
In this work we start investigating the use of appropriately learnt space-time primitives for modeling upper body human actions. As a study case we consider cooking activities which may undergo large intra class variations and are characterized by subtle details, observed by different view points. With a BoK procedure we quantize each video frame w...

Network

Cited By