Gaurav Shrivastava

Gaurav Shrivastava
  • Doctor of Philosophy
  • Research Assistant at University of Maryland, College Park

About

9
Publications
219
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
46
Citations
Current institution
University of Maryland, College Park
Current position
  • Research Assistant
Additional affiliations
February 2019 - July 2019
University of Maryland, College Park
Position
  • Faculty Assistant
April 2018 - August 2018
National University of Singapore
Position
  • RA
Education
August 2019 - May 2021
University of Maryland, College Park
Field of study
  • Computer Science
August 2013 - May 2017
Birla Institute of Technology and Science, Pilani
Field of study
  • Electronics and Instrumentation

Publications

Publications (9)
Conference Paper
Nowadays, spatial analysis in text is widely considered as important for both researchers and users. In certain fields such as epidemiology, the extraction of spatial information in text is crucial and both resources and methods are necessary. In most of spatial analysis process, gazetteer is a commonly used resource. A gazetteer is a data source w...
Preprint
Full-text available
Diffusion models have made significant strides in image generation, mastering tasks such as unconditional image synthesis, text-image translation, and image-to-image conversions. However, their capability falls short in the realm of video prediction, mainly because they treat videos as a collection of independent images, relying on external constra...
Preprint
Full-text available
Obstacle-aware trajectory navigation is crucial for many systems. For example, in real-world navigation tasks, an agent must avoid obstacles, such as furniture in a room, while planning a trajectory. Gaussian Process (GP) regression, in its current form, fits a curve to a set of data pairs, with each pair consisting of an input point 'x' and its co...
Preprint
Full-text available
Multi-step prediction models, such as diffusion and rectified flow models, have emerged as state-of-the-art solutions for generation tasks. However, these models exhibit higher latency in sampling new frames compared to single-step methods. This latency issue becomes a significant bottleneck when adapting such methods for video prediction tasks, gi...
Preprint
Full-text available
In the evolving landscape of video enhancement and editing methodologies, a majority of deep learning techniques often rely on extensive datasets of observed input and ground truth sequence pairs for optimal performance. Such reliance often falters when acquiring data becomes challenging, especially in tasks like video dehazing and relighting, wher...
Preprint
Full-text available
Diffusion models have made significant strides in image generation, mastering tasks such as unconditional image synthesis, text-image translation, and image-to-image conversions. However, their capability falls short in the realm of video prediction, mainly because they treat videos as a collection of independent images, relying on external constra...
Preprint
Full-text available
Generating future frames given a few context (or past) frames is a challenging task. It requires modeling the temporal coherence of videos and multi-modality in terms of diversity in the potential future states. Current variational approaches for video generation tend to marginalize over multi-modal future outcomes. Instead, we propose to explicitl...

Network

Cited By