Brody Huval’s research while affiliated with Stanford University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (4)


Fig. 1: Sample output from our neural network capable of lane and vehicle detection. 
Fig. 6: Image after perspective distortion 
Fig. 7: Left: lane prediction on test image. Right: Lane detection evaluated in 3D 
Fig. 9: Car Detector Bounding Box Performance 
Fig. 10: Vehicle False Positives 

+2

An Empirical Evaluation of Deep Learning on Highway Driving
  • Article
  • Full-text available

April 2015

·

4,851 Reads

·

469 Citations

Brody Huval

·

Tao Wang

·

Sameep Tandon

·

[...]

·

Andrew Y. Ng

Numerous groups have applied a variety of deep learning techniques to computer vision problems in highway perception scenarios. In this paper, we presented a number of empirical evaluations of recent deep learning advances. Computer vision, combined with deep learning, has the potential to bring about a relatively inexpensive, robust solution to autonomous driving. To prepare deep learning for industry uptake and practical applications, neural networks will require large data sets that represent all possible driving environments and scenarios. We collect a large data set of highway data and apply deep learning and computer vision algorithms to problems such as car and lane detection. We show how existing convolutional neural networks (CNNs) can be used to perform lane and vehicle detection while running at frame rates required for a real-time system. Our results lend credence to the hypothesis that deep learning holds promise for autonomous driving.

Download

Deep learning for class-generic object detection

December 2013

·

205 Reads

·

35 Citations

We investigate the use of deep neural networks for the novel task of class generic object detection. We show that neural networks originally designed for image recognition can be trained to detect objects within images, regardless of their class, including objects for which no bounding box labels have been provided. In addition, we show that bounding box labels yield a 1% performance increase on the ImageNet recognition challenge.


Semantic Compositionality through Recursive Matrix-Vector Spaces

July 2012

·

756 Reads

·

1,275 Citations

Single-word vector space models have been very successful at learning lexical information. However, they cannot capture the compositional meaning of longer phrases, preventing them from a deeper understanding of language. We introduce a recursive neural network (RNN) model that learns compositional vector representations for phrases and sentences of arbitrary syntactic type and length. Our model assigns a vector and a matrix to every node in a parse tree: the vector captures the inherent meaning of the constituent, while the matrix captures how it changes the meaning of neighboring words or phrases. This matrix-vector RNN can learn the meaning of operators in propositional logic and natural language. The model obtains state of the art performance on three different experiments: predicting fine-grained sentiment distributions of adverb-adjective pairs; classifying sentiment labels of movie reviews and classifying semantic relationships such as cause-effect or topic-message between nouns using the syntactic path between them.


Figure 2: Visualization of the k-means filters used in the CNN layer after unsupervised pre-training: (left) Standard RGB filters (best viewed in color) capture edges and colors. When the method is applied to depth images (center) the resulting filters have sharper edges which arise due to the strong discontinuities at object boundaries. The same is true, though to a lesser extent, when compared to filters trained on gray scale versions of the color images (right). 
Convolutional-Recursive Deep Learning for 3D Object Classification

January 2012

·

2,381 Reads

·

592 Citations

Recent advances in 3D sensing technologies make it possible to easily record color and depth images which together can improve object recognition. Most current methods rely on very well-designed features for this new 3D modality. We in-troduce a model based on a combination of convolutional and recursive neural networks (CNN and RNN) for learning features and classifying RGB-D images. The CNN layer learns low-level translationally invariant features which are then given as inputs to multiple, fixed-tree RNNs in order to compose higher order fea-tures. RNNs can be seen as combining convolution and pooling into one efficient, hierarchical operation. Our main result is that even RNNs with random weights compose powerful features. Our model obtains state of the art performance on a standard RGB-D object dataset while being more accurate and faster during train-ing and testing than comparable architectures such as two-layer CNNs.

Citations (4)


... With the development of deep learning, some deep neural network-based approaches [19,20] have shown significant superiority in lane line detection. Current deep learning approaches for lane line detection are primarily based on image segmentation methods [21][22][23][24], anchor-based methods [13,16,[25][26][27][28] and parameter prediction methods [11]. ...

Reference:

Slope-embedded ViT-based model for lane line detection under occlusions
An Empirical Evaluation of Deep Learning on Highway Driving

... (1) View-based: Some researchers make use of RGB-D data by considering the depth map as an additional channel [65]. Some process 3D data as front-view images [66,67] or project 3D information to a bird's-eye view [68]. ...

Convolutional-Recursive Deep Learning for 3D Object Classification

... In recent years, researchers have been studying and exploring ways to transform linguistic data into structured data so as to extract valuable information, such as [1][2][3]. Socher et al. [4] proposed to apply matrix to recurrent neural network to process natural language, and named it matrix recurrent neural network model. Sun et al. [5] designed a classifier model trained by convolutional neural networks to learn the shortest dependency path, and used the self-attention principle of multi-end attention networks to calculate the relationship between each word in a sentence and other words respectively, solving the problem of gradient disappearance. ...

Semantic Compositionality through Recursive Matrix-Vector Spaces
  • Citing Conference Paper
  • July 2012

... For more than a decade, countless systems have been implemented based on deep and convolutional neural approaches, and the oldest of them using DNNs (deep neural networks) have been dedicated to object detection. Brody et al. [22] developed a DNN-based model with a bounding box generation by the sliding window method. They claimed that by training the network is efficient in detecting objects. ...

Deep learning for class-generic object detection
  • Citing Article
  • December 2013