Deep learning has emerged as an effective approach for seismic data processing in general, and for earthquake monitoring in particular. The ability of deep learning models to generalize beyond the training and validation data is important for comprehensive earthquake monitoring; this ability furthermore depends on the availability of a sufficiently large and complete training dataset. However, this requirement can prove challenging to meet due to significant effort and time for data collection and labeling. Data augmentation provides an efficient and effective approach for increasing the dimension of training samples and improving generalization to unseen samples. In this paper, we present augmentation methods appropriate for seismic waveforms and demonstrate their ability to reduce bias and increase performance. These augmentation methods can be applied to a wide range of deep learning applications designed for seismic data.