Data warehouses are primary entities that among other functions collect data over time. These data make it possible for researchers to address many different questions that, for example, can inform policy decision. In order to conduct analyses whose inputs and outputs are valid, data need to be of high quality. That is, data that are consistent, reliable, complete, and accurate. However, data
... [Show full abstract] analysts might encounter major issues when working with data, such as storage, amount and type of data, and more importantly, having to deal with different formats. Because data are only as valuable as its level of quality, careful statistical procedures need to be performed that will integrate the data so that (the desired information is not lost or inaccurately represented. Using educational data from a warehouse in a southeastern state, this paper will present efficient ways to merge and modify data of different formats and lengths.