Lisa Li’s research while affiliated with the University of Texas at Austin and other places

Publications (1)


Auto-Suggestive Real-Time Classification of Driller Memos into Activity Codes for Invisible Lost Time Analysis
  • Conference Paper · February 2020 · 44 Reads · 6 Citations

Jared Ucherek · Matthew Prinz · [...] · Juan Mejia

Activity codes recorded by drillers are very useful for quantifying invisible lost time (ILT). However, classifying more than 100 activity codes accurately and consistently across various rig operations becomes infeasible for human operators. We propose an auto-suggestive system that guides drillers to the correct codes based on the memos they enter into the system. The aim is both to eliminate manual classification errors and to improve memo entry.

The method for extracting activity codes from memos can be broken into the following steps. The first step consists of filtering unnecessary text and vectorizing the memos. The vectors are then re-weighted using the term frequency-inverse document frequency (TF-IDF) statistical measure. Next, data resampling balances the label distribution of the training data, because several important activity codes appear far less frequently than others. Finally, a classifier is trained. It is shown that the finalized model can be used as a real-time auto-suggestive mechanism during the drillers’ data input process. Moreover, its use for cleaning up historical datasets is also explored.

This method was implemented on a large historical dataset consisting of 150 wells, and ILT analysis was performed with the original dataset and with the auto-classified dataset. Comparing these results clearly showed that performing analysis on a dataset that has not been properly classified can lead to incorrect and misleading conclusions. Also, this method did not require manual re-labeling of the dataset for model training. This makes the algorithm readily applicable for any end-user, irrespective of the number of activity codes used. Various classifiers, including logistic regression, support vector machine, random forests, naïve Bayes, and multi-layered perceptron, were implemented and tested. Given their comparable performance, we conclude that a simple and interpretable logistic regression model is best for real-time classification. Tests were also performed to see how many typed words in a memo would be needed before the correct activity code was identified. The results are detailed in the paper.

This is the first body of work that has taken drillers’ memos and converted them into activity codes without the need for a human-classified training dataset. The real-time classifier is very powerful in ensuring clean data at the source and will be particularly useful when implemented on reporting systems for classifying rig activities by IADC activity codes. We further demonstrate the use of the classifier for cleansing historical datasets such that ILT analysis can be done more accurately.
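
The abstract gives no implementation details, so the following is only a minimal sketch of the steps it names (filtering and vectorizing, TF-IDF re-weighting, resampling, training a classifier, and counting how many typed words are needed), written in plain Python. Everything specific is an assumption made for illustration: the example memos, the labels `DRILL`, `REAM`, and `CONN`, the stopword list, and all function names. Random oversampling and a bias-free softmax (multinomial logistic) regression trained by stochastic gradient descent stand in for whatever resampling scheme and classifier settings the authors actually used.

```python
import math
import random
import re
from collections import Counter, defaultdict

# Hypothetical stopword list; the paper's filtering rules are not given here.
STOPWORDS = {"a", "and", "at", "from", "ft", "in", "the", "to", "with"}

def tokenize(memo):
    """Step 1: lowercase, keep alphabetic tokens, drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", memo.lower()) if t not in STOPWORDS]

def fit_idf(token_lists):
    """Smoothed inverse document frequency for every training token."""
    n = len(token_lists)
    df = Counter(t for toks in token_lists for t in set(toks))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

def tfidf(tokens, idf):
    """Step 2: TF-IDF weights, L2-normalized; unseen tokens are ignored."""
    counts = Counter(t for t in tokens if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}

def oversample(vectors, labels, rng):
    """Step 3: duplicate minority-class rows until all classes are equal."""
    by_label = defaultdict(list)
    for vec, y in zip(vectors, labels):
        by_label[y].append(vec)
    target = max(len(rows) for rows in by_label.values())
    out = []
    for y, rows in by_label.items():
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        out.extend((vec, y) for vec in rows + extra)
    rng.shuffle(out)
    return out

def softmax_scores(vec, weights):
    """Class probabilities from linear scores (no bias term, a simplification)."""
    s = {y: sum(w.get(t, 0.0) * v for t, v in vec.items()) for y, w in weights.items()}
    m = max(s.values())
    exps = {y: math.exp(v - m) for y, v in s.items()}
    z = sum(exps.values())
    return {y: e / z for y, e in exps.items()}

def train(data, epochs=200, lr=0.5):
    """Step 4: softmax regression fitted by plain SGD on (vector, label) pairs."""
    weights = {y: defaultdict(float) for y in sorted({y for _, y in data})}
    for _ in range(epochs):
        for vec, y_true in data:
            probs = softmax_scores(vec, weights)
            for y, w in weights.items():
                grad = probs[y] - (1.0 if y == y_true else 0.0)
                for t, v in vec.items():
                    w[t] -= lr * grad * v
    return weights

def suggest(memo, idf, weights, k=3):
    """Return the k most probable activity codes for a (partial) memo."""
    probs = softmax_scores(tfidf(tokenize(memo), idf), weights)
    return sorted(probs, key=probs.get, reverse=True)[:k]

def words_needed(memo, label, idf, weights):
    """Number of leading words typed before `label` becomes the top suggestion."""
    words = memo.split()
    for n in range(1, len(words) + 1):
        if suggest(" ".join(words[:n]), idf, weights, k=1)[0] == label:
            return n
    return None

# Made-up, deliberately imbalanced training memos (not from the paper's dataset).
MEMOS = [
    ("drilling ahead from 5000 ft to 5200 ft", "DRILL"),
    ("drill 8.5 in hole section", "DRILL"),
    ("rotary drilling ahead", "DRILL"),
    ("drilling with high torque", "DRILL"),
    ("ream tight spot at 4300 ft", "REAM"),
    ("reaming and back reaming", "REAM"),
    ("make connection", "CONN"),
]

token_lists = [tokenize(m) for m, _ in MEMOS]
idf = fit_idf(token_lists)
vectors = [tfidf(toks, idf) for toks in token_lists]
data = oversample(vectors, [y for _, y in MEMOS], random.Random(0))
weights = train(data)

print(suggest("back reaming", idf, weights))
print(words_needed("make connection", "CONN", idf, weights))
```

Calling `suggest` on each keystroke-delimited prefix of a memo is one way such a model could drive a real-time suggestion list, and `words_needed` mirrors the kind of test the abstract mentions. In practice a library implementation of TF-IDF, resampling, and logistic regression would likely replace these hand-written helpers.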

Citations (1)


... They address common rig operations, such as drilling, reaming, and coring, and common rig activities, such as pickup, lay-down, and connection. These codes were primarily used for manual reporting, and recent papers have focused on the natural language processing of these digitized reports (Ucherek et al. 2020) to improve usability (and standardization). ...

Reference:

A General Framework to Describe Drilling Process States
Auto-Suggestive Real-Time Classification of Driller Memos into Activity Codes for Invisible Lost Time Analysis
  • Citing Conference Paper
  • February 2020