Timo Immanuel Denk

Timo Immanuel Denk
Google Inc. | Google

Bachelor of Science
Working on applied problems in the domain of audio and music processing

About

6
Publications
8,349
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
6
Citations

Publications

Publications (6)
Thesis
Full-text available
Chargrid is a recently proposed approach to understanding documents with 2-dimensional structure. It represents a document with a grid, thereby preserving its spatial structure for the processing model. Text is embedded in the grid with one-hot encoding on character level. With Wordgrid we extend Chargrid by employing a grid on word level. For emb...
Preprint
Full-text available
For understanding generic documents, information like font sizes, column layout, and generally the positioning of words may carry semantic information that is crucial for solving a downstream document intelligence task. Our novel BERTgrid, which is based on Chargrid by Katti et al. (2018), represents a document as a grid of contextualized word piec...
Technical Report
Full-text available
Conference Paper
Full-text available
BERT is a popular language model whose main pre-training task is to fill in the blank, i.e., predicting a word that was masked out of a sentence, based on the remaining words. In some applications, however, having an additional context can help the model make the right prediction, e.g., by taking the domain or the time of writing into account. This...
Technical Report
Full-text available
Based on an available list of the top 100,000 most popular domains on the web, we define a novel vision-based page rank estimation task: A model is asked to predict the rank of a given web domain purely based on screenshots of its web pages and information about the web link graph that interconnects them. This work is a feasibility study seeking to...
Technical Report
Full-text available
We present a parameterizable neural network meta-architecture for text classification tasks. It is based on one-dimensional separable convolutional layers, followed by a classification head consisting of stacked fully connected layers. The classifier operates on word level, with words represented by word embeddings, which we fine-tune during the tr...

Network

Cited By

Projects

Project (1)
Archived project
Extend "Chargrid" with word-level information