Ankit Satpute

Ankit Satpute
FIZ Karlsruhe - Leibniz Institute for Information Infrastructure | FIZ

Master of Science

About

10
Publications
607
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
16
Citations

Publications

Publications (10)
Conference Paper
Full-text available
Large Language Models (LLMs) have demonstrated exceptional capabilities in various natural language tasks, often achieving performances that surpass those of humans. Despite these advancements, the domain of mathematics presents a distinctive challenge, primarily due to its specialized structure and the precision it demands. In this study, we adopt...
Preprint
Full-text available
The carbon footprint share of the information and communication technology (ICT) sector has steadily increased in the past decade and is predicted to make up as much as 23 \% of global emissions in 2030. This shows a pressing need for developers, including the information retrieval community, to make their code more energy-efficient. In this projec...
Chapter
Defined as “the use of ideas, concepts, words, or structures without appropriately acknowledging the source to benefit in a setting where originality is expected" [6], plagiarism poses a severe concern in the rapidly increasing number of scientific publications.
Chapter
Full-text available
Plagiarism is a pressing concern, even more so with the availability of large language models. Existing plagiarism detection systems reliably find copied and moderately reworded text but fail for idea plagiarism, especially in mathematical science, which heavily uses formal mathematical notation. We make two contributions. First, we establish a tax...
Preprint
Full-text available
This demo paper presents the first tool to annotate the reuse of text, images, and mathematical formulae in a document pair-TEIMMA. Annotating content reuse is particularly useful to develop plagiarism detection algorithms. Real-world content reuse is often obfuscated, which makes it challenging to identify such cases. TEIMMA allows entering the ob...
Preprint
Full-text available
Small to medium-scale data science experiments often rely on research software developed ad-hoc by individual scientists or small teams. Often there is no time to make the research software fast, reusable, and open access. The consequence is twofold. First, subsequent researchers must spend significant work hours building upon the proposed hypothes...
Article
Full-text available
Small to medium-scale data science experiments often rely on research software developed ad-hoc by individual scientists or small teams. Often there is no time to make the research software fast, reusable, and open access. The consequence is twofold. First, subsequent researchers must spend significant work hours building upon the proposed hypothes...
Thesis
Storage system traces are rich in information as it contains real-world behavior. Replaying already recorded traces is used to reproduce the realistic behavior of systems as accurately as possible. The growing popularity of object storage systems and less focus on creating precise trace replay workload leads this work to focus on identifying compon...
Conference Paper
Full-text available
The application of UAS for the inspection of infrastructures has increased significantly in recent years. The possibility to capture all parts of a building with high-resolution images, detect damages and track the condition of a structure over long periods makes UAS an important tool for the maintenance management of traffic infrastructure. The re...

Network

Cited By