Leo Galambos

Leo Galambos
Czech Technical University in Prague | ČVUT · Department of Security Technologies and Engineering

About

16
Publications
1,719
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
108
Citations
Additional affiliations
September 2010 - present
Charles University in Prague
Position
  • External Member
September 2008 - present
Czech Technical University in Prague
Position
  • Research Assistant
September 2004 - September 2008
Charles University in Prague
Position
  • Research Assistant

Publications

Publications (16)
Article
Full-text available
The majority of today's IR systems base the IR task on two main processes: indexing and searching. There exists a special group of dynamic IR systems where both processes (indexing and searching) happen simultaneously; such a system discards obsolete information, simultaneously dealing with the insertion of new in-formation, while still answering u...
Conference Paper
Full-text available
Syllable-based compression achieves suciently good results on text documents of a medium size. Since the majority of XML docu- ments are of that size, we suppose that the syllable-based method can give good results on XML documents, especially on documents that have a simple structure (small amount of elements and attributes) and rela- tively long...
Conference Paper
The world of mathematical knowledge on the WWW has grown enormously. Despite the clear importance of a mathematical search engine this research field had been abandoned until very recently. Although, currently available full text search engines can be used on these documents too, they are deficient in almost all cases. They cannot handle structured...
Chapter
Full-text available
The crawling theory and practice are summarized. The paper includes web theory related to crawling, description of various crawling strategies, and popular implementations available for research or industrial use.
Conference Paper
Full-text available
EgoMath is a full text search engine focused on digital mathematical content with little semantic information available. Recently, we have decided that another step towards making mathematics in digital form more accessible was to enable mathematical searching in one of the world’s largest digital libraries - Wikipedia. The library is an excellent...
Data
Egothor2 software (2004-2009) with fixes (2013)
Conference Paper
Full-text available
In this position paper we discuss the what, who, when, where, why and how of uncertain reasoning based on achievements of URW3XG [2], our experiments and some future plans. What and Why – improving semantic web practice through uncertain reasoning. This vision is described in the URW3XG charter (see [2]), especially the objective is "to identify an...
Article
Full-text available
The WWW became the main resource of mathematical knowledge. Currently available full text search engines can be used on these documents but they are deficient in almost all cases. By applying axioms, equal transformations, and by using different notation each formula can be expressed in numerous ways. Most of these documents do not contain semantic...
Conference Paper
In this paper two stemmers which extract stemming rules from a sample dictionary of transformations are compared. We present their capability to generalize the information in the dictionary. The factors which affect such a generalization are also shown and discussed.
Conference Paper
Full-text available
The majority of today’s IR systems base the IR task on two main processes: indexing and searching. There exists a special group of dynamic IR systems where both processes (indexing and searching) happen simultaneously; such a system discards obsolete information, simultaneously dealing with the insertion of new information, while still answering us...
Article
This work examines the area of IR systems which work in a dynamic, multilingual environment, such as, the Internet or the environment of a big enterprise. A new stemmer technique is developed that could ensure the better processing of information. Such a technique is suitable for, among other things, the multilingual processing of text (information...
Conference Paper
Full-text available
Stemming is a widely accepted practice in Document Information Retrieval Systems (DIRs), because it is more benefical than harmful [3]as well as having the virtue of improving retrieval effciency by reducing the size of the term index. We will present a technique of semi-automatic stemming that is fine designed for JAVA environment. The method work...
Article
. The distributed information and retrieval systems (DIRS) should have little impact on the user other than to make a larger number of document available in them. There are four major problems. First, dierent databases have dierent formats and processing requirements. Second, data redundancy. Third, dierent systems of DIRS may rate a given document...
Article
Full-text available
EGOTHOR is a search engine that indexes the Web and allows us to search the Web documents. Its hit list contains URL and title of the hits, and also some snippet which tries to shortly show a match. The snippet can be almost always assembled by an algorithm that has a full knowledge of the original document (mostly HTML page). It implies that the s...

Network

Cited By

Projects

Projects (3)
Project
This project implements secure services for people localization using GSM networks. The auxilliary data is concentrated and a system is able to detect and locate dangerous GSM interceptors in public mobile networks.
Archived project
Project
Q!D is a novel database system written on top of J5M platform with the support of Java 8 Streams API and Lambda expressions. It implements the anamorphic (quantum) database from Egothor3 project. Q!D aims to manage and process big-data collections faster, and in more secure environment, than any other product.