Andres Sanoja

Andres Sanoja
Central University of Venezuela | UCV · Escuela de Computacíon

PhD Computer Science
Data Sonification and Distributed Systems

About

21
Publications
23,184
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
105
Citations
Introduction
I currently work at Central University of Venezuela. I do research in Algorithms, Databases, Web Archives and Distributed Computing. I'm preparing a project in Data Sonification
Additional affiliations
January 2022 - March 2022
Central University of Venezuela
Position
  • Computer Science Graduate Studies Coordinator
Description
  • Teaching, Administrative Activity
October 2005 - March 2022
Central University of Venezuela
Position
  • Coordinator of the Paralell and Distributed Systems Centre
September 2011 - February 2015
Sorbonne Université
Position
  • PhD Student
Education
September 2011 - January 2015
Sorbonne Université
Field of study
  • Web page segmentation, evaluation an applications
September 2005 - July 2008
Central University of Venezuela
Field of study
  • Web Content Extraction

Publications

Publications (21)
Preprint
Full-text available
Sonorización de Logs HTTP en Aplicaciones Distribuidas Resumen: Las aplicaciones distribuidas son cada vez más complejas, lo que convierte la depuración en una tarea desafiante. Los archivos de registro, que graban la actividad de una aplicación, pueden ser una fuente valiosa de información para la depuración. Analizar grandes volúmenes de regist...
Technical Report
Full-text available
This Technical Report is about the development of a Evaluation Tool defined in a previous PhD thesis. Initially it was intended for evaluating Web pages for very experience users. This tool allows general users to create evaluations, following the predefined evaluation model and metrics.
Article
Full-text available
A Web page segmentation is an important task in Web page analysis. The objective is to divide a Web page into blocks, each one representing a coherent part (or segment) of the content. In this work we describe the development of the Manual-design of Blocks (MoB). At the same time we describe how to get a ground truth of segmentations and how to com...
Technical Report
Full-text available
The main objective of this report is to describe the development of a tool for building a ground truth of manual segmentations of Web pages. It is proposed a model for choosing the "best" segmentation which is a selection of the most popular blocks among a set of segmentations, done by several users. The tool is developed as an extension of the Cho...
Conference Paper
Full-text available
Web archives (and the Web itself) are likely to suffer from format obsolescence. In a few years or decades, future Web browsers will no more be able to properly render Web pages written in HTML4 format. Thus we propose a migration tool from HTML4 to HTML5. This is challenging, because it requires to generate HTML5 semantic elements that do not exis...
Technical Report
Full-text available
Select a Groupware open source solution for the Central Bank of Venezuela
Experiment Findings
Full-text available
This repository includes segmentation results for different algorithms, such as : BoM, VIPS, jVIPS, BlockFusion and MIG45. Collections: GOSH and MIG5. This data is used mainly for evaluation. The highlight feature is the geometrics aspect of the segmentation (ie. rectangles), but content information is included as well.
Technical Report
Full-text available
El objetivo del presente reporte es presentar el modelo, los parámetros y la evaluación de herramienta para el modelado de Procesos de Negocios utilizando la notación BPMN en el Banco Central de Venezuela. Se presenta primero el modelo de evaluación. Luego, se presentan los parámetros discutidos y los resultados.
Conference Paper
Full-text available
Web archives are not exempt of format obsolescence. In the near future Web pages written in HTML4 format, could be obsolete. We will have to choose between two preservation strategies: emulation or migration. The first option is the most evident, however due to the size of the Web and the amount of information that Web archives handle it is not pra...
Conference Paper
Full-text available
In this paper, we present a framework for evaluating seg-mentation algorithms for Web pages. Web page segmenta-tion consists in dividing a Web page into coherent fragments, called blocks. Each block represents one distinct information element in the page. We define an evaluation model that includes different metrics to evaluate the quality of a seg...
Article
Full-text available
Web pages are becoming more complex than ever, as they are generated by Content Management Systems (CMS). Thus, analyzing them, i.e. automatically identifying and classifying different elements from Web pages, such as main content, menus, among others, becomes difficult. A solution to this issue is provided by Web page segmentation which refers to...
Article
Full-text available
In this paper we describe Block-o-Matic, a web page segmentation framework. It is a hybrid approach inspired by automated document processing methods and visual-based content segmentation techniques. A web page is associated with three structures: the DOM tree, the content structure and the logical structure. The DOM tree represents the HTML elemen...
Article
Full-text available
Poster presented in the iPRES 2012 conference at the Information Faculty of the University of Toronto, Canada.
Article
Full-text available
The motivation of this work is to provide criteria oriented to the software leaders of the U.C.V Science Faculty for the selection of web technologies for the development of a module for the “Control de Estudios” System (CONEST), proposing to measure them by the use of software metrics. The module was developed in two versions, using different Web...
Article
Full-text available
This article describes the design and implementation of Extratos 1 , a Service Oriented In-formation Extraction System for web content sharing, based on web services as extractors and BPEL business process generation. Some insights from archaeological sciences are applied to the design of the system. It is organized in five subsystems: Xpathula, La...
Article
Full-text available
Este documento se centra en el análisis del gobierno electrónico en Venezuela, partiendo del análisis de las estrategias y lineamientos establecidos en el Plan Nacional de Tecnologías de la Información (Ministerio de Ciencia y Tecnología, 2001) y de los fundamentos, objetivos, principios rectores y bases legales definidas para el Gobierno Electróni...
Article
Full-text available
La tendencia mundial a la transformación del Estado utilizando las TIC existentes es hoy una realidad bajo el nombre de gobierno-e. Latinoamérica no escapa a ello y sus gobiernos están trabajando, en conjunto con organismos multilaterales, para implantar la tecnología y el conocimiento necesarios para llevarlo a cabo. Existen dos tendencias al desa...
Article
Full-text available
El gobierno electrónico es un modelo de desarrollo del estado que consiste en el uso de las Tecnologías de la Información y la Comunicación (TIC) en los procesos internos de gobierno y en los procesos externos de interacción entre el estado y los ciudadanos, para la mejora de los servicios públicos, el fortalecimiento de la responsabilidad administ...

Network

Cited By