About
56
Publications
20,219
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
183
Citations
Introduction
Current institution
Additional affiliations
October 2004 - present
October 2004 - present
March 1999 - October 2004
Publications
Publications (56)
Presentation by Alejandro Bia describing our experience with Just-in-time teaching and Team-based Learning
http://dhw.umh.es/slides/XARXES18/#/0
El presente trabajo describe el proceso de creación de un archivo digital para investigadores, que comprende una base de datos con datos de investigación de humanidades, y un repositorio de documentos de investigación asociados a estos datos. En este trabajo, vamos a discutir las decisiones de diseño que debieron tomarse para evitar o minimizar los...
La creación de nuevos documentos XML, desde cero o a partir de texto plano, puede ser una tarea difícil, lenta y propensa a errores, sobre todo cuando el vocabulario de marcado utilizado es rico y complejo, como es el caso del TEI. Por lo general, lleva bastante tiempo lograr que el documento valide por primera vez.
Juntando el espíritu del viejo...
Abstract: Education methods for millennials must accommodate
their expectations and behaviors. Active learning methodologies
seem to be adequate for this requirement. In particular, in this
paper, we discuss the design and deployment of Team-Based
Learning (TBL) in two undergraduate Software Engineering
courses. TBL is a type of Active Learning Met...
Resumen El aprendizaje basado en equipos (TBL) es un enfo-que pedagógico colaborativo que estructura la plani-ficación, ejecución y evaluación de asignaturas con el fin de mejorar el compromiso de los estudiantes y la calidad del aprendizaje, y que puede clasificarse dentro del conjunto de los métodos de clase inverti-da 1. TBL pone énfasis en las...
The aim of this project is to provide a wide collection of useful text processing tools as online services. This includes tools related to XML documents, DTDs, Schemas, XSLT processing, HTML, etc.
Most of these tools are already available, but to be useful, they have to be downloaded and properly installed on the user's machine. The idea of having...
The estimate of digitization costs is a very difficult task. It is difficult to make exact predictions due to the great quantity of unknown factors. However, digitization projects need to have a precise idea of the economic costs and the times involved in the development of their contents. The common practice when we start digitizing a new collecti...
We often use UML diagrams for our software development projects, and also for modeling XML DTDs and Schemas [1], finding that
although UML diagrams can effectively be made to represent DTDs and Schemas (either using Class or Component diagrams), in
real practice, complex DTDs and Schemas produce unreadable, unmanageable, complex UML diagrams. Recen...
The estimate of digitization costs is a very difficult task. It is difficult to obtain accurate values because of the great quantity of unknown factors. However, digitization projects need to have a precise idea of the economic costs and the times involved in the development of their contents. The common practice when we start digitizing a new coll...
This paper describes the engineering foundations of VisualWADE, a CASE tool to automate the production of Web applications. VisualWADE follows a model-driven approach focusing on requirements analysis, high level design, and rapid prototyping. In this way, an application evolves smoothly from the first prototype to the final product, and its mainte...
In this short paper, we present a Ubiquitous ContentDistribution with Semantic Web Interfaces (UCD/SW). The UCD/SW has been designed as a distributed system to promote cooperative learning using semantic
web and ubiquitous computing. This short paper argues how UCD/SW has been designed with research technologies dealing with
cooperative design, dis...
Markup is based on mnemonics (i.e. element names, attribute names and attribute values). These mnemonics have meaning, being this one of the most interesting features of markup. Human understanding of this meaning is lost when the encoder doesn't understand the language the mnemonics are based on. By "multilingual markup" we refer to the use of par...
This paper describes the engineering foundations of VisualWADE, a CASE tool to automate the production of Web applications.
VisualWADE follows a model-driven approach focusing on requirements analysis, high level design, and rapid prototyping. In
this way, an application evolves smoothly from the first prototype to the final product, and its mainte...
This paper discusses the applicability of modeling methods originally meant for business applications, on the design of the complex markup vocabularies used for XML Web-content production.
We are working on integrating these technologies to create a dynamic and interactive environment for the design of document markup schemes. This paper focuses on...
The estimate of web-content production costs is a very difficult task. It is difficult to make exact predictions due to the
great quantity of unknown factors. However, digitization projects need to have a precise idea of the economic costs and times
involved in the development of their contents. As it happens with software development projects, inc...
Markup is based on mnemonics (i.e. element names, attribute names and attribute values). These mnemonics have meaning, being this one of the most interesting features of markup. Human understanding of this meaning is lost when the encoder doesn't have a good command of the language the mnemonics are based on. By "multilingual markup" we refer to th...
Digital Humanities (DH) and Digital Library (DL) projects are complex systems that require specialized programming skills. Many encoders cannot take their work to the next level by transforming their collections of structured XML texts into a web searchable and browsable database. Often teams of text encoders are able to encode their texts with a h...
Digital Libraries are complex systems that take a long time to create and tailor to specific requirements [1]. Their implementation requires specialized computer skills, which are not usually found within humanities text encoding projects. Many encoders working on text encoding projects find they cannot take their work to the next level by transfor...
In this article we describe our experience in the development of a personalizable dissemination model for the Miguel de Cervantes
Digital Library’s Web-based newsletter-service, which combines adaptive with adaptable personalization techniques, being capable
or ranking news according to navigation-inferred preferences and then filter them according...
Digital Libraries of literary works usually store a huge amount of textual information. It is obvious that the mere accumulation of texts leads only to a limited-use library. Hence the need for efficient information retrieval services. The use of indices to speed up the search is advisable in cases like ours, the "Miguel de Cervantes" digital libra...
In this paper we are introducing the MatchDetectReveal system, which is capable of identifying the similarity between documents. Different applications of the system are discussed including cross-referencing multiple editions of literary works, plagiarism detection, organizing collections of documents and comparative analysis of texts. The system u...
The purpose of this article is to describe our approach to the massive production of facsimile-type hypertext books that contain digital images of manuscripts and old printings to be published on the Internet as one of our DL services . The goal of this project is to offer an easy-to-use interface that allows customizable views of facsimile images...
The Miguel de Cervantes Digital Library hopes to act as a vehicle for the Hispanic academy to promote their works, as a window to Hispanic literature and culture for scholars of Hispanic languages, and as a voice for the Hispanic community worldwide meant to reach an international multiracial and multilingual student and academic community of Inter...
We describe the digital-book-production flow of the Miguel de Cervantes Virtual Library, from book acquisition up to Internet publishing, highlighting the main requirements and design considerations of the workflow system.
According to [1] Internet is a market with many micromarkets, based on needs, interests and trends, both personal and professional. Each and every space of the net is atomized to reach the users, with their own preferences and behaviors. Our DL project intends to give the users a customized view where they could receive personalized information. We...
With an aim of bringing cultural contents to cyberspace and spreading some unknown aspects of the history of the Americas, the Miguel de Cervantes Digital Library has embarked in a joint e#ort with the Library of the Royal Palace of Spain, to develop the digital web publication of the Manuscripts of the Americas in the Royal Collections funds. In t...
In this paper we are introducing the MatchDetectReveal system, which is capable of identifying the similarity between documents. Different applications of the system are discussed including cross-referencing multiple editions of literary works, plagiarism detection, organizing collections of documents and comparative analysis of texts. The system u...
Regular tree automata (RTA) or, equivalently, forest regular grammars (FRG) have been recently proposed for use as XML (extended markup language) schemata. They are more powerful than usual XML DTDs (document-type definitions) , make the implementation, optimization and pruning of XML queries easier and allow for the implementation of context-sensi...
Experimenting with some changes and simplifications to the Alopex algorithm, we obtained a new faster version (Alopex-B), that also shows lower failure rates on training attempts. Like Alopex, our version is network-architecture independent, does not require error or transfer functions to be differentiable, has a high potential for parallelism, and...
With an aim of bringing cultural contents to cyberspace and spreading some unknown aspects of the history of the Americas, the Miguel de Cervantes Digital Library has embarked in a joint effort with the Library of the Royal Palace of Spain, to develop the digital web publication of the Manuscripts of the Americas in the Royal Collections funds. In...
This demo describes the philosophy behind what represents one of the most ambitious projects of its kind in the Spanish-speaking world: The Miguel de Cervantes Digital Library (http://cervantesvirtual.com/). It shows the new ground being explored in terms of the wide variety of contents, media, functionality and services it offers to a worldwide au...
This demo describes the philosophy behind what represents one of the most ambitious projects of its kind in the Spanish-speaking world: The Miguel de Cervantes Digital Library (http://cervantesvirtual.com/). It shows the new ground being explored in terms of the wide variety of contents, media, functionality and services it offers to a worldwide au...
La preservación digital es una batalla perdida a largo plazo, ¿o no es así? La ley de la entropía juega en contra de este propósito. No importa el medio físico que elijamos: éste se degradará con el paso de tiempo con la consiguiente pérdida de información. El objetivo de la preservación digital es retardar esta degradación tanto como sea posible....
This article describes a joint research work between Monash University and the University of Alicante, where software originally
meant for plagiarisman and copy detection in academic works is successfully applied to perform comparative analysis of different
editions of literary works. The experiments were performed with Spanish texts from the Migue...
This paper describes the philosophy behind what represents one of the most ambitious projects of its kind ever to have been undertaken in the Spanish-speaking world: the Miguel de Cervantes Digital Library (http://cervantesvirtual. com/). It explains the reasons behind its creation, the private-public sector alliance that has made it possible, and...
The purpose of this poster is to describe our approach to provide facsimiles of manuscripts and old books as one of our DL services publicly available by Internet.
En este art´ iculo se pretende explicar la investigacion llevada a cabo por la Biblioteca Virtual Miguel de Cervantes 1 en el desarrollo de una pol´ itica de marcado de textos literarios y la automatizacion de su procesamiento. Tambien se intenta explicar y argumentar algunas de las decisiones de diseno tomadas y las soluciones de compromiso a que...
We studied the performance of the Alopex algorithm, and proposed
modifications that improve the training time, and simplified the
algorithm. We tested different variations of the algorithm. We describe
the best cases and summarize the conclusions we arrived at. One of the
proposed variations (99/B) performs slightly faster than the Alopex
algorithm...
Con frecuencia, las Bibliotecas Digitales tienen la necesidad de extraer información a partir de documentos pobremente marcados para almacenarla en bases de datos o crear nuevos documentos hipertexto con un marcado altamente estructurado. En este trabajo, abordaremos el problema de extraer información bibliográfica a partir de informes literarios e...
Electronic publishing makes it possible to reach every corner of the world and opens up new research and communication paths. In this article we describe the production model and implementation of an electronic news service for a DL, that manages altogether five different DL-newsletters plus a monthly journal, each one of them delivered in several...
Ph.D. theses are a fundamental pillar for the application and devel-opment of knowledge. Generally, Ph.D. theses have a very limited reach. Their printed publication is rarely viable in a massive way. Often, only a partial publication is possible. Unfortunately, many original research projects, to which authors have devoted long working hours, lie...
Universidad de Alicante, apartado de correos 99, E-03080, España 20 de febrero de 2002 Resumen Most often, Digital Libraries have the need to extract information from poorly marked-up documents to fill databases or create new hy-pertext documents with a highly structured markup. In this work, we approach the problem of extracting bibliographic info...
This article describes the requirements and technological solutions adopted by the National Library of Spain for its Digital Library section concerning metadata. It also discusses different approaches for metadata handling in general. ♣
The largest effort in the area of standardisation of computer encoding of language resources has been the Text Encoding Initiative (TEI), established in 1987. TEI chose as its underlying standard SGML (Standard Generalized Markup Language), and in the years before the inception of XML, a number of projects encoded their data according to some SGML...
Resumen. En este artículo mostraremos y defenderemos los beneficios del uso de esquemas de marcado XML multilingües para grandes proyectos de digitalización como la Biblioteca Virtual Miguel de Cervantes 1 y el consecuente incremento en la producción debido a la utilización de las etiquetas en la lengua propia. Del mismo modo, también mostraremos e...
In this paper we will show the benefits of using multilingual markup schemes for large digitization projects like the Miguel de Cervantes Digital Library, and the advantages of using markup tags in one's own language, like the consequent increase in production, reduction of markup errors, and improved facilities for advanced XML based retrieval. Th...
La Biblioteca Virtual Miguel de Cervantes es probablemente el proyecto de biblioteca digital de lenguas hispanas con el mayor número de libros en línea en la actualidad. Casi tres años después de su puesta en funcionamiento, la Biblioteca Virtual Miguel de Cervantes se ha convertido en la página web de consulta obligada para los amantes de la liter...