Jornada Nacional BIbliotecaria, XI Jornada Nacional Bibliotecaria

Tool for batch processing of digital documents for the REPXOS 3.0 system.

José Javier Hernández Benítez, Aray Villar Machado

Abstract


REPXOS 3.0 is a system for the implantation of digital repositories in scientific and academic institutions, developed in the University of Computer Science (UCI), specifically in the Center for Computerization of Document Management (CIGED). To store the information in REPXOS 3.0 it is necessary introduce manually a metadata with its description inside a form, document by document. This process is easy to perform when it is little information, but when it is in the order of the hundreds or thousands of documents, it brings with it loss of time and use of human resources, reason why sometimes the older documents are not incorporated and also part of the academic and scientific heritage of the institutions is lost. For this reason, it was very important to develop the web application DEPXOS to make more efficient the deposit of digital documents in REPXOS 3.0. DEPXOS allows the user to upload the files he wants to deposit, manage them, extract his metadata automatically and edit those metadata in case he considers it necessary to then make the deposit. To achieve this result, the authors characterized the current worldwide trends in the use of computer systems in the automatic extraction of digital document metadata. In addition, solution was designed to contributes to the batch processing of the digital documents, then the solution proposal was implemented using standards and good practices and finally was validated applying different methods of tests, being used like methodology of development Process Unified Agile UCI variation.