TDG IntegraWeb Committed to research! TDG Site Manager 1.1

Seminars

Web Information Extraction using Machine Learning Techniques

By Gretel Fernández
on Tuesday, December 13, 2011
at 09:30 AM

Gretel's reported on the work she's carrying out to develop a new technique to extract information from web pages that builds on Bayes probabilities.

Enterprise Application Integration

By Rafael Z. Frantz
on Saturday, December 03, 2011
at 10:25 PM

Rafael presents his PhD thesis to the department.  The thesis will be defended on Feb, 17 2012!

Beyond Bayesian Learning

By Pablo M. Olmos
on Friday, November 25, 2011
at 11:30 AM

Pablo reports on his findings regarding linear parametric models for regression and classi cation.  The results might well be applied to several problems dealt with in the context of IntegraWeb.

On Balancing Datasets

By Gretel Fernández
on Wednesday, May 04, 2011
at 09:30 AM

Gretel reports on several techniques she has studied to balance datasets that are not balanced. Note that this is a major problem regarding the ideas we're exploring to infer web extraction rules using standard machine learning techniques like C4.5 or SVM.

Data Integration Architectures

By Alberto Pan
on Thursday, April 28, 2011
at 06:30 PM

Alberto reports on the following data integration architectures: SOA, data warehouse, and mashups. He provides a few details on three real-world integration problems.

An Experience on using Datamining Techniques to Extract Information from the Web

By Gretel Fernández
on Tuesday, April 19, 2011
at 09:30 AM

Gretel reports on the technique she's developing to extract information from web pages building on features and datamining (more generally machine learning) techniques.

The Guaraná Editor Revisited

By David Gallego
on Monday, April 18, 2011
at 09:30 AM

David reports on his work to implement a Guaraná editor using the DSL Tools implementation provided by Visual Studio 2010.

Web Downloader

By Adrián Mesa
on Friday, February 25, 2011
at 12:00 PM

Adrián reports on the work he's carrying out to build a web downloader.

Listening Platforms

By Fernando Ortega
on Wednesday, January 26, 2011
at 05:00 PM

Fernando, who works for Dinamic Area, reports on listening platforms: a definition, applications, case studies, and so on.

Fault Tolerance in Guaraná

By Félix Pérez
on Wednesday, December 15, 2010
at 04:30 PM

Félix reports on some preliminary ideas to endow Guaraná with fault tolerance capabilities.

On Decision-Making Regarding EAI

By Dania Pérez
on Friday, December 03, 2010
at 12:00 PM

Dania reports on her PhD work regarding a method for decision-making in the context of Enterprise Application Integration.

Automated Analysis of Orthogonal Variability Models using Constraint Programming

By Fabricia C. Roos-Frantz
on Friday, November 12, 2010
at 12:00 PM

Fabricia reports on the work she's conducting to automate the analysis of variability models in software product lines using constraint programming.

Information Mediators

By Carlos R. Rivero
on Friday, November 05, 2010
at 12:00 PM

Carlos reports on his progresses regarding devising and building information mediators for RDF-based data.

Advanced Separation of Concerns

By Toñi Reina
on Friday, October 29, 2010
at 12:00 PM

Toñi reports on her work regarding advanced separation of concerns in the development of web applications.

Advances in Guaraná

By Rafael Z. Frantz
on Friday, October 22, 2010
at 12:00 PM

Rafael reports on his progress regarding Guaraná, which is the name we use to refer to our DSL, framework, and toolkit for Enterprise Application Integration.

Recommender Techniques and Social Models applied to Web Services Selection and Composition

By Leandro K. Wives
on Friday, October 15, 2010
at 12:00 PM

Leandro reports on the research work he and his colleagues are conducting at UFRGS.  Their focus in on recommender systems, on service composition, and marginally on text mining.

Ubicomp and Model Driven Architecture at UFRGS

By Cláudio F.R. Geyer
on Thursday, October 14, 2010
at 12:00 PM

Cláudio reports on the research work on ubiquitous computing and model drive architectures he and his colleagues are conducting at UFRGS.

Overview of Model-Oriented Research Projects at UFRGS

By Marcelo S. Pimenta
on Wednesday, October 13, 2010
at 12:00 PM

Marcelo reports on the research that is been conducted in his research group regarding model orientation in software engineering.

Report on the TDG

By Rafael Corchuelo
on Monday, October 11, 2010
at 12:00 PM

Rafael presents a report on the TDG.  This is the first talk in a series of three seminars together with our colleagues from UFRGS.

One Class Classifiers and Information Verification

By Iñaki Fernández
on Friday, October 08, 2010
at 12:00 AM

Iñaki reports on his progress regarding using one-class classifiers to verify information.

Report on our Improvements to FOIL

By Patricia Jiménez
on Friday, October 01, 2010
at 12:00 AM

Patricia reports on the work she's conducting regarding improving the classic first-order learner FOIL:

The TGD Information Extraction Framework

By Hassan A. Sleiman
on Friday, September 24, 2010
at 12:00 PM

Hassan reports on his advances regarding information extraction and the framewrok he's devising and building.

On Intelligent Navigation

By Inma Hernández
on Friday, September 17, 2010
at 12:00 PM

Inma reports on her progress regarding intelligent navigation on the Web.

The SOLERES Project

By Luis Iribarne
on Thursday, April 08, 2010
at 05:00 PM

Luis reports on the SOLERES, project, which he co-ordinates at the University of Almería.  This project is a practical case in which software, data, and knowledge engineering are integrated smoothly.

On the Design of a FOIL Implementation

By Patricia Jiménez
on Wednesday, March 31, 2010
at 10:30 AM

Patricia reports on the implementation of FOIL she is devising.

A Web Page Tokenniser

By Francisco M. Pérez
on Wednesday, March 10, 2010
at 12:00 AM

Francisco reports on his preliminary design of a tokenniser.

An Annotation Tool

By Juan Infante
on Friday, February 12, 2010
at 05:00 PM

Juan reports on his preliminary ideas to implement a new semantic annotation tool.

Information Extraction using Data Mining Techniques

By Gretel Fernández
on Wednesday, February 10, 2010
at 05:00 PM

Gretel reports on her first experiments to extract information from web pages using classsical data mining techniques.

New Ideas regarding Information Extraction

By Jorge Moreno
on Friday, February 05, 2010
at 05:00 PM

Jorge, from Atenea Innova, reports on his practical applications of information extraction and his ideas to build a new learner of information extractors.

Reaping Tools (Extended Seminar)

By Rosa M. Burrueco, Pablo Íñigo
on Friday, January 22, 2010
at 05:00 PM

Rosa and Pablo report extensively on the reaping tools they have designed.  The presentation includes details about the user interface, the design of the tools, and several hand-on labs.

Information Integration

By Carlos Rivero
on Friday, January 15, 2010
at 05:00 PM

Carlos reports on several approaches to information integration.

Automatic Navigation

By Inmaculada Hernández
on Friday, December 18, 2009
at 12:00 AM

Inma reports on several approaches to automate web navigation.

Advances in Guaraná DSL

By Rafael Z. Frantz
on Friday, December 11, 2009
at 05:00 PM

Rafael reports on his advences regarding the design and implementation of the Guaraná DSL for Enterprise Application Integration.

A Revisitation of FivaTech

By Ahmed Y. Riveras
on Friday, December 04, 2009
at 05:00 PM

Ahmed revisits the implementation of the FivaTech algorithm on which he has been working.

An Instantiation of Open UP

By Raúl Sánchez
on Friday, November 27, 2009
at 05:00 PM

Raúl reports on an instantiation he has devised of the Open UP methodological framework.

A Revisitation of the SoftMealy Algorithm

By Antonio C. Maraver
on Friday, November 20, 2009
at 05:00 PM

Carlos has almost implemented the SoftMealy algorithm.  In this talk, he revisits it and the implementation on which he has been working.

A Revisitation of Reaping Tools

By Rosa M. Burrueco, Pablo Íñigo
on Friday, November 13, 2009
at 05:00 PM

Rosa and Pablo report on a tool they have devised, designed and implemeted to reap web data islands.  Reapers play an important role to gather the pages required to traing a learner of web extractions rules, for instance.

Distributed Data Integration

By Alberto Pan
on Thursday, October 15, 2009
at 05:00 PM

Alberto provides an overview of the problem and reports on the tools he and his team are developing to address it.

Integrating Multi-Similarity Systems

By Ismael Sanz
on Friday, July 03, 2009
at 05:00 PM

Ismael provides a comprehensive introduction to integrating multi-similarity system, which is very useful to retrieve data from unstructured collections of similar documents.

Introduction to Bio-Portals

By Ismael Sanz
on Friday, June 19, 2009
at 05:00 PM

Ismael provides a good introduction to web portals that provide bio-medical inforamation.  We are exploring together how to integrate them.

Data Mining Techniques

By Gretel Fernández
on Friday, May 29, 2009
at 05:00 PM

Gretel provides a comprehensive introduction to classical data mining techniques, and provides a few hints on our idea to use them to extract information from web pages.

The X-LRT Algorithms

By Patricia Jiménez
on Friday, May 15, 2009
at 05:00 PM

Patricia reports on a famility of algorithms by Kushmerick that can be used to learn information extractors for typical web pages.

Web Page Classification

By Inmaculada Hernández
on Friday, May 08, 2009
at 05:00 PM

Inma reports on several approaches to classify web pages.

The FivaTech Algorithm

By Ahmed Y. Riveras
on Friday, April 24, 2009
at 05:00 PM

Ahmed reports on the FivaTech algorithm, which can analyse a web page automatically and infer the structure of the data it contains and extraction rules.

Introduction to Enterprise Application and Information Integration

By Rafael Corchuelo
on Friday, April 17, 2009
at 05:00 PM

Rafael provides a comprenhensive introduction to the world of Enterprise Application and Information Integration. 

The SoftMealy Algorithm (Part 2)

By Antonio C. Maraver
on Monday, April 06, 2009
at 05:00 PM

Carlos provides additional details on his previous seminar.

The SoftMealy Algorithm (Part 1)

By Antonio C. Maraver
on Friday, April 03, 2009
at 05:00 PM

Carlos reports on SoftMealy, which is an algorithm to learn web information extractors.

On Information Extraction

By José L. Arjona
on Thursday, April 02, 2009
at 05:00 PM

José Luis provides a comprehensive introduction to the world of learning information extractor for web pages.

The FOIL Algorithm

By Patricia Jiménez
on Thursday, April 02, 2009
at 05:00 PM

Patricia reports on the well-know first-order logic rule learner called FOIL, and on our ideas to apply it to learn information extraction rules.

On the Design of a Reaper

By Rosa M. Burrueco
on Friday, March 27, 2009
at 05:00 PM

Rosa reports on her preliminary steps towards designing and implementing a reaper, which is a tool that allows to issue a number of queries to a number of web search forms and download the results.

Automatic Form Filling

By Carlos Rivero
on Friday, March 20, 2009
at 05:00 PM

Carlos reports on a technique that allows to map a SQL-like query onto a search form provided by a web application.

Information Verifiers

By Iñaki Fernández
on Friday, March 13, 2009
at 05:00 PM

Iñaki reports on how to build an information verifiers, and on a few preliminary ideas we are exploring to improve on the existing techniques.

Data and Knowledge Integration

By Rafael Berlanga
on Thursday, March 12, 2009
at 05:00 PM

Rafael reports on the work he and his team is conducting regarding Enterprise Information Integration in the context of The Semantic Web.

Introduction to Information Extractors

By Hassan A. Sleiman
on Friday, February 27, 2009
at 05:00 PM

Hassan introduces us to the world of information extractors.  He reports on the foundations and provides a few details about a number of learners.

Introduction to Classifiers

By Inmaculada Hernández
on Friday, February 20, 2009
at 05:00 PM

Inma reports on machine learning techniques to build classifiers.

Introduction to TDG Scholar

By Nicolás Amador, Agustín Domínquez
on Friday, February 13, 2009
at 05:00 PM

Nicolás and Agustín report on TDG Scholar 1.1, which is our bibliography search service, and glimpse at future features of TDG Scholar 2.0.

On the Design of TDG Scholar 2.0

By Nicolás Amador, Agustín Domínquez
on Friday, February 13, 2009
at 05:00 PM

Nicolás and Agustín report on our preliminary design of TDG Scholar 2.0.

The STAVIES Algorithm

By Rafael Maiquez
on Friday, February 06, 2009
at 05:00 PM

Rafael reports on the STAVIES algorithm, which can be used to detect areas of interest within a web page in a totally unsupervised manner.

Web Page Classification

By Rosario Arjona
on Friday, January 16, 2009
at 05:00 PM

Rosario reports on an algorithm that helps classify web pages; it builds on the structure of a number of training pages, and can determine if a new unseen page can be considered in the same category or not.

Introduction to ITIL

By Moisés Robles
on Wednesday, December 10, 2008
at 05:00 PM

Moisés shall report on ITIL, which is a library of good practices to manage information technology services.  Our interest is on integration services.

An Experience Regarding Project Management

By José M. Portero
on Friday, November 21, 2008
at 05:00 PM

José Manuel shall talk to us about the methodology on which he and his team have been working for the last months.  This methodology is intended to help software engineers manage their integration projects more effectively.

Optimising FOIL

By Pablo Palacios
on Wednesday, May 28, 2008
at 05:00 PM

Pablo shall present his ideas to optimise FOIL so that it can be used efficiently to learn information extraction rules.

Future Trends regarding Wrappers

By Vicente Luque
on Sunday, May 18, 2008
at 05:00 PM

Vicente shall explore current trends regarding wrappers, and shall make a point of envisioning future trends in this area.

Wrappers for the Deep Web

By Vicente Luque
on Friday, May 16, 2008
at 05:00 PM

Vicente shall report on creating wrappers for deep web applications.  His talk shall include a hands-on lab about a tool for creating wrappers.

Applications of the Semantic Web

By Óscar Corcho
on Thursday, May 15, 2008
at 05:00 PM

Óscar shall report on the work his research team is conducting regarding tools for the Semantic Web, and on the applications they are developing.

On the Development of An Enquirer

By Carlos R. Rivero
on Friday, May 09, 2008
at 05:00 PM

Carlos shall report on an algorithm that can map SQL queries onto search forms, and also on his implementation.

An Annotation Tool

By Nicolás Amador and Agustín Domínguez
on Tuesday, May 06, 2008
at 05:00 PM

Nicolás and Agustín shall report on the tools to annotate web pages on which they have been working.

Introduction to DSL Tools

By Abdul W. Sultán
on Friday, April 25, 2008
at 05:00 PM

Abdul shall introduce us to Microsoft's DSL Tools, which we are planning on using to support our integration tools. 

Introduction to WFF

By Hassan A. Sleiman
on Friday, April 18, 2008
at 12:00 AM

Hassan shall introduce us to the Windows Workflow Foundation, which a tool we are planning on using to support our integration tools.

The FivaTech Algorithm

By Antonio R. Gómez
on Friday, March 28, 2008
at 12:00 AM

Antonio shall introduce us to an unsupervised algorithm called FivaTech, which is able to learn how to extract data records from web pages in a manner that is totally automatic.

Co-referencing Objects

By Antonio C. Maraver
on Friday, March 14, 2008
at 05:00 PM

Antonio shall report on an algorithm that allows to find which data records can be considered the same, despite they are lexically and/or syntactically different.

The RoadRunner Algorithm

By Santiago Lozano y María Arías de Reina
on Friday, March 07, 2008
at 05:00 PM

Santi y María shall report on an algorithm that can be used to compare pages and extract their common patterns; it is referred to as RoadRunner.

The DataProg Algorithm

By Samuel Pérez
on Friday, February 22, 2008
at 05:00 PM

Samuel shall report on an algorithm that can be used to extract lexical patterns from data.  This can be used in information verification tasks.

The Stavies Algorithm

By Jesús Cozar
on Friday, February 15, 2008
at 05:00 PM

Jesús shall report on the Stavies algorithm, which can be used to identify data records and attributes in semi-structured web pages.

On the Design of Integration Solutions

By Rafael Z. Frantz
on Friday, February 08, 2008
at 12:00 AM

Rafael shall report on the language to design application integration solutions that he is devising.

The SoftMealy Algorithm

By Raúl Sánchez
on Friday, February 01, 2008
at 05:00 PM

Raúl shall report on an algorithm called SoftMealy, which can be used to learn information extraction rules.

The Daikon Algorithm

By Edgar Miranda
on Friday, January 18, 2008
at 12:00 AM

Edgar shall report on an algorithm called Daikon that can be used to induce invariants from a set of data.  This can be useful for information verification.

Introduction to Enterprise Application Integration (II)

By Rafael Z. Frantz
on Saturday, January 12, 2008
at 05:00 PM

Rafael shall report on enterprise application integration and a few preliminary ideas regarding his PhD thesis.  This is a two-part seminar

Introduction to Enterprise Application Integration (I)

By Rafael Z. Frantz
on Friday, January 11, 2008
at 05:00 PM

Rafael shall report on enterprise application integration and a few preliminary ideas regarding his PhD thesis.  This is a two-part seminar.

The WHISK Algorithm

By Javier Márquez
on Wednesday, December 19, 2007
at 05:00 PM

Javier shall report on an algorithm to learn rules to extract information from web pages that is called WHISK.

The MDR Algorithm

By Christian Marmolejo
on Friday, December 14, 2007
at 05:00 PM

Christian shall report on an algorithm called MDR, which can be used to identify data records in a web page.

The Maveric Framework

By Diego Campoy
on Thursday, November 29, 2007
at 05:00 PM

Diego shall present a framework for information verification called Maveric

The Stalker Algorithm

By Juan M. Piñero
on Friday, November 23, 2007
at 05:00 PM

Juanma shall report on an algorithm called Stalker, which can be used to learn rules to extract information from semi-structured web pages.

The IntegraWeb Reference Architecture

By José L. Arjona
on Friday, June 15, 2007
at 05:00 PM

José Luis shall report on the reference architecture on which we are working to address the integration of non-dismantelable web applications.

Integrating Non-Dismatelable Web Applications

By José L. Arjona
on Thursday, June 14, 2007
at 05:00 PM

José Luis shall report on the problems related to integrating web applications that do not provide a programmatic interface, but a web user interface only.

Hands-on Lab about Semantic Web Technologies

By José L. Arjona
on Wednesday, June 13, 2007
at 05:00 PM

José Luis shall helps us get started with a few tools from the Semantic Web.

Semantic Web Technologies for Integration

By José L. Arjona
on Tuesday, June 12, 2007
at 05:00 PM

José Luis shall report on the Semantic Web technologies that can be used to solve integration problems.

Introduction to Enterprise Application Integration

By José L. Arjona
on Monday, June 11, 2007
at 05:00 PM

José Luis shall introduce the field of enterprise application integration and shall motivate the need for research regarding engineering methods to tackle integration problem effectively.

Using FOIL to Extract Information from Web Pages

By Pablo Palacios
on Thursday, June 07, 2007
at 05:00 PM

Pablo shall report on the well-known first-order learner FOIL; then he shall report on our many ideas to improve it so that it can be used to learn rules that can help extract information from web pages.

Using Wrappers to Extract Information from Web Sites

By Rafael Corchuelo
on Friday, May 11, 2007
at 05:00 PM

Rafael shall introduce wrappers as an effective mechanism to extract structured information from web sites.  He shall also report on a few existing techniques.

Building Bayesian Networks by Means of Genetic Programming

By Francisco Roche
on Friday, April 20, 2007
at 05:00 PM

Paco shall present his ideas on using genetic programming to build complex Bayesian Networks.

Reasoners for the Semantic Web

By Gonzalo Aranda
on Friday, March 23, 2007
at 05:00 PM

Gonzalo shall complement Rafael's talk on the Semantic Web with an insight into the state-of-the-art reasoners. 

Advanced Separation of Concerns

By Antonia M. Reina
on Friday, March 16, 2007
at 07:38 PM

Toñi shall introduce us a well-known topic in the literature about software engineering: separation of concerns.  She shall also report on the current status of her PhD dissertation, which is about separation of concerns in the design of web application navigation.

Building Classifiers by means of Genetic Programming

By Francisco J. Bejarano
on Friday, February 23, 2007
at 05:00 PM

Paco shall talk about genetic programming and how it can used to learn a classifier for complex data.

Supervised Learning

By Pablo Palacios
on Friday, February 09, 2007
at 05:00 PM

This is a two-part seminar in which Pablo shall introduce us to machine learning.  In this part, Pablo shall talk about supervised learning and, amongst the many existing method, he shall report on support vector machines.

Unsupervised Learning

By Pablo Palacios
on Friday, February 02, 2007
at 05:00 PM

This is a two-part seminar in which Pablo shall introduce us to machine learning. In this part, Pablo shall talk about unsupervised learning and clustering, which is one of the most prominent machine-learning methods in this category.

The Semantic Web

By Rafael Corchuelo
on Friday, January 19, 2007
at 05:00 PM

In this talk, Rafael shall introduce the main ideas behind the Semantic Web, whith an emphasis on making it clear what it is and what it is not.