Skip to content
@wbsg-uni-mannheim

Web-based Systems Group @ University of Mannheim

We explore technical and empirical questions concerning the development of global, decentralized information environments.

Pinned Loading

  1. WDCFramework WDCFramework Public

    Java Framework which is used by the Web Data Commons project to extract Microdata, Microformats and RDFa data, Web graphs, and HTML tables from the web crawls provided by the Common Crawl Foundation.

    Java 9 1

  2. TabAnnGPT TabAnnGPT Public

    This repository contains code and data for reproducing the experiments of three papers that focus on two subtasks of table annotation: column type annotation (CTA) the task of annotating table colu…

    Python 11 2

  3. MatchGPT MatchGPT Public

    This repository contains code and extensive prompt examples to reproduce and extend the experiments in our papers "Using ChatGPT for Entity Matching" and "Entity Matching using Large Language Models".

    Jupyter Notebook 59 12

  4. ExtractGPT ExtractGPT Public

    Attribute Value Extraction using Large Language Models

    Python 26 11

  5. wdcproducts wdcproducts Public

    This repository contains the code and data download links to reproduce building the WDC Products Benchmark.

    Python 13 4

  6. WebMall WebMall Public

    This repository contains the code and data of the WebMall benchmark for evaluating the capability of Web agents to find and compare product offers from multiple e-shops.

    HTML 3 2

Repositories

Showing 10 of 32 repositories

Top languages

Loading…

Most used topics

Loading…