This is the official GitHub repository for LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation.
LLM-as-an-Interviewer is an evaluation framework that assesses the capabilities of LLMs through an interview-style process. In this approach, an LLM acting as the interviewer evaluates other LLMs by providing feedback and asking follow-up questions, enabling a more comprehensive assessment of their capabilities.
Our framework includes a flexible pipeline that can be easily adapted to various tasks by incorporating a customized evaluation rubric.
Access the code used to replicate the results discussed in our paper (github).
Explore the framework implementation (github).
The framework is also available as a Python package on PyPI. Install it with:
pip install interview-eval