job-match-ml

How to install dependencies

To install the project dependencies, run:

pip install -r src/requirements.txt

How to run the full pipeline

You can run the whole project with:

kedro run

How to run only part of the pipeline

The pipeline consists of two parts:

Data processing pipeline
Data science pipeline

The data processing pipeline extracts and transforms raw data from the HH resume dataset and from an open API with vacancies.

The data science pipeline evaluates different methods of measuring sentence similarity.
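As a rough illustration of what sentence similarity means here, the sketch below scores one resume sentence against two vacancy sentences using the cosine similarity of their embedding vectors. The vectors are made-up toy values and `cosine_similarity` is a hypothetical helper, not code from this repository. In the real pipeline the embeddings would come from one of the evaluated models.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Convention chosen for this sketch: a zero vector is similar to nothing.
        return 0.0
    return dot / (norm_a * norm_b)


# Toy embeddings (illustrative values only, not model output).
resume_vec = [1.0, 2.0, 0.0]
matching_vacancy_vec = [2.0, 4.0, 0.0]   # same direction as the resume
unrelated_vacancy_vec = [0.0, 0.0, 3.0]  # orthogonal to the resume

print(cosine_similarity(resume_vec, matching_vacancy_vec))   # close to 1.0
print(cosine_similarity(resume_vec, unrelated_vacancy_vec))  # 0.0
```

A higher score means the two sentences point in a more similar direction in embedding space, which is the signal the evaluation compares against the labelled pairs.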

Data processing pipeline

To run the data processing pipeline, run:

kedro run --pipeline data_engineering
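If you prefer to start a single pipeline from a Python script (for example in a scheduled job), the command above can be wrapped with `subprocess`. This is a minimal sketch: `kedro_run_command` and `run_pipeline` are hypothetical helper names, and the script is assumed to be started from the project root with Kedro installed.

```python
import subprocess
from typing import List, Optional


def kedro_run_command(pipeline: Optional[str] = None) -> List[str]:
    """Build the `kedro run` command line, optionally for one named pipeline."""
    command = ["kedro", "run"]
    if pipeline is not None:
        command += ["--pipeline", pipeline]
    return command


def run_pipeline(pipeline: Optional[str] = None) -> None:
    """Run Kedro in the current directory and raise if it exits with an error."""
    subprocess.run(kedro_run_command(pipeline), check=True)


# Show the command that would be executed for the data processing pipeline.
print(" ".join(kedro_run_command("data_engineering")))
# To actually run it, call: run_pipeline("data_engineering")
```

Passing the command as a list avoids shell quoting issues, and `check=True` makes a failed pipeline run raise `subprocess.CalledProcessError` instead of passing silently.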

Data science pipeline

Download the validation set from Google Drive.
Place it in the data/03_primary directory.
Run the data science pipeline with:

kedro run --pipeline data_science
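Before starting the data science pipeline, it can help to confirm that the validation set really is in data/03_primary. The sketch below only lists what is in that directory. The validation file name is not stated in this README, so no specific name is checked, and `list_primary_files` is a hypothetical helper.

```python
from pathlib import Path
from typing import List


def list_primary_files(project_root: Path) -> List[Path]:
    """Return the visible files in data/03_primary, sorted by name."""
    primary_dir = project_root / "data" / "03_primary"
    if not primary_dir.is_dir():
        return []
    # Skip hidden placeholder files such as .gitkeep (assumed convention).
    return sorted(
        p for p in primary_dir.iterdir()
        if p.is_file() and not p.name.startswith(".")
    )


# Assumes the script is run from the project root.
files = list_primary_files(Path("."))
if files:
    print("Found in data/03_primary:", [p.name for p in files])
else:
    print("data/03_primary is empty or missing; download the validation set first.")
```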

Results

Evaluation results:

e5: https://app.clear.ml/projects/8e7a87fb96ed45a3951f29c5ed13cd65/experiments/00a1ced96ca24a358534debe15c36a7f/output/execution

mp5: https://app.clear.ml/projects/8e7a87fb96ed45a3951f29c5ed13cd65/experiments/3591761e3a3e461e8370ee89588ed4eb/output/execution

navec: https://app.clear.ml/projects/8e7a87fb96ed45a3951f29c5ed13cd65/experiments/7b8abea984984056853cb70ed4fa677a/output/execution

The main metric was ROC-AUC; by that metric, the best model was intfloat/multilingual-e5-large.
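For readers unfamiliar with the metric, ROC-AUC can be read as the probability that a randomly chosen matching pair gets a higher similarity score than a randomly chosen non-matching pair. The sketch below computes it directly from that definition. The labels and scores are toy values, not results from this project, and `roc_auc` is a hypothetical helper rather than the evaluation code used in the pipeline.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC-AUC as the fraction of (positive, negative) pairs ranked correctly.

    A tie between a positive and a negative score counts as half a correct pair.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")
    correct = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(positives) * len(negatives))


# Toy data: 1 = resume and vacancy match, 0 = they do not.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.5, 0.1]
print(roc_auc(labels, scores))  # 0.75: three of the four pairs are ordered correctly
```

This pairwise loop is quadratic in the number of examples and is meant only to show the idea; for real evaluation sets, a library routine such as scikit-learn's `roc_auc_score` is the usual choice.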

AstuteVisionDL/resume-matching-ml
