Open
Description
Hi there team code2vec,
I am working on a personal project. My aim is to store a Java codebase in a vector database to run similarity searches and retrieve code files from the db relevant to my query. Queries can be of the type:
- Method creating database pool connection.
- Entity class linked to 'Subjects' table
Basically a query will be an activity performed by the codebase and I should return the package, classname, (and method if required).
My plan is to vectorize these search queries using a vectorizer present in your codebase, perform similarity search and return results.
My questions are:
- How can I generate vectors for Java code using a your pretrained model?
- Will it be a good idea to vectorize an English query for similarity search?
Metadata
Metadata
Assignees
Labels
No labels