Skip to content
Change the repository type filter

All

    Repositories list

    • verdict

      Public
      Inference-time scaling for LLMs-as-a-judge.
      Jupyter Notebook
      2431631Updated Nov 5, 2025Nov 5, 2025
    • annotate

      Public
      Skill to annotate and create ai judges from agent logs
      TypeScript
      11501Updated Oct 28, 2025Oct 28, 2025
    • j1-micro

      Public
      j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
      Python
      69900Updated Jul 19, 2025Jul 19, 2025
    • spoken

      Public
      a single interface around speech-to-speech foundation models
      Python
      22700Updated Jun 27, 2025Jun 27, 2025
    • nyc is so back
      11900Updated Jun 27, 2025Jun 27, 2025
    • ⚖️ Awesome LLM Judges ⚖️
      614600Updated Apr 28, 2025Apr 28, 2025
    • get-haized

      Public
      A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.
      1210000Updated Apr 13, 2025Apr 13, 2025
    • dspy-redteam

      Public
      Red-Teaming Language Models with DSPy
      Python
      2424720Updated Feb 13, 2025Feb 13, 2025
    • sphynx

      Public
      Sphynx Hallucination Induction
      Python
      25300Updated Jan 31, 2025Jan 31, 2025
    • A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
      Python
      6356710Updated Jan 26, 2025Jan 26, 2025
    • Jupyter Notebook
      62610Updated Oct 22, 2024Oct 22, 2024
    • MongoDB + Haize = Safe & Secure RAG with RBAC
      Jupyter Notebook
      51300Updated Oct 12, 2024Oct 12, 2024
    • Python
      64900Updated Aug 3, 2024Aug 3, 2024
    • Thorn in a HaizeStack test for evaluating long-context adversarial robustness.
      Python
      12600Updated Aug 3, 2024Aug 3, 2024
    • Python
      41600Updated May 30, 2024May 30, 2024
    • An ensembled perplexity API.
      Python
      0200Updated May 2, 2024May 2, 2024