Skip to content
View ree2raz's full-sized avatar
🤖
clankers taking over...
🤖
clankers taking over...

Block or report ree2raz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ree2raz/README.md

profile_banner

I ship LLM features into regulated workflows and prove they work the way auditors do.

Verifiable output. Deterministic grading. Traceable failure modes.

Currently shipping: 30-day sprint. Four artifacts across the verifiability stack. New repo and post weekly.


Featured

  • audited-tool-mcp — A compliance-aware MCP server. Seven-step pipeline: RBAC, PII detection, policy enforcement, audit logging. 15-case eval, 100% pass.
  • RegTriage-OpenEnv — An OpenEnv RL environment where the reward signal is auditor approval. 12 tasks, severity-weighted F1, auto-fail caps.
  • rubric-grader-eval — A reference pattern for compiling unstructured rubrics into evaluable schemas. Handles clean CSV, boolean composites, document masquerades.

Production work

Previously at Yactraq Online Inc.: shipped three LLM features to enterprise production — automated quality scoring, real-time compliance assistant, conversational analytics engine. Self-hosted on customer infrastructure. Regulated-industry deployment.

Writing

rituraj.info — notes on production ML, compliance systems, and agent architectures.

Contact

Open to short engagements: verifiability audits, eval-harness builds, fractional engineering for agentic systems in regulated domains.

Pinned Loading

  1. audited-tool-mcp audited-tool-mcp Public

    Python 1

  2. RegTriage-OpenEnv RegTriage-OpenEnv Public

    RegTriage is an OpenEnv RL environment that trains agents to perform regulatory compliance auditing on financial services contact center transcripts

    Python 1

  3. rubric-grader-eval rubric-grader-eval Public

    Python 1