Skip to content

APR-100: CODEX integration support for verificar #78

@noahgift

Description

@noahgift

Summary

Ensure aprender APIs support the CODEX pipeline in verificar for Python-to-Rust training data generation.

Required APIs

  • RandomForestClassifier with predict_proba
  • GradientBoostingClassifier with predict_proba
  • KMeans clustering with centroids() access
  • PCA for dimensionality reduction
  • Expose silhouette_score in metrics (if not already)
  • TF-IDF vectorizer in text module (for code embeddings)

Integration Points

  • verificar VER-050: Quality gate uses RandomForest
  • verificar VER-051: Bug predictor uses GradientBoosting
  • verificar VER-052: Active learning uses KMeans + PCA
  • verificar VER-054: Data quality scoring uses silhouette

Acceptance Criteria

  • All required APIs available and documented
  • Example usage in verificar integration tests
  • No breaking changes to existing APIs

Ref

verificar: docs/specifications/codex-multi-tech-python-to-rust-spec.md

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions