Back to projects

ranking-str-data

Local Java CLI for ranking Y-STR haplotypes against a selected base haplotype. It computes TMRCA-related research metrics and appends ranked results to a semicolon-separated CSV file.

Key Features

  • Ranks Y-STR haplotypes against a uniquely selected base haplotype
  • Computes ASD, TMRCA, lambda, and Klyosov back-mutation correction metrics
  • Reads semicolon-separated CSV files through Spark and writes RankedData.csv next to the input

Technologies

JavaMavenApache SparkJCommanderLog4jJUnit
Local Java CLI for ranking Y-STR haplotypes against a selected base haplotype and appending TMRCA-related research metrics to a semicolon-separated CSV file. The project was built for a genetics research workflow. It reads Y-STR haplotype rows, compares each row with a uniquely selected base haplotype, computes research metrics, and writes a ranked CSV next to the input file. The code implements a linear Y-STR TMRCA calculation based on the Klyosov method with back-mutation correction. This is a local research tool, not a clinical or diagnostic system.