pyOpenSci, Looking for better data splits for #machinelearning? Look no further than astartes, a #pyOpenSci package from Jackson Burns, Kevin Spiekermann, and himaghna!
astartes is an #openscience, #opensource #Python package that implements many similarity- and distance-based algorithms for partitioning data into more challenging splits. The splits work independently of astartes, so you can use them to better assess out-of-sample performance with any ML model of your choice.
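
To give a feel for what a distance-based split looks like, here is a minimal pure-Python sketch of a greedy max-min selection (in the spirit of Kennard-Stone sampling). This is not astartes' implementation or API: the function name `max_min_split` and the toy data are hypothetical, made up for illustration only.

```python
import math

def max_min_split(points, n_train):
    """Greedy max-min selection of training indices (illustrative sketch).

    Assumes at least two points and n_train >= 2. Hypothetical helper,
    not part of astartes.
    """
    n = len(points)
    # Pairwise Euclidean distances between all points.
    dist = [[math.dist(p, q) for q in points] for p in points]
    # Seed the training set with the two points that are farthest apart.
    i, j = max(
        ((a, b) for a in range(n) for b in range(a + 1, n)),
        key=lambda ab: dist[ab[0]][ab[1]],
    )
    train = [i, j]
    remaining = set(range(n)) - {i, j}
    while len(train) < n_train and remaining:
        # Add the point whose nearest already-selected neighbour is farthest away.
        nxt = max(sorted(remaining), key=lambda k: min(dist[k][t] for t in train))
        train.append(nxt)
        remaining.remove(nxt)
    # Whatever was not selected becomes the test set.
    return train, sorted(remaining)

# Toy 2D data, invented for this example.
points = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0), (0.9, 1.0), (0.5, 0.5), (2.0, 2.0)]
train_idx, test_idx = max_min_split(points, n_train=4)
print("train:", train_idx, "test:", test_idx)
```

Because the result is just two lists of indices, it can be fed to any model or framework, which is the point of keeping the split separate from the learner.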