Illegally scraped, high-quality datasets from premium publishers could be compiled and sold to smaller or unethical AI developers who want training data without paying for licenses.