Microsoft’s deleted Harry Potter AI blog highlights the messy ethics of training large language models on pirated content.
The blog recommended that users learn to train their own AI models by downloading the Harry Potter dataset and then uploading text files to Azure Blob Storage. It included example models based on a ...