Proper data sharing essential for language models
SONG CHEN/CHINA DAILY The potential for artificial intelligence to improve lives has captured the attention of governments across the world. There are a range of challenges involved in doing this including sharing sensitive or proprietary data sets, ensuring the outcomes truly benefit human beings, and designing policies that can make all of this possible. Sharing data sets for training AI large language models is a particularly powerful and yet sensitive issue. AI analysis of those data sets could bring benefits in a fraction of the time otherwise required. If governments remain focused on using AI to address human-centric goals, the significant benefits of shared data sets could not only set us up for technological innovation, but also sufficiently bind us together in ways that make continued international cooperation the bedrock of that success.