“...we pre-trained the AgroNT on approximately 10.5 million genomic sequences comprising trillions of base pairs. The data set of 48 species included genomes of row crops, fruits, legumes, vegetables and species important for industry and research, so that it could learn the language of plant DNA.”
Gurnek Singh, Head of Business Development for Applications and Life Sciences, Bio AI at InstaDeep