This folder contains the training and external test datasets. It also contains the newly identified articles, the extracted drug and protein entities, and the predicted assay formats. Using manual curation, we also validated 100 DTI articles, drawn from the 0.316 million PubTator articles that are predicted as DrugTarget articles and contain both drug and protein entities. We confirmed that the abstracts of all 100 articles contain relationship words (such as inhibition or binding).