Dataset: ProteOMZ identified protein sequences