Text this: A methodology for optimizing the cost matrix in cost sensitive learning models applied to prediction of molecular functions in embryophyta plants