Implement baseline model + evaluation split

taskdone