Evaluation methods and NeuMF in NeuRec3.X

I find that the evaluation always runs for every user. Do you have an example of evaluation on sampled items?

I know that these sampled metrics have some bias, but some papers still use them.
![image](https://user-images.githubusercontent.com/22851795/106594953-cb44a600-658d-11eb-9311-425b335b23b7.png)
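
To make clear what I mean by evaluation on sampled items, here is a rough sketch of the protocol as I understand it: one held-out positive item per user is ranked against a set of randomly sampled negative items. The function names `sample_negatives` and `hit_and_ndcg_at_k` are made up for illustration and are not part of NeuRec, and the 99 negatives, `k=10`, and the random scores are just assumed values standing in for a real model.

```python
import math
import random


def sample_negatives(num_items, interacted, num_neg, rng):
    """Draw num_neg distinct item ids that are not in `interacted`."""
    candidates = [i for i in range(num_items) if i not in interacted]
    return rng.sample(candidates, num_neg)


def hit_and_ndcg_at_k(scores, pos_item, candidates, k):
    """Rank `candidates` by score and return (HR@k, NDCG@k) for one positive item."""
    ranked = sorted(candidates, key=lambda i: scores[i], reverse=True)
    top_k = ranked[:k]
    if pos_item not in top_k:
        return 0.0, 0.0
    rank = top_k.index(pos_item)  # 0-based position of the positive item
    return 1.0, 1.0 / math.log2(rank + 2)


# Toy usage for a single user (all values below are assumptions).
rng = random.Random(0)
num_items = 1000
train_items = {3, 17, 42}   # items the user interacted with in training
pos_item = 99               # held-out test item
negatives = sample_negatives(num_items, train_items | {pos_item}, 99, rng)
scores = {i: rng.random() for i in range(num_items)}  # stand-in for model scores
hr, ndcg = hit_and_ndcg_at_k(scores, pos_item, [pos_item] + negatives, k=10)
```

Averaging `hr` and `ndcg` over all test users would give the sampled HR@10 and NDCG@10 that I am asking about.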

Also, why is there no NeuMF implementation in NeuRec3.X? Is it too slow to evaluate on all negative items?