You have to have so called "ground truth" - manually checked correspondences or transformation matrix (fundamental or homography) between two images. Correspondences which are consistent with this matrix are correct.
Check approach used in classical papers by Mykolajczyk et al. "A comparison of affine region detectors", "A PERFORMANCE EVALUATION OF LOCAL DESCRIPTORS" and Moreels and Perona "Evaluation of Features Detectors and Descriptors based on 3D Objects"