I have a large data set of integers (about several Million), all of them are positive integers from experiment. They are stored on a file on local machine. I want to select the largest 1000 values then make analysis based on the 1000 values.
Could anyone suggest some efficient solutions? I think sorting the several M data in memory is not feasible.
thanks in advance,