Integrating large-scale genomics data to improve variant interpretation in coding and non-coding regions