Similarity calculation for mathematical expressions plays an important role in information retrieval, but existing methods seldom consider how the focus of a mathematical expression affects the accuracy of the similarity calculation. To address this problem, a method for calculating the similarity of mathematical expressions based on focus clustering is proposed. Because the focus of an expression is highly subjective, mapping rules for expression elements are defined, and the K-means++ algorithm is used to cluster mathematical expressions by their operators, from which the focus clusters of mathematical expressions are derived. Based on these focus clusters, a genetic algorithm is used to optimize the relevant parameters of the similarity calculation method, strengthening the influence of the focus on the similarity results. Comparative experiments show that the method improves similarity calculation performance and yields a more satisfactory list of result expressions.
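
As a rough illustration of the operator-based clustering step, the sketch below maps each tokenized expression to a vector of operator counts and groups the vectors with K-means++ seeding followed by standard Lloyd iterations. The operator vocabulary `OPERATORS`, the count-vector mapping in `operator_vector`, and the toy expressions are assumptions made for this example only; they are not the mapping rules or data defined in the paper, and the genetic-algorithm parameter tuning is not shown.

```python
import random
from collections import Counter

# Hypothetical operator vocabulary (assumption, not the paper's mapping rules).
OPERATORS = ["+", "-", "*", "/", "^", "sqrt", "int", "sum"]


def operator_vector(tokens):
    """Map a tokenized expression to a vector of operator counts."""
    counts = Counter(t for t in tokens if t in OPERATORS)
    return [float(counts[op]) for op in OPERATORS]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeanspp_init(points, k, rng):
    """K-means++ seeding: draw each new center with probability
    proportional to its squared distance from the nearest chosen center."""
    centers = [list(rng.choice(points))]
    while len(centers) < k:
        d2 = [min(sq_dist(p, c) for c in centers) for p in points]
        total = sum(d2)
        if total == 0:
            # All points coincide with a center; fall back to a uniform draw.
            centers.append(list(rng.choice(points)))
            continue
        r = rng.uniform(0, total)
        acc = 0.0
        chosen = None
        for p, d in zip(points, d2):
            acc += d
            if d > 0 and acc >= r:
                chosen = p
                break
        if chosen is None:
            # Guard against floating-point round-off: take the farthest point.
            chosen = points[d2.index(max(d2))]
        centers.append(list(chosen))
    return centers


def kmeans(points, k, iters=50, seed=0):
    """Lloyd's algorithm started from K-means++ seeds."""
    rng = random.Random(seed)
    centers = kmeanspp_init(points, k, rng)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center.
        labels = [min(range(k), key=lambda j: sq_dist(p, centers[j]))
                  for p in points]
        # Recompute each center as the mean of its members.
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append([sum(col) / len(members)
                                    for col in zip(*members)])
            else:
                new_centers.append(centers[j])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers


# Toy expressions (assumed): two dominated by addition, two by integration.
expressions = [
    ["a", "+", "b"],
    ["x", "+", "y", "+", "z"],
    ["int", "f", "x"],
    ["int", "int", "g"],
]
vectors = [operator_vector(e) for e in expressions]
labels, centers = kmeans(vectors, k=2)
```

In this toy setting the two addition-dominated expressions fall into one cluster and the two integration-dominated expressions into the other, which is the kind of grouping a focus cluster is meant to capture. Raw counts are the simplest possible feature choice; weighted or normalized features would be a natural alternative.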

