On the combinatorics of the 2-class classification problem

Articles

On the combinatorics of the 2-class classification problem

Année

2019

Auteurs

DELLE DONNE Diego, CORRÊA Ricardo, MARENCO Javier

Abstract

A set of points is linearly separable if the convex hulls of and are disjoint, hence there exists a hyperplane separating from . Such a hyperplane provides a method for classifying new points, according to which side of the hyperplane the new points lie. When such a linear separation is not possible, it may still be possible to partition and into prespecified numbers of groups, in such a way that every group from is linearly separable from every group from . We may also discard some points as outliers, and seek to minimize the number of outliers necessary to find such a partition. Based on these ideas, Bertsimas and Shioda proposed the classification and regression by integer optimization (CRIO) method in 2007. In this work we explore the integer programming aspects of the classification part of CRIO, in particular theoretical properties of the associated formulation. We are able to find facet-inducing inequalities coming from the stable set polytope, hence showing that this classification problem has exploitable combinatorial properties.

CORRÊA, R., DELLE DONNE, D. et MARENCO, J. (2019). On the combinatorics of the 2-class classification problem. Discrete Optimization, 31(1), pp. 40-55.

Mots clés

Classification -Integer programming, Polyhedral combinatorics