Fitting algebraic curves to noisy data

作者:

Highlights:

摘要

We introduce the following problem which is motivated by applications in vision and pattern detection: We are given pairs of datapoints (x1,y1),(x2,y2),…,(xm,ym)∈[−1,1]×[−1,1], a noise parameter δ>0, a degree bound d, and a threshold ρ>0. We desire an algorithm that enlists every degree d polynomial h such that (1)|h(xi)−yi|⩽δforatleastρfractionoftheindicesi.If δ=0, this is just the list decoding problem that has been popular in complexity theory and for which Sudan gave a poly(m,d) time algorithm. However, for δ>0, the problem as stated becomes ill-posed and one needs a careful reformulation (see the Introduction). We prove a few basic results about this (reformulated) problem. We show that the problem has no polynomial-time algorithm (our counterexample works for ρ=0.5). This is shown by exhibiting an instance of the problem where the number of solutions is as large as exp(d0.5−ε) and every pair of solutions is far from each other in ℓ∞ norm. On the algorithmic side, we give a rigorous analysis of a brute force algorithm that runs in exponential time. Also, in surprising contrast to our lowerbound, we give a polynomial-time algorithm for learning the polynomials assuming the data is generated using a mixture model in which the mixing weights are “nondegenerate.”

论文关键词:Curve Fitting,Noisy polynomial reconstruction,List decoding,Learning theory,Vision

论文评审过程:Received 10 July 2002, Revised 10 December 2002, Available online 19 July 2003.

论文官网地址:https://doi.org/10.1016/S0022-0000(03)00012-6