Influence Function Data evaluation implementation.
TODO it may be useful to compute gradients of the validation dataset in batches
to save time/space.
TODO H^{-1} implementation, Current implementation is for first-order gradients
Number of models to fit to take to find data values, by default 1000
proportionfloat, optional
Proportion of data points to be in each sample, cardinality of each subset is
\((p)(num_points)\), by default 0.7 as specified by V. Feldman and C. Zhang
Compute data values using the Influence Subsample data valuator. Finds
the difference of average performance of all sets including data point minus
not-including.
Trains the Influence Subsample Data Valuator by sampling from subsets of
\((p)(num_points)\) cardinality and computing the performance with the
\(i\) data point and without the \(i\) data point. The form of sampling
is similar to the shapely value when \(p\) is :math:`0.5: (V. Feldman).
Likewise, if we sample not from the subsets of a specific cardinality but the
uniform across all subsets, it is similar to the Banzhaf value.
NOTE InfluenceSubsample was previously named InfluenceFunctionEval and may be
referred to as such in the demos. This docstring is here to clarify the confusion
between the naming of InfluenceFunction and InfluenceSubsample.