PerClassTransformer#

class skfda.preprocessing.feature_construction.PerClassTransformer(transformer, *, array_output=False)[source]#

Per class feature transformer for functional data.

This class takes a transformer and performs the following map:

\[\begin{split}\mathcal{X} &\rightarrow \mathbb{R}^G \\ x &\rightarrow \textbf{t} = (T_1(x), T_2(x),...,T_k(x))\end{split}\]

Where \(T_i(x)\) is the transformation \(x\) with respect to the data in the \(i\)-th group.

Note that \(\mathcal{X}\) is possibly multivariate, that is, \(\mathcal{X} = \mathcal{X}_1 \times ... \times \mathcal{X}_p\).

Parameters:

transformer (TransformerMixin[Input, TransformerOutput, object]) – The transformer that we want to apply to the given data. It should use target data while fitting. This is checked by looking at the ‘stateless’ and ‘requires_y’ tags
array_output (bool) – Indicates if the transformed data is requested to be a NumPy array output. By default the value is False.

Examples

Firstly, we will import the Berkeley Growth Study dataset:

>>> from skfda.datasets import fetch_growth
>>> X, y = fetch_growth(return_X_y=True, as_frame=True)
>>> X = X.iloc[:, 0].values
>>> y = y.values.codes

>>> from skfda.preprocessing.feature_construction import (
...     PerClassTransformer,
... )

Then we will need to select a fda transformer, and so we will use RecursiveMaximaHunting. We need to fit the data and transform it:

>>> from skfda.preprocessing.dim_reduction.variable_selection import (
...     RecursiveMaximaHunting,
... )
>>> t1 = PerClassTransformer(
...     RecursiveMaximaHunting(),
...     array_output=True,
... )
>>> x_transformed1 = t1.fit_transform(X, y)

x_transformed1 will be a vector with the transformed data. We will split the generated data and fit a KNN classifier.

>>> from sklearn.model_selection import train_test_split
>>> from sklearn.neighbors import KNeighborsClassifier
>>> X_train1, X_test1, y_train1, y_test1 = train_test_split(
...     x_transformed1,
...     y,
...     test_size=0.25,
...     stratify=y,
...     random_state=0,
... )
>>> neigh1 = KNeighborsClassifier()
>>> neigh1 = neigh1.fit(X_train1, y_train1)

Finally we can predict and check the score:

>>> neigh1.predict(X_test1)
array([ 0,  0,  1,  0,  1,  1,  1,  0,  0,  0,  0,  1,  1,  0,  0,  0,
        0, 1,  1,  1,  1,  1,  1,  1], dtype=int8)

>>> round(neigh1.score(X_test1, y_test1), 3)
0.958

We can also use a transformer that returns a FData object when predicting. In our example we are going to use the Fisher Rao Elastic Registration.

>>> from skfda.preprocessing.registration import (
...     FisherRaoElasticRegistration,
... )
>>> t2 = PerClassTransformer(
...     FisherRaoElasticRegistration(),
... )
>>> x_transformed2 = t2.fit_transform(X, y)

x_transformed2 will be a DataFrame with the transformed data. Each column of the frame contains a FDataGrid describing a transformed curve. Now we are able to use it to fit a KNN classifier. Again we split the data into train and test.

>>> X_train2, X_test2, y_train2, y_test2 = train_test_split(
...     x_transformed2.iloc[:, 0].values,
...     y,
...     test_size=0.25,
...     stratify=y,
...     random_state=0,
... )

This time we need a functional data classifier. We fit the classifier and predict.

>>> from skfda.ml.classification import KNeighborsClassifier
>>> neigh2 = KNeighborsClassifier()
>>> neigh2 = neigh2.fit(X_train2, y_train2)
>>> neigh2.predict(X_test2)
array([ 1,  1,  1,  1,  1,  1,  1,  0,  0,  0,  0,  1,  1,  0,  0,  0,
        0,  1,  1,  1,  1,  0,  1,  1], dtype=int8)

>>> round(neigh2.score(X_test2, y_test2), 3)
0.875

Methods

`fit`(X, y)	Fit the model on each class.
`fit_transform`(X, y)	Fits and transforms the provided data.
`get_metadata_routing`()	Get metadata routing of this object.
`get_params`([deep])	Get parameters for this estimator.
`set_output`(*[, transform])	Set output container.
`set_params`(**params)	Set the parameters of this estimator.
`transform`(X[, y])	Transform the provided data using the already fitted transformer.

fit(X, y)[source]#

Fit the model on each class.

It uses X as training data and y as target values.

Parameters:

X (Input) – FDataGrid with the training data.
y (ndarray[tuple[Any, ...], dtype[integer[Any]]]) – Target values of shape = (n_samples).

Returns:

self

Return type:

PerClassTransformer[Input, Output]

fit_transform(X, y)[source]#

Fits and transforms the provided data.

It uses the transformer specified when initializing the class.

Parameters:

X (Input) – FDataGrid with the samples.
y (ndarray[tuple[Any, ...], dtype[integer[Any]]]) – Target values of shape = (n_samples)

Returns:

Eiter array of shape (n_samples, G) or a Data Frame including the transformed data.

Return type:

Output

get_metadata_routing()#

Get metadata routing of this object.

Please check User Guide on how the routing mechanism works.

Returns:: routing – A MetadataRequest encapsulating routing information.
Return type:: MetadataRequest

get_params(deep=True)#

Get parameters for this estimator.

Parameters:: deep (bool, default=True) – If True, will return the parameters for this estimator and contained subobjects that are estimators.
Returns:: params – Parameter names mapped to their values.
Return type:: dict

set_output(*, transform=None)#

Set output container.

See Introducing the set_output API for an example on how to use the API.

Parameters:

transform ({"default", "pandas", "polars"}, default=None) –

Configure output of transform and fit_transform.

”default”: Default output format of a transformer
”pandas”: DataFrame output
”polars”: Polars output
None: Transform configuration is unchanged

Added in version 1.4: “polars” option was added.

Returns:

self – Estimator instance.

Return type:

estimator instance

set_params(**params)#

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as Pipeline). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Parameters:: **params (dict) – Estimator parameters.
Returns:: self – Estimator instance.
Return type:: estimator instance

transform(X, y=None)[source]#

Transform the provided data using the already fitted transformer.

Parameters:

X (Input) – FDataGrid with the test samples.
y (object)

Returns:

Eiter array of shape (n_samples, G) or a Data Frame including the transformed data.

Return type:

Output

Examples using `skfda.preprocessing.feature_construction.PerClassTransformer`#

Depth based classification

PerClassTransformer#

Examples using skfda.preprocessing.feature_construction.PerClassTransformer#

This Page

Examples using `skfda.preprocessing.feature_construction.PerClassTransformer`#