Create Basis Set for Functional Data Analysis
Functional data analysis proceeds by selecting a finite basis set and
fitting data to it. The current fda
package supports fitting
via least squares penalized with lambda times the integral over the
(finite) support of the basis set of the squared deviations from a
linear differential operator.
The most commonly used basis in fda
is probably B-splines. For
periodic phenomena, Fourier bases are quite useful. A constant basis
is provided to facilitation arithmetic with functional data objects.
To restrict attention to solutions of certain differential equations,
it may be useful to use a corresponding basis set such as exponential,
monomial or power basis sets.
Power bases support the use of negative and fractional powers, while monomial bases are restricted only to nonnegative integer exponents.
The polygonal basis is essentially a B-spline of order 2, degree 1.
The following summarizes arguments used by some or all of the current
create.basis
functions:
rangeval a vector of length 2 giving the lower and upper limits of the range of permissible values for the function argument.
For bspline
bases, this can be inferred from
range(breaks). For polygonal
bases, this can be inferred
from range(argvals). In all other cases, this defaults to 0:1.
nbasis an integer giving the number of basis functions.
This is not used for two of the create.basis
functions:
For constant
this is 1, so there is no need to specify it.
For polygonal
bases, it is length(argvals), and again there
is no need to specify it.
For bspline
bases, if nbasis
is not specified, it
defaults to (length(breaks) + norder - 2) if breaks
is
provided. Otherwise, nbasis
defaults to 20 for
bspline
bases.
For exponential
bases, if nbasis
is not specified,
it defaults to length(ratevec) if ratevec
is provided.
Otherwise, in fda_2.0.2
, ratevec
defaults to 1,
which makes nbasis
= 1; in fda_2.0.4
,
ratevec
will default to 0:1, so nbasis
will then
default to 2.
For monomial
and power
bases, if nbasis
is
not specified, it defaults to length(exponents) if
exponents
is provided. Otherwise, nbasis
defaults
to 2 for monomial
and power
bases. (Temporary
exception: In fda_2.0.2
, the default nbasis
for
power
bases is 1. This will be increased to 2 in
fda_2.0.4
.)
In addition to rangeval
and nbasis
, all but
constant
bases have one or two parameters unique to that
basis type or shared with one other:
bspline
Argument norder
= the order of the spline, which is one
more than the degree of the polynomials used. This defaults to
4, which gives cubic splines.
Argument breaks
= the locations of the break or join
points; also called knots
. This defaults to
seq(rangeval[1], rangeval[2], nbasis-norder+2).
polygonal
Argument argvals
= the locations of the break or join
points; also called knots
. This defaults to
seq(rangeval[1], rangeval[2], nbasis).
fourier
Argument period
defaults to diff(rangeval).
exponential
Argument ratevec
. In fda_2.0.2
, this defaulted to
1. In fda_2.0.3
, it will default to 0:1.
monomial, power
Argument exponents
. Default = 0:(nbasis-1). For
monomial
bases, exponents
must be distinct
nonnegative integers. For power
bases, they must be
distinct real numbers.
Beginning with fda_2.1.0
, the last 6 arguments for all the
create.basis
functions will be as follows; some but not all
are available in the previous versions of fda
:
dropind a vector of integers specifiying the basis functions to be dropped, if any.
quadvals
a matrix with two columns and a number of rows equal to the
number of quadrature points for numerical evaluation of the
penalty integral. The first column of quadvals
contains
the quadrature points, and the second column the quadrature
weights. A minimum of 5 values are required for each inter-knot
interval, and that is often enough. For Simpson's rule, these
points are equally spaced, and the weights are proportional to
1, 4, 2, 4, ..., 2, 4, 1.
values
a list of matrices with one row for each row of quadvals
and one column for each basis function. The elements of the
list correspond to the basis functions and their derivatives
evaluated at the quadrature points contained in the first column
of quadvals
.
basisvalues
A list of lists, allocated by code such as vector("list",1).
This field is designed to avoid evaluation of a basis system
repeatedly at a set of argument values. Each list within the
vector corresponds to a specific set of argument values, and
must have at least two components, which may be tagged as you
wish. 'The first component in an element of the list vector
contains the argument values. The second component in an
element of the list vector contains a matrix of values of the
basis functions evaluated at the arguments in the first
component. The third and subsequent components, if present,
contain matrices of values their derivatives up to a maximum
derivative order. Whenever function getbasismatrix is called,
it checks the first list in each row to see, first, if the
number of argument values corresponds to the size of the first
dimension, and if this test succeeds, checks that all of the
argument values match. This takes time, of course, but is much
faster than re-evaluation of the basis system. Even this time
can be avoided by direct retrieval of the desired array. For
example, you might set up a vector of argument values called
"evalargs" along with a matrix of basis function values for
these argument values called "basismat". You might want too use
tags like "args" and "values", respectively for these. You
would then assign them to basisvalues
with code such as
the following:
basisobj\$basisvalues <- vector("list",1)
basisobj\$basisvalues[[1]] <- list(args=evalargs, values=basismat)
names either a character vector of the same length as the number of basis functions or a simple stem used to construct such a vector.
For bspline
bases, this defaults to paste('bspl', norder,
'.', 1:nbreaks, sep=”).
For other bases, there are crudely similar defaults.
axes
an optional list used by selected plot
functions to
create custom axes
. If this axes
argument is not
NULL, functions plot.basisfd
, plot.fd
,
plot.fdSmooth
plotfit.fd
, plotfit.fdSmooth
,
and plot.Lfd
will create axes via
do.call(x$axes[[1]], x$axes[-1])
. The primary example of
this is to create CanadianWeather
plots using
list("axesIntervals")
J. O. Ramsay and Spencer Graves
Ramsay, James O., and Silverman, Bernard W. (2006), Functional Data Analysis, 2nd ed., Springer, New York.
Ramsay, James O., and Silverman, Bernard W. (2002), Applied Functional Data Analysis, Springer, New York.
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.