cvpartition | For tall arrays only stratified-HoldOut partitions
are supported. c = cvpartition(group,'HoldOut',p) randomly
partitions observations into a training set and a test set with stratification,
using the class information in group . P is
a scalar such that 0 < P < 1 .
To obtain nonstratified partitions, set a uniform
grouping variable from the data samples. For example, assuming X is
a tall numeric array, you can use groups = X(:,1).*0;
C = cvpartition(groups,'HoldOut',P)
|
fitlm | If any input argument to fitlm is
a tall array, then all of the other inputs must be tall arrays as
well. This includes nonempty variables supplied with the 'Weights' and 'Exclude' name-value
pairs. The 'RobustOpts' name-value pair
is not supported with tall arrays. For tall data, fitlm returns
a CompactLinearModel object that contains most
of the same properties as a LinearModel object.
The main difference is that the compact object is sensitive to memory
requirements. The compact object does not include properties that
include the data, or that include an array of the same size as the
data. The compact object does not contain these LinearModel properties: Diagnostics
Fitted
ObservationInfo
ObservationNames
Residuals
Steps
Variables
You can compute the residuals directly from the compact
object returned by LM = fitlm(X,Y) using RES = Y - predict(LM,X);
S = LM.RMSE;
histogram(RES,linspace(-3*S,3*S,51))
If the CompactLinearModel object
is missing lower order terms that include categorical factors:
|
fitglm | If any input argument to fitglm is
a tall array, then all of the other inputs must be tall arrays as
well. This includes nonempty variables supplied with the 'Weights' , 'Exclude' , 'Offset' ,
and 'BinomialSize' name-value pairs. The default number of iterations is 5. You can change
the number of iterations using the 'Options' name-value
pair to pass in an options structure. Create an options structure
using statset to specify a different value for MaxIter . For tall data, fitglm returns
a CompactGeneralizedLinearModel object that contains
most of the same properties as a GeneralizedLinearModel object.
The main difference is that the compact object is sensitive to memory
requirements. The compact object does not include properties that
include the data, or that include an array of the same size as the
data. The compact object does not contain these GeneralizedLinearModel properties: Diagnostics
Fitted
Offset
ObservationInfo
ObservationNames
Residuals
Steps
Variables
You can compute the residuals directly from the compact
object returned by GLM = fitglm(X,Y) using RES = Y - predict(GLM,X);
S = sqrt(GLM.SSE/GLM.DFE);
histogram(RES,linspace(-3*S,3*S,51))
|