Copyright | (c) 2013-2021 Brendan Hay |
---|---|
License | Mozilla Public License, v. 2.0. |
Maintainer | Brendan Hay <brendan.g.hay+amazonka@gmail.com> |
Stability | auto-generated |
Portability | non-portable (GHC extensions) |
Safe Haskell | None |
Synopsis
- data Channel = Channel' {}
- newChannel :: Text -> DataSource -> Channel
- channel_shuffleConfig :: Lens' Channel (Maybe ShuffleConfig)
- channel_recordWrapperType :: Lens' Channel (Maybe RecordWrapper)
- channel_inputMode :: Lens' Channel (Maybe TrainingInputMode)
- channel_compressionType :: Lens' Channel (Maybe CompressionType)
- channel_contentType :: Lens' Channel (Maybe Text)
- channel_channelName :: Lens' Channel Text
- channel_dataSource :: Lens' Channel DataSource
Documentation
A channel is a named input source that training algorithms can consume.
See: newChannel
smart constructor.
Channel' | |
|
Instances
Create a value of Channel
with all optional fields omitted.
Use generic-lens or optics to modify other optional fields.
The following record fields are available, with the corresponding lenses provided for backwards compatibility:
$sel:shuffleConfig:Channel'
, channel_shuffleConfig
- A configuration for a shuffle option for input data in a channel. If you
use S3Prefix
for S3DataType
, this shuffles the results of the S3 key
prefix matches. If you use ManifestFile
, the order of the S3 object
references in the ManifestFile
is shuffled. If you use
AugmentedManifestFile
, the order of the JSON lines in the
AugmentedManifestFile
is shuffled. The shuffling order is determined
using the Seed
value.
For Pipe input mode, shuffling is done at the start of every epoch. With
large datasets this ensures that the order of the training data is
different for each epoch, it helps reduce bias and possible overfitting.
In a multi-node training job when ShuffleConfig is combined with
S3DataDistributionType
of ShardedByS3Key
, the data is shuffled
across nodes so that the content sent to a particular node on the first
epoch might be sent to a different node on the second epoch.
$sel:recordWrapperType:Channel'
, channel_recordWrapperType
- Specify RecordIO as the value when input data is in raw format but the
training algorithm requires the RecordIO format. In this case, Amazon
SageMaker wraps each individual S3 object in a RecordIO record. If the
input data is already in RecordIO format, you don't need to set this
attribute. For more information, see
Create a Dataset Using RecordIO.
In File mode, leave this field unset or set it to None.
$sel:inputMode:Channel'
, channel_inputMode
- (Optional) The input mode to use for the data channel in a training job.
If you don't set a value for InputMode
, Amazon SageMaker uses the
value set for TrainingInputMode
. Use this parameter to override the
TrainingInputMode
setting in a AlgorithmSpecification request when you
have a channel that needs a different input mode from the training
job's general setting. To download the data from Amazon Simple Storage
Service (Amazon S3) to the provisioned ML storage volume, and mount the
directory to a Docker volume, use File
input mode. To stream data
directly from Amazon S3 to the container, choose Pipe
input mode.
To use a model for incremental training, choose File
input model.
$sel:compressionType:Channel'
, channel_compressionType
- If training data is compressed, the compression type. The default value
is None
. CompressionType
is used only in Pipe input mode. In File
mode, leave this field unset or set it to None.
$sel:contentType:Channel'
, channel_contentType
- The MIME type of the data.
$sel:channelName:Channel'
, channel_channelName
- The name of the channel.
$sel:dataSource:Channel'
, channel_dataSource
- The location of the channel data.
channel_shuffleConfig :: Lens' Channel (Maybe ShuffleConfig) Source #
A configuration for a shuffle option for input data in a channel. If you
use S3Prefix
for S3DataType
, this shuffles the results of the S3 key
prefix matches. If you use ManifestFile
, the order of the S3 object
references in the ManifestFile
is shuffled. If you use
AugmentedManifestFile
, the order of the JSON lines in the
AugmentedManifestFile
is shuffled. The shuffling order is determined
using the Seed
value.
For Pipe input mode, shuffling is done at the start of every epoch. With
large datasets this ensures that the order of the training data is
different for each epoch, it helps reduce bias and possible overfitting.
In a multi-node training job when ShuffleConfig is combined with
S3DataDistributionType
of ShardedByS3Key
, the data is shuffled
across nodes so that the content sent to a particular node on the first
epoch might be sent to a different node on the second epoch.
channel_recordWrapperType :: Lens' Channel (Maybe RecordWrapper) Source #
Specify RecordIO as the value when input data is in raw format but the training algorithm requires the RecordIO format. In this case, Amazon SageMaker wraps each individual S3 object in a RecordIO record. If the input data is already in RecordIO format, you don't need to set this attribute. For more information, see Create a Dataset Using RecordIO.
In File mode, leave this field unset or set it to None.
channel_inputMode :: Lens' Channel (Maybe TrainingInputMode) Source #
(Optional) The input mode to use for the data channel in a training job.
If you don't set a value for InputMode
, Amazon SageMaker uses the
value set for TrainingInputMode
. Use this parameter to override the
TrainingInputMode
setting in a AlgorithmSpecification request when you
have a channel that needs a different input mode from the training
job's general setting. To download the data from Amazon Simple Storage
Service (Amazon S3) to the provisioned ML storage volume, and mount the
directory to a Docker volume, use File
input mode. To stream data
directly from Amazon S3 to the container, choose Pipe
input mode.
To use a model for incremental training, choose File
input model.
channel_compressionType :: Lens' Channel (Maybe CompressionType) Source #
If training data is compressed, the compression type. The default value
is None
. CompressionType
is used only in Pipe input mode. In File
mode, leave this field unset or set it to None.
channel_dataSource :: Lens' Channel DataSource Source #
The location of the channel data.