Copyright | (c) 2013-2021 Brendan Hay |
---|---|
License | Mozilla Public License, v. 2.0. |
Maintainer | Brendan Hay <brendan.g.hay+amazonka@gmail.com> |
Stability | auto-generated |
Portability | non-portable (GHC extensions) |
Safe Haskell | None |
Synopsis
Documentation
data ShuffleConfig Source #
A configuration for a shuffle option for input data in a channel. If you
use S3Prefix
for S3DataType
, the results of the S3 key prefix
matches are shuffled. If you use ManifestFile
, the order of the S3
object references in the ManifestFile
is shuffled. If you use
AugmentedManifestFile
, the order of the JSON lines in the
AugmentedManifestFile
is shuffled. The shuffling order is determined
using the Seed
value.
For Pipe input mode, when ShuffleConfig
is specified shuffling is done
at the start of every epoch. With large datasets, this ensures that the
order of the training data is different for each epoch, and it helps
reduce bias and possible overfitting. In a multi-node training job when
ShuffleConfig
is combined with S3DataDistributionType
of
ShardedByS3Key
, the data is shuffled across nodes so that the content
sent to a particular node on the first epoch might be sent to a
different node on the second epoch.
See: newShuffleConfig
smart constructor.
Instances
Create a value of ShuffleConfig
with all optional fields omitted.
Use generic-lens or optics to modify other optional fields.
The following record fields are available, with the corresponding lenses provided for backwards compatibility:
$sel:seed:ShuffleConfig'
, shuffleConfig_seed
- Determines the shuffling order in ShuffleConfig
value.
shuffleConfig_seed :: Lens' ShuffleConfig Integer Source #
Determines the shuffling order in ShuffleConfig
value.