Name | Description | Type | Package | Framework |
Aggregate | Methods for performing various types of aggregations over PCollection instances. | Class | org.apache.crunch.lib | Apache Crunch |
|
Aggregate .PairValueComparator | Constructor SummaryAggregate. | Class | org.apache.crunch.lib.Aggregate | Apache Crunch |
|
Aggregate .TopKCombineFn | See Also:Serialized FormConstructor SummaryAggregate. | Class | org.apache.crunch.lib.Aggregate | Apache Crunch |
|
Aggregate .TopKFn | See Also:Serialized FormConstructor SummaryAggregate. | Class | org.apache.crunch.lib.Aggregate | Apache Crunch |
|
BloomFilterJoinStrategy | Join strategy that uses a Bloom filter that is trained on the keys of the left-side table to filter the key/value pairs of the right-side | Class | org.apache.crunch.lib.join | Apache Crunch |
|
Cartesian | Utilities for Cartesian products of two PTable or PCollectionConstructor SummaryCartesian() | Class | org.apache.crunch.lib | Apache Crunch |
|
Channels | Utilities for splitting Pair instances emitted by DoFn into separate PCollection instances. | Class | org.apache.crunch.lib | Apache Crunch |
|
Cogroup | Constructor SummaryCogroup() PTable | Class | org.apache.crunch.lib | Apache Crunch |
|
DefaultJoinStrategy | Default join strategy that simply sends all data through the map, shuffle, and reduce phase. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
Distinct | Functions for computing the distinct elements of a PCollection. | Class | org.apache.crunch.lib | Apache Crunch |
|
FullOuterJoinFn | Used to perform the last step of an full outer join. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
InnerJoinFn | Used to perform the last step of an inner join. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
Join | Utilities for joining multiple PTable instances based on a commonConstructor SummaryJoin() | Class | org.apache.crunch.lib | Apache Crunch |
|
JoinFn | Represents a DoFn for performing joins. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
JoinStrategy | Defines a strategy for joining two PTables together on a common key. | Interface | org.apache.crunch.lib.join | Apache Crunch |
|
JoinType | enum JoinTypeextends EnumSpecifies the specific behavior of how a join should be performed in terms of requiring matching keys | Class | org.apache.crunch.lib.join | Apache Crunch |
|
JoinUtils | Utilities that are useful in joining multiple data sets via a MapReduce. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
JoinUtils .AvroIndexedRecordPartitioner | Constructor SummaryJoinUtils. | Class | org.apache.crunch.lib.join.JoinUtils | Apache Crunch |
|
JoinUtils .AvroPairGroupingComparator | Constructor SummaryJoinUtils. | Class | org.apache.crunch.lib.join.JoinUtils | Apache Crunch |
|
JoinUtils .TupleWritableComparator | Constructor SummaryJoinUtils. | Class | org.apache.crunch.lib.join.JoinUtils | Apache Crunch |
|
JoinUtils .TupleWritablePartitioner | Constructor SummaryJoinUtils. | Class | org.apache.crunch.lib.join.JoinUtils | Apache Crunch |
|
LeftOuterJoinFn | Used to perform the last step of an left outer join. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
Mapred | Static functions for working with legacy Mappers and Reducers that live under the org. | Class | org.apache.crunch.lib | Apache Crunch |
|
Mapreduce | Static functions for working with legacy Mappers and Reducers that live under the org. | Class | org.apache.crunch.lib | Apache Crunch |
|
MapsideJoinStrategy | Utility for doing map side joins on a common key between two PTables. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
OneToManyJoin | Optimized join for situations where exactly one value is being joined with any other number of values based on a common key. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
PTables | Methods for performing common operations on PTables. | Class | org.apache.crunch.lib | Apache Crunch |
|
ReverseAvroComparator | | Class | org.apache.crunch.lib.sort | Apache Crunch |
|
ReverseWritableComparator | | Class | org.apache.crunch.lib.sort | Apache Crunch |
|
RightOuterJoinFn | Used to perform the last step of an right outer join. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
Sample | Methods for performing random sampling in a distributed fashion, either by accepting each record in a PCollection with an independent probability in order to sample some | Class | org.apache.crunch.lib | Apache Crunch |
|
SecondarySort | Utilities for performing a secondary sort on a PTable> collection. | Class | org.apache.crunch.lib | Apache Crunch |
|
Set | Utilities for performing set operations (difference, intersection, etc) on PCollection instances. | Class | org.apache.crunch.lib | Apache Crunch |
|
Shard | Utilities for controlling how the data in a PCollection is balanced across reducersConstructor SummaryShard() | Class | org.apache.crunch.lib | Apache Crunch |
|
ShardedJoinStrategy | JoinStrategy that splits the key space up into shards. | Class | org.apache.crunch.lib.join | Apache Crunch |
|
ShardedJoinStrategy .ShardingStrategy | Determines over how many shards a key will be split in a sharded join. | Interface | org.apache.crunch.lib.join.ShardedJoinStrategy | Apache Crunch |
|
Sort | Utilities for sorting PCollection instances. | Class | org.apache.crunch.lib | Apache Crunch |
|
Sort .ColumnOrder | To sort by column 2 ascending then column 1 descending, you would use: sortPairs(coll, by(2, ASCENDING), by(1, DESCENDING)) | Class | org.apache.crunch.lib.Sort | Apache Crunch |
|
Sort .Order | For signaling the order in which a sort should be done. | Class | org.apache.crunch.lib.Sort | Apache Crunch |
|
SortFns | A set of DoFns that are used by Crunch's Sort library. | Class | org.apache.crunch.lib.sort | Apache Crunch |
|
SortFns .AvroGenericFn | Pulls a composite set of keys from an Avro GenericRecord instance. | Class | org.apache.crunch.lib.sort.SortFns | Apache Crunch |
|
SortFns .KeyExtraction | Utility class for encapsulating key extraction logic and serialization information aboutConstructor SummarySortFns. | Class | org.apache.crunch.lib.sort.SortFns | Apache Crunch |
|
SortFns .SingleKeyFn | Extracts a single indexed key from a Tuple instance. | Class | org.apache.crunch.lib.sort.SortFns | Apache Crunch |
|
SortFns .TupleKeyFn | Extracts a composite key from a Tuple instance. | Class | org.apache.crunch.lib.sort.SortFns | Apache Crunch |
|
TotalOrderPartitioner | | Class | org.apache.crunch.lib.sort | Apache Crunch |
|
TupleWritableComparator | | Class | org.apache.crunch.lib.sort | Apache Crunch |