Pigpen 0.3.3
Installation
To install, add the following dependency to your project or build file:
[pigpen "0.3.3"]
Namespaces
pigpen.cascading
Functions to convert a PigPen query into a Cascading flow.
Public variables and functions:
pigpen.core
The core PigPen operations. These are the primary functions that you use to build a PigPen query.
Public variables and functions:
- cogroup
- concat
- constantly
- difference
- difference-multiset
- distinct
- dump
- filter
- filter-by
- fold
- group-by
- intersection
- intersection-multiset
- into
- join
- keys-fn
- load-clj
- load-csv
- load-json
- load-lazy
- load-string
- load-tsv
- map
- map-indexed
- mapcat
- reduce
- remove
- remove-by
- return
- sample
- sort
- sort-by
- store-clj
- store-json
- store-many
- store-string
- store-tsv
- take
- union
- union-multiset
pigpen.fold
Fold operations for use with pig/fold, pig/group-by, and pig/cogroup.
pigpen.parquet
Functions for reading and writing parquet data and for creating parquet schemas. Start with `load-parquet` and `store-parquet`.
Public variables and functions:
pigpen.rx
A high performance dump operator. The default implementation in pigpen.core uses lazy seqs, which can be inefficient on larger data. The rx implementation uses rx-java to deliver a slightly more performance for large local datasets.
Public variables and functions:
pigpen.viz
Functions to create graph-viz graphs from PigPen queries.