Documentation
=============
``mfpcluster``
--------------
``mfpcluster`` clusters the sequences of a fasta file over a range of similarity values using `CD-HIT `_.
* **Inputs**
* **-i, --in [input_fasta]**
The fasta file to cluster.
* **Outputs**
* A directory containing:
* *0.n.fa.clstr*: The clusters found by ``CD-HIT`` when clustering ``input_fasta`` at similarity threhsold *n*.
* *0.n.fa*: Fasta files containing the representative sequences for the clusters found when clustering ``input_fasta`` at similarity threshold *n*.
* *.log*: A plain-text document containing diagnostic information from the current run of ``mfpcluster``.
* *command.cluster.sh*: A newline separated document containing the commands passed to `CD-HIT` to perform clustering.
* *cluster_counts.tsv*: A tsv containing with columns for the similarity threshold clustered at, the number of clusters found at that simlarity threshold, and the first-order difference calculated at that point.
* **Optional arguments**
* **-o, --out [string]** *(Default: .clustering)*
The name of the output directory.
Version History
---------------
* v0.51 (11/14/2019): Adding documentation.
* v0.5 (11/14/2019): Beta release.