abstract |
Computer-based techniques for grouping documents are described herein. Documents may be grouped, organized, named, and/or indexed by their document character features. Document character features may comprise character counts, character difference counts, missing character counts, or any combination thereof. Documents may be compared using a comparison threshold value to determine whether they belong to the same group. Documents may be processed in any language. |
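
The abstract does not define how the character features are computed or how they are compared, so the following Python sketch is only one possible reading, not the described method. It assumes that a character count is a per-character tally over non-whitespace characters, that a character difference count is the sum of absolute per-character count differences between two documents, that a missing character count is the number of distinct characters found in only one of the two documents, and that a document joins the first existing group whose representative differs from it by no more than the threshold. The names `char_counts`, `char_difference`, `missing_chars`, and `group_documents` are hypothetical.

```python
from collections import Counter


def char_counts(text):
    """Tally each non-whitespace character; works on any Unicode text."""
    return Counter(ch for ch in text if not ch.isspace())


def char_difference(a, b):
    """Assumed metric: sum of absolute per-character count differences."""
    return sum(abs(a[ch] - b[ch]) for ch in a.keys() | b.keys())


def missing_chars(a, b):
    """Assumed metric: distinct characters present in only one document."""
    return len(a.keys() ^ b.keys())


def group_documents(docs, threshold):
    """Greedy grouping (an assumption): a document joins the first group
    whose representative differs from it by at most `threshold`."""
    groups = []  # each entry: (representative counts, member names)
    for name, text in docs.items():
        counts = char_counts(text)
        for representative, members in groups:
            if char_difference(counts, representative) <= threshold:
                members.append(name)
                break
        else:
            # No existing group is close enough, so start a new one.
            groups.append((counts, [name]))
    return [members for _, members in groups]


# Hypothetical sample documents, including one in a non-Latin script.
docs = {
    "a": "invoice 1001",
    "b": "invoice 1002",
    "c": "日本語の文書",
}
print(group_documents(docs, threshold=2))
```

Because the features are tallies of Unicode characters rather than words, the sketch needs no tokenizer or dictionary, which is consistent with the statement that documents may be processed in any language. The greedy first-match strategy depends on document order; a different clustering rule would fit the abstract equally well.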