Package: lingdist 2.4.0
lingdist: Fast Linguistic Distance and Alignment Computation
A fast linguistic distance and alignment computation package. It implements generalized edit distance, Pointwise Mutual Information (PMI) distance, and Weighted Jaccard Distance (WJD). For generalized edit distance, the package allows users to define custom cost for every symbol's insertion, deletion, and substitution, and supports treating character combinations as single symbols (e.g., IPA with diacritics). It also provides detailed alignment information. PMI distance automatically learns costs from data. WJD is suitable for hierarchical categorical data with multiple forms. All functions are implemented in 'C++' and distance matrix computation is parallelized leveraging the 'RcppThread' package.
Authors:
lingdist_2.4.0.tar.gz
lingdist_2.4.0.zip(r-4.7)lingdist_2.4.0.zip(r-4.6)lingdist_2.4.0.zip(r-4.5)
lingdist_2.4.0.tgz(r-4.6-x86_64)lingdist_2.4.0.tgz(r-4.6-arm64)lingdist_2.4.0.tgz(r-4.5-x86_64)lingdist_2.4.0.tgz(r-4.5-arm64)
lingdist_2.4.0.tar.gz(r-4.7-arm64)lingdist_2.4.0.tar.gz(r-4.7-x86_64)lingdist_2.4.0.tar.gz(r-4.6-arm64)lingdist_2.4.0.tar.gz(r-4.6-x86_64)
lingdist_2.4.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
lingdist/json (API)
NEWS
| # Install 'lingdist' in R: |
| install.packages('lingdist', repos = c('https://fncokg.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/fncokg/lingdist/issues
Last updated from:84ed161715. Checks:11 ERROR, 2 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-arm64 | ERROR | 128 | ||
| linux-devel-x86_64 | ERROR | 127 | ||
| source / vignettes | OK | 182 | ||
| linux-release-arm64 | ERROR | 151 | ||
| linux-release-x86_64 | ERROR | 117 | ||
| macos-release-arm64 | ERROR | 86 | ||
| macos-release-x86_64 | ERROR | 197 | ||
| macos-oldrel-arm64 | ERROR | 117 | ||
| macos-oldrel-x86_64 | ERROR | 339 | ||
| windows-devel | ERROR | 127 | ||
| windows-release | ERROR | 134 | ||
| windows-oldrel | ERROR | 179 | ||
| wasm-release | OK | 128 |
Exports:check_cost_mat_symbolsgenerate_default_cost_matrixlong2squareformpw_edit_distpw_pmi_distpw_wjdstring_edit_dist
Dependencies:RcppRcppThread
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Check which symbols in `data` are missing from a cost matrix | check_cost_mat_symbols |
| Generate a default cost matrix | generate_default_cost_matrix |
| Convert long table to square form | long2squareform |
| Compute edit distance between all row pairs of a dataframe | pw_edit_dist |
| Compute PMI distance between all row pairs of a dataframe | pw_pmi_dist |
| Compute Weighted Jaccard Distance between all row pairs of a dataframe | pw_wjd |
| Compute edit distance between two strings | string_edit_dist |
