Package: lingdist 1.0
lingdist: Fast Linguistic Distance and Alignment Computation
A fast generalized edit distance and string alignment computation mainly for linguistic aims. As a generalization to the classic edit distance algorithms, the package allows users to define custom cost for every symbol's insertion, deletion, and substitution. The package also allows character combinations in any length to be seen as a single symbol which is very useful for International Phonetic Alphabet (IPA) transcriptions with diacritics. In addition to edit distance result, users can get detailed alignment information such as all possible alignment scenarios between two strings which is useful for testing, illustration or any further usage. Either the distance matrix or its long table form can be obtained and tools to do such conversions are provided. All functions in the package are implemented in 'C++' and the distance matrix computation is parallelized leveraging the 'RcppThread' package.
Authors:
lingdist_1.0.tar.gz
lingdist_1.0.zip(r-4.5)lingdist_1.0.zip(r-4.4)lingdist_1.0.zip(r-4.3)
lingdist_1.0.tgz(r-4.4-x86_64)lingdist_1.0.tgz(r-4.4-arm64)lingdist_1.0.tgz(r-4.3-x86_64)lingdist_1.0.tgz(r-4.3-arm64)
lingdist_1.0.tar.gz(r-4.5-noble)lingdist_1.0.tar.gz(r-4.4-noble)
lingdist_1.0.tgz(r-4.4-emscripten)lingdist_1.0.tgz(r-4.3-emscripten)
lingdist.pdf |lingdist.html✨
lingdist/json (API)
NEWS
# Install 'lingdist' in R: |
install.packages('lingdist', repos = c('https://fncokg.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/fncokg/lingdist/issues
Last updated 1 years agofrom:37d25a0d9d. Checks:OK: 9. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 07 2024 |
R-4.5-win-x86_64 | OK | Nov 07 2024 |
R-4.5-linux-x86_64 | OK | Nov 07 2024 |
R-4.4-win-x86_64 | OK | Nov 07 2024 |
R-4.4-mac-x86_64 | OK | Nov 07 2024 |
R-4.4-mac-aarch64 | OK | Nov 07 2024 |
R-4.3-win-x86_64 | OK | Nov 07 2024 |
R-4.3-mac-x86_64 | OK | Nov 07 2024 |
R-4.3-mac-aarch64 | OK | Nov 07 2024 |
Exports:check_cost_definededit_dist_dfedit_dist_stringgenerate_default_cost_matrixlong2squareform
Dependencies:RcppRcppThread
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Check whether there's missing characters in the cost matrix. | check_cost_defined |
Compute edit distance between all row pairs of a dataframe | edit_dist_df |
Compute edit distance between two strings | edit_dist_string |
Generate a default cost matrix | generate_default_cost_matrix |
Convert long table to square form | long2squareform |