nc: Named Capture to Data Tables

User-friendly functions for extracting a data table (row for each match, column for each group) from non-tabular text data using regular expressions, and for melting columns that match a regular expression. Patterns are defined using a readable syntax that makes it easy to build complex patterns in terms of simpler, re-usable sub-patterns. Named R arguments are translated to column names in the output; capture groups without names are used internally in order to provide a standard interface to three regular expression 'C' libraries ('PCRE', 'RE2', 'ICU'). Output can also include numeric columns via user-specified type conversion functions.

Version: 2024.9.20
Depends: R (≥ 2.14)
Imports: data.table (≥ 1.15.0)
Suggests: testthat, re2, stringi, ggplot2, tidyr (≥ 1.0.0), cdata, reshape2, knitr, markdown, rmarkdown, R.utils, covr, arrow
Published: 2024-09-20
DOI: 10.32614/CRAN.package.nc
Author: Toby Hocking [aut, cre]
Maintainer: Toby Hocking <toby.hocking at r-project.org>
BugReports: https://github.com/tdhock/nc/issues
License: GPL-3
URL: https://github.com/tdhock/nc
NeedsCompilation: no
Materials: NEWS
CRAN checks: nc results

Documentation:

Reference manual: nc.pdf
Vignettes: Overview of nc functionality (source, R code)
Capture first match (source, R code)
Capture all matches in a single subject string (source, R code)
Capture melt (source, R code)
Comparisons with other packages (source, R code)
Helper functions (source, R code)
Uniform interface to three regex engines (source, R code)
Reading regularly named files (source, R code)

Downloads:

Package source: nc_2024.9.20.tar.gz
Windows binaries: r-devel: nc_2024.9.20.zip, r-release: nc_2024.9.20.zip, r-oldrel: nc_2024.9.20.zip
macOS binaries: r-release (arm64): nc_2024.9.20.tgz, r-oldrel (arm64): nc_2024.9.20.tgz, r-release (x86_64): nc_2024.9.20.tgz, r-oldrel (x86_64): nc_2024.9.20.tgz
Old sources: nc archive

Reverse dependencies:

Reverse suggests: aum, mlr3resampling

Linking:

Please use the canonical form https://CRAN.R-project.org/package=nc to link to this page.