Struct regex_automata::Regex

source · [−]

pub struct Regex<D: DFA = DenseDFA<Vec<usize>, usize>> { /* private fields */ }

Expand description

A regular expression that uses deterministic finite automata for fast searching.

A regular expression is comprised of two DFAs, a “forward” DFA and a “reverse” DFA. The forward DFA is responsible for detecting the end of a match while the reverse DFA is responsible for detecting the start of a match. Thus, in order to find the bounds of any given match, a forward search must first be run followed by a reverse search. A match found by the forward DFA guarantees that the reverse DFA will also find a match.

The type of the DFA used by a Regex corresponds to the D type parameter, which must satisfy the DFA trait. Typically, D is either a DenseDFA or a SparseDFA, where dense DFAs use more memory but search faster, while sparse DFAs use less memory but search more slowly.

By default, a regex’s DFA type parameter is set to DenseDFA<Vec<usize>, usize>. For most in-memory work loads, this is the most convenient type that gives the best search performance.

Sparse DFAs

Since a Regex is generic over the DFA trait, it can be used with any kind of DFA. While this crate constructs dense DFAs by default, it is easy enough to build corresponding sparse DFAs, and then build a regex from them:

use regex_automata::Regex;

// First, build a regex that uses dense DFAs.
let dense_re = Regex::new("foo[0-9]+")?;

// Second, build sparse DFAs from the forward and reverse dense DFAs.
let fwd = dense_re.forward().to_sparse()?;
let rev = dense_re.reverse().to_sparse()?;

// Third, build a new regex from the constituent sparse DFAs.
let sparse_re = Regex::from_dfas(fwd, rev);

// A regex that uses sparse DFAs can be used just like with dense DFAs.
assert_eq!(true, sparse_re.is_match(b"foo123"));

Struct regex_automata::Regex

Implementations

impl Regex

pub fn new(pattern: &str) -> Result<Regex, Error>

impl Regex<SparseDFA<Vec<u8>, usize>>

pub fn new_sparse( pattern: &str) -> Result<Regex<SparseDFA<Vec<u8>, usize>>, Error>

impl<D: DFA> Regex<D>

pub fn is_match(&self, input: &[u8]) -> bool

pub fn shortest_match(&self, input: &[u8]) -> Option<usize>

pub fn find(&self, input: &[u8]) -> Option<(usize, usize)>

pub fn is_match_at(&self, input: &[u8], start: usize) -> bool

pub fn shortest_match_at(&self, input: &[u8], start: usize) -> Option<usize>

pub fn find_at(&self, input: &[u8], start: usize) -> Option<(usize, usize)>

pub fn find_iter<'r, 't>(&'r self, input: &'t [u8]) -> Matches<'r, 't, D>

pub fn from_dfas(forward: D, reverse: D) -> Regex<D>

pub fn forward(&self) -> &D

pub fn reverse(&self) -> &D

Trait Implementations

impl<D: Clone + DFA> Clone for Regex<D>

fn clone(&self) -> Regex<D>

fn clone_from(&mut self, source: &Self)

impl<D: Debug + DFA> Debug for Regex<D>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Auto Trait Implementations

impl<D> RefUnwindSafe for Regex<D> where D: RefUnwindSafe,

impl<D> Send for Regex<D> where D: Send,

impl<D> Sync for Regex<D> where D: Sync,

impl<D> Unpin for Regex<D> where D: Unpin,

impl<D> UnwindSafe for Regex<D> where D: UnwindSafe,

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

pub fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

pub fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

pub fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

pub fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

pub fn into(self) -> U

impl<T> ToOwned for T where T: Clone,

type Owned = T

pub fn to_owned(&self) -> T

pub fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

pub fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

pub fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

pub fn new_sparse(
pattern: &str
) -> Result<Regex<SparseDFA<Vec<u8>, usize>>, Error>

pub fn is_match(&self, input: &[u8 ]) -> bool

pub fn shortest_match(&self, input: &[u8 ]) -> Option<usize>

pub fn find(&self, input: &[u8 ]) -> Option<(usize, usize )>

pub fn is_match_at(&self, input: &[u8 ], start: usize) -> bool

pub fn shortest_match_at(&self, input: &[u8 ], start: usize) -> Option<usize>

pub fn find_at(&self, input: &[u8 ], start: usize) -> Option<(usize, usize )>

pub fn find_iter<'r, 't>(&'r self, input: &'t [u8 ]) -> Matches<'r, 't, D>

impl<D> RefUnwindSafe for Regex<D> where
D: RefUnwindSafe,

impl<D> Send for Regex<D> where
D: Send,

impl<D> Sync for Regex<D> where
D: Sync,

impl<D> Unpin for Regex<D> where
D: Unpin,

impl<D> UnwindSafe for Regex<D> where
D: UnwindSafe,

impl<T> Any for T where
T: 'static + ?Sized,

impl<T> Borrow<T> for T where
T: ?Sized,

impl<T> BorrowMut<T> for T where
T: ?Sized,

impl<T, U> Into<U> for T where
U: From<T>,

impl<T> ToOwned for T where
T: Clone,

impl<T, U> TryFrom<U> for T where
U: Into<T>,

impl<T, U> TryInto<U> for T where
U: TryFrom<T>,