Crate tree_sitter
Rust Tree-sitter
Rust bindings to the Tree-sitter parsing library.
Basic Usage
First, create a parser:
use tree_sitter::{InputEdit, Language, Parser, Point};
let mut parser = Parser::new();
Add the cc crate to your Cargo.toml under [build-dependencies]:
[build-dependencies]
cc="*"
Then, add a language as a dependency:
[dependencies]
tree-sitter = "0.22"
tree-sitter-rust = "0.21"
To use a language, assign it to the parser:
parser.set_language(&tree_sitter_rust::language()).expect("Error loading Rust grammar");
Now you can parse source code:
let source_code = "fn test() {}";
let mut tree = parser.parse(source_code, None).unwrap();
let root_node = tree.root_node();
assert_eq!(root_node.kind(), "source_file");
assert_eq!(root_node.start_position().column, 0);
assert_eq!(root_node.end_position().column, 12);
Editing
Once you have a syntax tree, you can update it when your source code changes.
Passing the edited previous tree to parse makes it run much more quickly:
let new_source_code = "fn test(a: u32) {}";
tree.edit(&InputEdit {
start_byte: 8,
old_end_byte: 8,
new_end_byte: 14,
start_position: Point::new(0, 8),
old_end_position: Point::new(0, 8),
new_end_position: Point::new(0, 14),
});
let new_tree = parser.parse(new_source_code, Some(&tree));
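The six fields of the edit above have to agree with each other: the byte offsets and the row/column positions describe the same three locations. As a minimal sketch of how they could be derived, the following standard-library-only code computes them from the old text and the inserted text. Pos, EditSketch, position_at, and describe_edit are hypothetical names introduced for illustration; they are not part of the tree_sitter API, and the sketch assumes columns are counted in bytes.

```rust
/// Hypothetical stand-in for a row/column position (zero-based, column in bytes).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Pos {
    row: usize,
    column: usize,
}

/// Hypothetical stand-in mirroring the six fields used in the edit above.
#[derive(Debug, PartialEq, Eq)]
struct EditSketch {
    start_byte: usize,
    old_end_byte: usize,
    new_end_byte: usize,
    start_position: Pos,
    old_end_position: Pos,
    new_end_position: Pos,
}

/// Returns the row/column of `byte` within `text`.
/// Panics if `byte` is past the end of `text`.
fn position_at(text: &str, byte: usize) -> Pos {
    let before = &text.as_bytes()[..byte];
    // Each newline before the offset starts a new row.
    let row = before.iter().filter(|&&b| b == b'\n').count();
    // The current line starts just after the last newline, or at byte 0.
    let line_start = before
        .iter()
        .rposition(|&b| b == b'\n')
        .map_or(0, |i| i + 1);
    Pos { row, column: byte - line_start }
}

/// Describes replacing `old_text[start..old_end]` with `inserted`.
/// Assumes `start` and `old_end` fall on character boundaries.
fn describe_edit(old_text: &str, start: usize, old_end: usize, inserted: &str) -> EditSketch {
    // Build the new text so the new end position can be measured in it.
    let mut new_text = String::new();
    new_text.push_str(&old_text[..start]);
    new_text.push_str(inserted);
    new_text.push_str(&old_text[old_end..]);

    let new_end = start + inserted.len();
    EditSketch {
        start_byte: start,
        old_end_byte: old_end,
        new_end_byte: new_end,
        start_position: position_at(old_text, start),
        old_end_position: position_at(old_text, old_end),
        new_end_position: position_at(&new_text, new_end),
    }
}
```

Calling describe_edit("fn test() {}", 8, 8, "a: u32") reproduces the numbers in the edit above: the insertion starts and ends at byte 8 in the old text and ends at byte 14 in the new text, all on row 0.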
Text Input
The source code to parse can be provided as a string, a slice, a vector, or a function that returns a slice. The text can be encoded as either UTF-8 or UTF-16:
// Store some source code in an array of lines.
let lines = &[
"pub fn foo() {",
" 1",
"}",
];
// Parse the source code using a custom callback. The callback is called
// with both a byte offset and a row/column offset.
let tree = parser.parse_with(&mut |_byte: usize, position: Point| -> &[u8] {
let row = position.row as usize;
let column = position.column as usize;
if row < lines.len() {
if column < lines[row].as_bytes().len() {
&lines[row].as_bytes()[column..]
} else {
b"\n"
}
} else {
&[]
}
}, None).unwrap();
assert_eq!(
tree.root_node().to_sexp(),
"(source_file (function_item (visibility_modifier) (identifier) (parameters) (block (number_literal))))"
);
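To see what text the callback above actually presents, it can help to run the same chunking logic outside the parser. The sketch below is standard-library-only: chunk_at repeats the callback's body, and read_all is a hypothetical, simplified driver that keeps asking for the next chunk and advances the row and column by what it received. It is an illustration under those assumptions, not a description of how the parser itself schedules its reads.

```rust
/// Returns the chunk of text starting at (row, column), using the same
/// rules as the callback above: the rest of the line, then a newline,
/// then an empty slice once the rows are exhausted.
fn chunk_at<'a>(lines: &[&'a str], row: usize, column: usize) -> &'a [u8] {
    if row < lines.len() {
        let bytes = lines[row].as_bytes();
        if column < bytes.len() {
            &bytes[column..]
        } else {
            b"\n"
        }
    } else {
        &[]
    }
}

/// Hypothetical driver: concatenates chunks until an empty one signals the end.
fn read_all(lines: &[&str]) -> String {
    let (mut row, mut column) = (0usize, 0usize);
    let mut out = Vec::new();
    loop {
        let chunk = chunk_at(lines, row, column);
        if chunk.is_empty() {
            break;
        }
        // Advance the position past the bytes just received.
        for &b in chunk {
            if b == b'\n' {
                row += 1;
                column = 0;
            } else {
                column += 1;
            }
        }
        out.extend_from_slice(chunk);
    }
    String::from_utf8(out).expect("chunks come from &str lines and are valid UTF-8")
}
```

For the three lines used above, read_all returns the lines joined by newlines with a trailing newline, which shows why the callback returns b"\n" at the end of each row: the array of lines does not store the line terminators itself.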
Features
- std - This feature is enabled by default and allows tree-sitter to use the standard library.
  - Error types implement the std::error::Error trait.
  - regex performance optimizations are enabled.
  - The DOT graph methods are enabled.
- wasm - This feature allows tree-sitter to be built for Wasm targets using the wasmtime-c-api crate.
Modules
Structs
- An error that occurred in Parser::set_included_ranges.
- A summary of a change to a text document.
- An opaque object that defines how to parse a particular language. The code for each Language is generated by the Tree-sitter CLI.
- A stateful object that is used to look up symbols valid in a specific parse state.
- A single node within a syntax Tree.
- A stateful object that is used to produce a Tree based on some source code.
- A position in a multi-line text document, in terms of rows and columns.
- A set of patterns that match nodes in a syntax tree.
- A sequence of QueryCaptures associated with a given QueryCursor.
- An error that occurred when trying to create a Query.
- A sequence of QueryMatches associated with a given QueryCursor.
- A key-value pair associated with a particular pattern in a Query.
- A key-value pair associated with a particular pattern in a Query.
- A range of positions in a multi-line text document, both in terms of bytes and of rows and columns.
- A tree that represents the syntactic structure of a source code file.
- A stateful object for walking a syntax Tree efficiently.
Enums
- A quantifier for captures.
- A type of log message.
Constants
- The latest ABI version that is supported by the current version of the library.
- The earliest ABI version that is supported by the current version of the library.
Traits
Functions
- Sets the memory allocation functions that the core library should use.