2025-01-15 10:27:00
github.com
Ropey is a utf8 text rope for Rust, designed to be the backing text-buffer for
applications such as text editors. Ropey is fast, robust, and can handle huge
texts and memory-incoherent edits with ease.
use std::fs::File;
use std::io::BufWriter;

// Load a text file.
let mut text = ropey::Rope::from_reader(
    File::open("my_great_book.txt")?
)?;

// Print the 516th line (zero-indexed).
println!("{}", text.line(515));

// Get the start/end char indices of the line.
let start_idx = text.line_to_char(515);
let end_idx = text.line_to_char(516);

// Remove the line...
text.remove(start_idx..end_idx);

// ...and replace it with something better.
text.insert(start_idx, "The flowers are... so... dunno.\n");

// Print the changes, along with the previous few lines for context.
let start_idx = text.line_to_char(511);
let end_idx = text.line_to_char(516);
println!("{}", text.slice(start_idx..end_idx));

// Write the file back out to disk.
text.write_to(
    BufWriter::new(File::create("my_great_book.txt")?)
)?;
Ropey is designed and built to be the backing text buffer for applications
such as text editors, and its design trade-offs reflect that. Ropey is good
at:
- Handling frequent edits to medium-to-large texts. Even on texts that are
  multiple gigabytes large, edits are measured in single-digit microseconds.
- Handling Unicode correctly. It is impossible to create invalid utf8 through
  Ropey, and all Unicode line endings are correctly tracked including CRLF.
- Having flat, predictable performance characteristics. Ropey will never be
  the source of hiccups or stutters in your software.
On the other hand, Ropey is not good at:
- Handling texts smaller than a couple of kilobytes or so. That is to say,
  Ropey will handle them fine, but Ropey allocates space in kilobyte chunks,
  which introduces unnecessary bloat if your texts are almost always small.
- Handling texts that are larger than available memory. Ropey is an in-memory
  data structure.
- Getting the best performance for every possible use-case. Ropey puts work
  into tracking both line endings and Unicode scalar values, which is
  performance overhead you may not need depending on your use-case.
Keep this in mind when selecting Ropey for your project. Ropey is very good
at what it does, but like all software it is designed with certain
applications in mind.
Ropey’s atomic unit of text is Unicode scalar values (or chars in Rust)
encoded as utf8. All of Ropey’s editing and slicing operations are done
in terms of char indices, which prevents accidental creation of invalid
utf8 data.
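To make the char-index point concrete, here is a minimal sketch using only the standard library's `String`, not Ropey itself. The helper names `char_to_byte_idx` and `insert_at_char` are hypothetical and exist only for this illustration. Because the edit position is expressed in chars, the resulting byte offset always falls on a char boundary, so the insertion cannot split a multi-byte character.

```rust
/// Convert a char (Unicode scalar value) index into a byte offset in `s`.
/// Returns `None` if `char_idx` is past the end of the text.
/// (Hypothetical helper for illustration; this is a linear scan.)
fn char_to_byte_idx(s: &str, char_idx: usize) -> Option<usize> {
    s.char_indices()
        .map(|(byte_idx, _)| byte_idx)
        // Also allow the position one past the last char (append).
        .chain(std::iter::once(s.len()))
        .nth(char_idx)
}

/// Insert `text` at a char index. The byte offset used always lies on a
/// char boundary, so the string stays valid UTF-8. Returns `false` if the
/// index is out of range.
fn insert_at_char(s: &mut String, char_idx: usize, text: &str) -> bool {
    match char_to_byte_idx(s, char_idx) {
        Some(byte_idx) => {
            s.insert_str(byte_idx, text);
            true
        }
        None => false,
    }
}
```

Note that this sketch scans the text linearly to find the byte offset, which is exactly the cost a rope is meant to avoid on large texts.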
Ropey also supports converting between scalar value indices and utf16 code unit
indices, for interoperation with external APIs that may still use utf16.
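The relationship between char indices and utf16 code unit indices can be sketched with the standard library alone. The functions below are hypothetical helpers written for this illustration and are not Ropey's implementation. They rely on the fact that a char outside the Basic Multilingual Plane takes two utf16 code units (a surrogate pair), while every other char takes one.

```rust
/// Convert a char index into a UTF-16 code unit index by summing the
/// UTF-16 length of every char before it.
fn char_to_utf16_idx(s: &str, char_idx: usize) -> usize {
    s.chars().take(char_idx).map(|c| c.len_utf16()).sum()
}

/// Convert a UTF-16 code unit index into a char index. In this sketch, an
/// index that points into the middle of a surrogate pair maps to the char
/// that owns the pair, and an index at or past the end maps to the char
/// count.
fn utf16_to_char_idx(s: &str, utf16_idx: usize) -> usize {
    let mut units = 0;
    for (char_idx, c) in s.chars().enumerate() {
        units += c.len_utf16();
        if units > utf16_idx {
            return char_idx;
        }
    }
    s.chars().count()
}
```

For the text `"a😀b"`, the char `b` sits at char index 2 but at utf16 code unit index 3, which is the kind of mismatch this conversion resolves when talking to a utf16-based API.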
Ropey knows about line breaks, allowing you to index into and iterate over
lines of text.
The line breaks Ropey recognizes are also configurable at build time via
feature flags. See Ropey’s documentation for details.
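As an illustration of what tracking line breaks involves, the following standard-library sketch computes the char index at which each line starts, counting a CRLF pair as a single break. The function name `line_starts` is hypothetical, and the set of break characters in the `match` is an assumption chosen for this example; consult Ropey's documentation for the set it actually recognizes under each feature flag.

```rust
/// Return the char index at which each line starts. The first line always
/// starts at 0, and a trailing line break yields a final empty line.
/// The recognized break characters are an assumption for this sketch.
fn line_starts(text: &str) -> Vec<usize> {
    let mut starts = vec![0];
    let mut chars = text.chars().enumerate().peekable();
    while let Some((idx, c)) = chars.next() {
        match c {
            '\r' => {
                // A CR directly followed by LF is one break, not two.
                let crlf = matches!(chars.peek(), Some(&(_, '\n')));
                if crlf {
                    chars.next();
                }
                starts.push(idx + if crlf { 2 } else { 1 });
            }
            // LF, vertical tab, form feed, NEL, line and paragraph separator.
            '\n' | '\u{000B}' | '\u{000C}' | '\u{0085}' | '\u{2028}' | '\u{2029}' => {
                starts.push(idx + 1);
            }
            _ => {}
        }
    }
    starts
}
```

With a table like this, mapping a line number to a char index is a simple lookup, which mirrors the role `line_to_char` plays in the usage example earlier in this document.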
Ropey has rope slices that allow you to work with just parts of a rope, using
all the read-only operations of a full rope including iterators and making
sub-slices.
Although Ropey is intentionally limited in scope, it also provides APIs for
efficiently accessing and working with its internal text chunk
representation, allowing additional functionality to be efficiently
implemented by client code with minimal overhead.
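A sketch of the kind of client-side functionality this enables: the function below works on any sequence of `&str` chunks and never concatenates them into one contiguous string. The name `count_char` is hypothetical. It is written against a plain iterator of string slices so that it stands alone; feeding it a rope's chunks is an assumption about how you would wire it up, not something shown here.

```rust
/// Count occurrences of `needle` across a sequence of text chunks without
/// first joining them into one contiguous string. Each chunk is a valid
/// `&str`, so no char is ever split across two chunks.
fn count_char<'a, I>(chunks: I, needle: char) -> usize
where
    I: IntoIterator<Item = &'a str>,
{
    chunks
        .into_iter()
        .map(|chunk| chunk.chars().filter(|&c| c == needle).count())
        .sum()
}
```

Processing chunk by chunk keeps memory use flat regardless of the total text size, at the cost of having to handle patterns longer than one char that may straddle a chunk boundary (which this single-char example sidesteps).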
Ropey is fast and minimizes memory usage:
- On a recent mobile i7 Intel CPU, Ropey performed over 1.8 million small
  incoherent insertions per second while building up a text roughly 100 MB
  large. Coherent insertions (i.e. all near the same place in the text) are
  even faster, doing the same task at over 3.3 million insertions per
  second.
- Freshly loading a file from disk only incurs about 10% memory overhead. For
  example, a 100 MB text file will occupy about 110 MB of memory when loaded
  by Ropey.
- Cloning ropes is extremely cheap. Rope clones share data, so an initial
  clone only takes 8 bytes of memory. After that, memory usage will grow
  incrementally as the clones diverge due to edits.
Ropey ensures that even though clones share memory, everything is thread-safe.
Clones can be sent to other threads for both reading and writing.
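One common way to get cheap, thread-safe clones that diverge only when edited is atomic reference counting with copy-on-write. The toy type below sketches that general technique with the standard library; `ToyRope` and its methods are hypothetical, it is a flat list of chunks rather than a tree, and it should not be read as a description of Ropey's internals.

```rust
use std::sync::Arc;
use std::thread;

/// A toy stand-in for a rope: a shared list of shared text chunks.
/// Cloning copies only the outer `Arc` pointer.
#[derive(Clone)]
struct ToyRope {
    root: Arc<Vec<Arc<String>>>,
}

impl ToyRope {
    fn from_chunks(chunks: &[&str]) -> Self {
        let chunks: Vec<Arc<String>> =
            chunks.iter().map(|c| Arc::new((*c).to_string())).collect();
        ToyRope { root: Arc::new(chunks) }
    }

    /// Append to one chunk. `Arc::make_mut` copies data only when it is
    /// still shared with another clone, so untouched chunks stay shared.
    fn append_to_chunk(&mut self, chunk_idx: usize, text: &str) {
        let chunks = Arc::make_mut(&mut self.root);
        Arc::make_mut(&mut chunks[chunk_idx]).push_str(text);
    }

    fn text(&self) -> String {
        self.root.iter().map(|c| c.as_str()).collect()
    }

    /// True if both ropes point at the very same chunk allocation.
    fn shares_chunk_with(&self, other: &ToyRope, chunk_idx: usize) -> bool {
        Arc::ptr_eq(&self.root[chunk_idx], &other.root[chunk_idx])
    }
}

/// Move a clone to another thread, edit it there, and hand it back.
fn edit_on_thread(mut clone: ToyRope) -> ToyRope {
    thread::spawn(move || {
        clone.append_to_chunk(1, "!");
        clone
    })
    .join()
    .expect("worker thread panicked")
}
```

After `edit_on_thread(original.clone())`, the original is unchanged, the edited clone has its own copy of the one chunk it touched, and the untouched chunk is still shared between the two, which is the incremental growth in memory described above.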
Ropey uses unsafe code to help achieve some of its space and performance
characteristics. Although effort has been put into keeping the unsafe code
compartmentalized and making it correct, please be cautious about using Ropey
in software that may face adversarial conditions.
Auditing, fuzzing, etc. of the unsafe code in Ropey is extremely welcome.
If you find any unsoundness, please file an issue! Also welcome are
recommendations for how to remove any of the unsafe code without introducing
significant space or performance regressions, or how to compartmentalize the
unsafe code even better.
Ropey is licensed under the MIT license (LICENSE.md or http://opensource.org/licenses/MIT).
Contributions are absolutely welcome! However, please open an issue to discuss
larger changes, to avoid doing a lot of work that may get rejected. Also note
that PRs that add dependencies are very likely to be rejected (Ropey aims to
have minimal dependencies).
An overview of Ropey’s design can be found here.
Unless you explicitly state otherwise, any contribution intentionally submitted
for inclusion in Ropey by you will be licensed as above, without any additional
terms or conditions.