|
| 1 | +> **Rust Edition Guide は現在 Rust 2024 のアップデート作業に向けて翻訳作業中です。本ページは英語版をコピーしていますが、一部のリンクが動作しないなどの問題が発生する場合があります。問題が発生した場合は、[原文(英語版)](https://doc.rust-lang.org/nightly/edition-guide/introduction.html)をご参照ください。** |
| 2 | +
|
| 3 | +# C-string literals |
| 4 | + |
| 5 | +## Summary |
| 6 | + |
| 7 | +- Literals of the form `c"foo"` or `cr"foo"` represent a string of type [`&core::ffi::CStr`][CStr]. |
| 8 | + |
| 9 | +[CStr]: ../../core/ffi/struct.CStr.html |
| 10 | + |
| 11 | +## Details |
| 12 | + |
| 13 | +Starting with Rust 1.77, C-strings can be written using C-string literal syntax with the `c` or `cr` prefix. |
| 14 | + |
| 15 | +Previously, it was challenging to properly produce a valid string literal that could interoperate with C APIs which terminate with a NUL byte. |
| 16 | +The [`cstr`] crate was a popular solution, but that required compiling a proc-macro which was quite expensive. |
| 17 | +Now, C-strings can be written directly using literal syntax notation, which will generate a value of type [`&core::ffi::CStr`][CStr] which is automatically terminated with a NUL byte. |
| 18 | + |
| 19 | +```rust,edition2021 |
| 20 | +# use core::ffi::CStr; |
| 21 | +
|
| 22 | +assert_eq!(c"hello", CStr::from_bytes_with_nul(b"hello\0").unwrap()); |
| 23 | +assert_eq!( |
| 24 | + c"byte escapes \xff work", |
| 25 | + CStr::from_bytes_with_nul(b"byte escapes \xff work\0").unwrap() |
| 26 | +); |
| 27 | +assert_eq!( |
| 28 | + c"unicode escapes \u{00E6} work", |
| 29 | + CStr::from_bytes_with_nul(b"unicode escapes \xc3\xa6 work\0").unwrap() |
| 30 | +); |
| 31 | +assert_eq!( |
| 32 | + c"unicode characters αβγ encoded as UTF-8", |
| 33 | + CStr::from_bytes_with_nul( |
| 34 | + b"unicode characters \xce\xb1\xce\xb2\xce\xb3 encoded as UTF-8\0" |
| 35 | + ) |
| 36 | + .unwrap() |
| 37 | +); |
| 38 | +assert_eq!( |
| 39 | + c"strings can continue \ |
| 40 | + on multiple lines", |
| 41 | + CStr::from_bytes_with_nul(b"strings can continue on multiple lines\0").unwrap() |
| 42 | +); |
| 43 | +``` |
| 44 | + |
| 45 | +C-strings do not allow interior NUL bytes (such as with a `\0` escape). |
| 46 | + |
| 47 | +Similar to regular strings, C-strings also support "raw" syntax with the `cr` prefix. |
| 48 | +These raw C-strings do not process backslash escapes which can make it easier to write strings that contain backslashes. |
| 49 | +Double-quotes can be included by surrounding the quotes with the `#` character. |
| 50 | +Multiple `#` characters can be used to avoid ambiguity with internal `"#` sequences. |
| 51 | + |
| 52 | +```rust,edition2021 |
| 53 | +assert_eq!(cr"foo", c"foo"); |
| 54 | +// Number signs can be used to embed interior double quotes. |
| 55 | +assert_eq!(cr#""foo""#, c"\"foo\""); |
| 56 | +// This requires two #. |
| 57 | +assert_eq!(cr##""foo"#"##, c"\"foo\"#"); |
| 58 | +// Escapes are not processed. |
| 59 | +assert_eq!(cr"C:\foo", c"C:\\foo"); |
| 60 | +``` |
| 61 | + |
| 62 | +See [The Reference] for more details. |
| 63 | + |
| 64 | +[`cstr`]: https://crates.io/crates/cstr |
| 65 | +[The Reference]: ../../reference/tokens.html#c-string-and-raw-c-string-literals |
| 66 | + |
| 67 | +## Migration |
| 68 | + |
| 69 | +Migration is only necessary for macros which may have been assuming a sequence of tokens that looks similar to `c"…"` or `cr"…"`, which previous to the 2021 edition would tokenize as two separate tokens, but in 2021 appears as a single token. |
| 70 | + |
| 71 | +As part of the [syntax reservation] for the 2021 edition, any macro input which may run into this issue should issue a warning from the `rust_2021_prefixes_incompatible_syntax` migration lint. |
| 72 | +See that chapter for more detail. |
| 73 | + |
| 74 | +[syntax reservation]: reserved-syntax.md |
0 commit comments