Copyright | (c) 2009, 2010 Bryan O'Sullivan |
---|---|
License | BSD-style |
Maintainer | bos@serpentine.com |
Stability | experimental |
Portability | GHC |
Safe Haskell | None |
Language | Haskell98 |
Support for using Text
data with native code via the Haskell
foreign function interface.
- data I16
- fromPtr :: Ptr Word16 -> I16 -> IO Text
- useAsPtr :: Text -> (Ptr Word16 -> I16 -> IO a) -> IO a
- asForeignPtr :: Text -> IO (ForeignPtr Word16, I16)
- peekCStringLen :: CStringLen -> IO Text
- withCStringLen :: Text -> (CStringLen -> IO a) -> IO a
- lengthWord16 :: Text -> Int
- unsafeCopyToPtr :: Text -> Ptr Word16 -> IO ()
- dropWord16 :: I16 -> Text -> Text
- takeWord16 :: I16 -> Text -> Text
Interoperability with native code
The Text
type is implemented using arrays that are not guaranteed
to have a fixed address in the Haskell heap. All communication with
native code must thus occur by copying data back and forth.
The Text
type's internal representation is UTF-16, using the
platform's native endianness. This makes copied data suitable for
use with native libraries that use a similar representation, such
as ICU. To interoperate with native libraries that use different
internal representations, such as UTF-8 or UTF-32, consider using
the functions in the Encoding
module.
A type representing a number of UTF-16 code units.
Safe conversion functions
useAsPtr :: Text -> (Ptr Word16 -> I16 -> IO a) -> IO a Source
O(n) Perform an action on a temporary, mutable copy of a
Text
. The copy is freed as soon as the action returns.
asForeignPtr :: Text -> IO (ForeignPtr Word16, I16) Source
O(n) Make a mutable copy of a Text
.
Encoding as UTF-8
peekCStringLen :: CStringLen -> IO Text Source
O(n) Decode a C string with explicit length, which is assumed
to have been encoded as UTF-8. If decoding fails, a
UnicodeException
is thrown.
withCStringLen :: Text -> (CStringLen -> IO a) -> IO a Source
Marshal a Text
into a C string encoded as UTF-8 in temporary
storage, with explicit length information. The encoded string may
contain NUL bytes, and is not followed by a trailing NUL byte.
The temporary storage is freed when the subcomputation terminates (either normally or via an exception), so the pointer to the temporary storage must not be used after this function returns.
Unsafe conversion code
lengthWord16 :: Text -> Int Source
O(1) Return the length of a Text
in units of Word16
. This
is useful for sizing a target array appropriately before using
unsafeCopyToPtr
.
Low-level manipulation
Foreign functions that use UTF-16 internally may return indices in
units of Word16
instead of characters. These functions may
safely be used with such indices, as they will adjust offsets if
necessary to preserve the validity of a Unicode string.
dropWord16 :: I16 -> Text -> Text Source