IO extensions
Overview
This extension to boost::gil provides an easy to use interface for reading and writing various image formats. It also includes a framework for adding new formats.
Please see section 3.3 for all supported image formats. A basic tutorial is provided in section [link gil.io.tutorial Tutorial]. Also, this extension requires Boost version 1.42 and up. Furthermore the GIL extension Toolbox is used.
For adding new image formats please refer to section [link gil.io.using_io.extending_gil__io_with_new_formats Extending GIL::IO with new Formats].
Supported Platforms
All platforms supported by Boost which have a decent C++ compiler. Depending on the image format one or more of the following image libraries might be needed:
- libtiff
- libjpeg
- libpng
- libraw
- zlib
The library is designed to support as many formats as required by the user. For instance, if the user only needs bmp support none of the above mentioned dependencies are required.
There are more details available in this documentation on the image format dependencies. Please see section [link gil.io.using_io.supported_image_formats Supported Image Formats].
Tutorial
Thanks to modern C++ programming techniques the interface for this library is rather small and easy to use. In this tutorial I’ll give you a short walk-around on how to use this boost::gil extension. For more details please refer to section 3.
For each supported IO format a single top-level header file is provided. For instance, include boost/gil/extension/io/tiff.hpp to be able to read or write TIFF files.
Reading An Image
Probably the most common case to read a tiff image can be done as follows:
std::string filename( "image.tif" );
rgb8_image_t img;
read_image( filename, img, tiff_tag() );
The code would be same for all other image formats. The only thing that needs
to change is the tag type ( tiff_tag ) in the read_image call.
The read_image() expects the supplied image type to be compatible with the
image stored in the file. If the user doesn’t know what format an image has she
can use read_and_convert_image().
Another important fact is that read_image() will allocate the appropriate
memory needed for the read operation. There are read_view
or
read_and_convert_view
counterparts, if the memory is already allocated.
Sometimes the user only wants to read a sub-part of an image, then the above call would look as follows:
read_image( filename
, img
, image_read_settings< tiff_tag >( point_t( 0, 0 ), point_t( 50, 50 ) )
);
The image_read_settings class will provide the user with image format independent reading setting but can also serves as a pointer for format dependent settings. Please see the specific image format sections [link gil.io.using_io.supported_image_formats Supported Image Formats] for more details.
Writing An Image
Besides reading the information also writing is the second part of this Boost.GIL extension. Writing is a lot simpler than reading since an existing image view contains all the information.
For instance writing an image can be done as follows:
std::string filename( "image.tif" );
rgb8_image_t img( 640, 480 );
// write data into image
write_view( filename
, view( img )
, tiff_tag()
);
The interface is similar to reading an image. To add image format specific
parameter the user can use image_write_info
class.
For instance, a user can specify the JPEG quality when writing like this:
std::string filename( "image.jpg" );
rgb8_image_t img( 640, 480 );
// write data into image
write_view( filename
, view( img )
, image_write_info< jpeg_tag >( 95 )
);
The above example will write an image where the jpeg quality is set to 95 percent.
Reading And Writing In-Memory Buffers
Reading and writing in-memory buffers are supported as well. See as follows:
// 1. Read an image.
ifstream in( "test.tif", ios::binary );
rgb8_image_t img;
read_image( in, img, tiff_tag() );
// 2. Write image to in-memory buffer.
stringstream out_buffer( ios_base::out | ios_base::binary );
rgb8_image_t src;
write_view( out_buffer, view( src ), tiff_tag() );
// 3. Copy in-memory buffer to another.
stringstream in_buffer( ios_base::in | ios_base::binary );
in_buffer << out_buffer.rdbuf();
// 4. Read in-memory buffer to gil image
rgb8_image_t dst;
read_image( in_buffer, dst, tag_t() );
// 5. Write out image.
string filename( "out.tif" );
ofstream out( filename.c_str(), ios_base::binary );
write_view( out, view( dst ), tiff_tag() );
In case the user is using his own stream classes he has to make sure it has the common interface read, write, seek, close, etc. Interface.
Using IO
General Overview
The tutorial pointed out some use cases for reading and writing images in various image formats. This section will provide a more thorough overview.
The next sections will introduce the Read and Write interface. But it might be worth pointing out that by using some advanced metaprogramming techniques the interface is rather small and hopefully easy to understand.
Besides the general interface the user also has the ability to interface directly with the underlying image format. For that each reader or writer provides access to the so-called backend.
For instance:
typedef get_reader_backend< const std::string
, tag_t
>::type backend_t;
backend_t backend = read_image_info( bmp_filename
, tag_t()
);
BOOST_CHECK_EQUAL( backend._info._width , 127 );
BOOST_CHECK_EQUAL( backend._info._height, 64 );
Of course, the typedef can be removed when using c++11’s auto feature.
Read Interface
As the Tutorial demonstrated there are a few ways to read images. Here is an enumeration of all read functions with a short description:
read_image
- read into a gil image with no conversion. Memory is allocated.read_view
- read into a gil view with no conversion.read_and_convert_image
- read and convert into a gil image. Memory is allocated.read_and_convert_view
- read and convert into a gil view.read_image_info
- read the image header.
Conversion in this context is necessary if the source (file) has an incompatible color space with the destination (gil image type). If that’s the case the user has to use the xxx_and_convert_xxx variants.
All functions take the filename or a device as the first parameter.
The filename can be anything from a C-string, std::string
,
std::wstring
and boost::filesystem
path. When using the path
object the user needs to define the ADD_FS_PATH_SUPPORT compiler symbol to
include the boost::filesystem dependency.
Devices could be a FILE*
, std::ifstream
, and TIFF*
for TIFF images.
The second parameter is either an image or view type depending on the
read_xxx
function.
The third and last parameter is either an instance of the
image_read_settings<FormatTag>
or just the FormatTag
.
The settings can be various depending on the format which is being read.
But the all share settings for reading a partial image area.
The first point describes the top left image coordinate whereas the second
are the dimensions in x and y directions.
Here an example of setting up partial read:
read_image( filename
, img
, image_read_settings< tiff_tag >( point_t( 0, 0 ), point_t( 50, 50 ) )
);
Each format supports reading just the header information,
using read_image_info
. Please refer to the format specific sections
under 3.3. A basic example follows:
image_read_info< tiff_t > info = read_image_info( filename
, tiff_t()
);
GIL also comes with a dynamic image extension.
In the context of GIL.IO a user can define an any_image
type based on
several image types. The IO extension would then pick the matching image type
to the current image file.
The following example shows this feature:
typedef mpl::vector< gray8_image_t
, gray16_image_t
, rgb8_image_t
, rgba_image_t
> my_img_types;
any_image< my_img_types > runtime_image;
read_image( filename
, runtime_image
, tiff_tag()
);
During the review it became clear that there is a need to read big images
scanline by scanline. To support such use case a scanline_reader
is
implemented for all supported image formats.
The scanline_read_iterators
will then allow to traverse through the image.
The following code sample shows the usage:
typedef tiff_tag tag_t;
typedef scanline_reader< typename get_read_device< const char*
, tag_t
>::type
, tag_t
> reader_t;
reader_t reader = make_scanline_reader( "C:/boost/libs/gil/test/extension/io/images/tiff/test.tif", tag_t() );
typedef rgba8_image_t image_t;
image_t dst( reader._info._width, reader._info._height );
fill_pixels( view(dst), image_t::value_type() );
typedef reader_t::iterator_t iterator_t;
iterator_t it = reader.begin();
iterator_t end = reader.end();
for( int row = 0; it != end; ++it, ++row )
{
copy_pixels( interleaved_view( reader._info._width
, 1
, ( image_t::view_t::x_iterator ) *it
, reader._scanline_length
)
, subimage_view( view( dst )
, 0
, row
, reader._info._width
, 1
)
);
}
There are many ways to traverse an image but for as of now only by scanline is supported.
Write Interface
There is only one function for writing out images, write_view.
Similar to reading the first parameter is either a filename or a device.
The filename can be anything from a C-string, std::string
,
std::wstring
, and boost::filesystem
path. When using the path object
the user needs to define the ADD_FS_PATH_SUPPORT
compiler symbol to
include the boost::filesystem
dependency.
Devices could be FILE*
, std::ifstream
, and TIFF*
for TIFF images.
The second parameter is an view object to image being written.
The third and last parameter is either a tag or an
image_write_info<FormatTag>
object containing more settings.
One example for instance is the JPEG quality.
Refer to the format specific sections under 3.3. to have a list of all
the possible settings.
Writing an any_image<...> is supported. See the following example:
typedef mpl::vector< gray8_image_t
, gray16_image_t
, rgb8_image_t
, rgba_image_t
> my_img_types;
any_image< my_img_types > runtime_image;
// fill any_image
write_view( filename
, view( runtime_image )
, tiff_tag()
);
Compiler Symbols
The following table gives an overview of all supported compiler symbols that can be set by the user:
Symbol | Description |
---|---|
BOOST_GIL_IO_ENABLE_GRAY_ALPHA | Enable the color space “gray_alpha”. |
BOOST_GIL_IO_ADD_FS_PATH_SUPPORT | Enable boost::filesystem 3.0 library. |
BOOST_GIL_IO_PNG_FLOATING_POINT_SUPPORTED | Use libpng in floating point mode. This symbol is incompatible with BOOST_GIL_IO_PNG_FIXED_POINT_SUPPORTED. |
BOOST_GIL_IO_PNG_FIXED_POINT_SUPPORTED | Use libpng in integer mode. This symbol is incompatible with BOOST_GIL_IO_PNG_FLOATING_POINT_SUPPORTED. |
BOOST_GIL_IO_PNG_DITHERING_SUPPORTED | Look up “dithering” in libpng manual for explanation. |
BOOST_GIL_IO_PNG_1_4_OR_LOWER | Allow compiling with libpng 1.4 or lower. |
BOOST_GIL_EXTENSION_IO_JPEG_C_LIB_COMPILED_AS_CPLUSPLUS | libjpeg is compiled as c++ lib. |
BOOST_GIL_EXTENSION_IO_PNG_C_LIB_COMPILED_AS_CPLUSPLUS | libpng is compiled as c++ lib. |
BOOST_GIL_EXTENSION_IO_TIFF_C_LIB_COMPILED_AS_CPLUSPLUS | libtiff is compiled as c++ lib. |
BOOST_GIL_EXTENSION_IO_ZLIB_C_LIB_COMPILED_AS_CPLUSPLUS | zlib is compiled as c++ lib. |
BOOST_GIL_IO_TEST_ALLOW_READING_IMAGES | Allow basic test images to be read from local hard drive. The paths can be set in paths.hpp |
BOOST_GIL_IO_TEST_ALLOW_WRITING_IMAGES | Allow images to be written to the local hard drive. The paths can be set in paths.hpp |
BOOST_GIL_IO_USE_BMP_TEST_SUITE_IMAGES | Run tests using the bmp test images suite. See _BMP_TEST_FILES |
BOOST_GIL_IO_USE_PNG_TEST_SUITE_IMAGES | Run tests using the png test images suite. See _PNG_TEST_FILES |
BOOST_GIL_IO_USE_PNM_TEST_SUITE_IMAGES | Run tests using the pnm test images suite. Send me an email for accessing the files. |
BOOST_GIL_IO_USE_TIFF_LIBTIFF_TEST_SUITE_IMAGES | Run tests using the targa file format test images suite. See _TIFF_LIB_TIFF_TEST_FILES |
BOOST_GIL_IO_USE_TIFF_GRAPHICSMAGICK_TEST_SUITE_IMAGES | Run tests using the targa file format test images suite. See _TIFF_GRAPHICSMAGICK_TEST_FILES |
Supported Image Formats
BMP
For a general overview of the BMP image file format go to the following BMP_Wiki.
Please note, the code has not been tested on X Windows System variations of the BMP format which are usually referred to XBM and XPM formats.
Here, only the MS Windows and OS/2 format is relevant.
Currently the code is able to read and write the following image types:
Read: | gray1_image_t , gray4_image_t , gray8_image_t , rgb8_image_t and, rgba8_image_t |
---|---|
Write: | rgb8_image_t and, rgba8_image_t |
The lack of having an indexed image type in gil restricts the current interface to only write out non-indexed images. This is subject to change soon.
JPEG
For a general overview of the JPEG image file format go to the following JPEG_Wiki.
This jpeg extension is based on the libjpeg library which can be found here, JPEG_Lib.
All versions starting from 8x are supported.
The user has to make sure this library is properly installed. I strongly recommend the user to build the library yourself. It could potentially save you a lot of trouble.
Currently the code is able to read and write the following image types:
Read: | gray8_image_t , rgb8_image_t , cmyk8_image_t |
---|---|
Write: | gray8_image_t , rgb8_image_t , cmyk8_image_t |
Reading YCbCr or YCCK images is possible but might result in inaccuracies since both color spaces aren’t available yet for gil. For now these color space are read as rgb images. This is subject to change soon.
PNG
For a general overview of the PNG image file format go to the following PNG_Wiki.
This png extension is based on the libpng, which can be found here, PNG_Lib.
All versions starting from 1.5.x are supported.
The user has to make sure this library is properly installed. I strongly recommend the user to build the library yourself. It could potentially save you a lot of trouble.
Currently the code is able to read and write the following image types:
Read: | gray1, gray2, gray4, gray8, gray16, gray_alpha_8, gray_alpha_16, rgb8, rgb16, rgba8, rgba16 |
---|---|
Write: | gray1, gray2, gray4, gray8, gray16, gray_alpha_8, gray_alpha_16, rgb8, rgb16, rgba8, rgba16 |
For reading gray_alpha images the user has to compile application with BOOST_GIL_IO_ENABLE_GRAY_ALPHA
macro defined. This color space is defined in the toolbox by using gray_alpha.hpp
.
PNM
For a general overview of the PNM image file format go to the following PNM_Wiki. No external library is needed for the pnm format.
The extension can read images in both flavours of the formats, ASCII and binary, that is types from P1 through P6; can write only binary formats.
Currently the code is able to read and write the following image types:
Read: | gray1, gray8, rgb8 |
---|---|
Write: | gray1, gray8, rgb8 |
When reading a mono text image the data is read as a gray8 image.
TARGA
For a general overview of the BMP image file format go to the following TARGA_Wiki.
Currently the code is able to read and write the following image types:
Read: | rgb8_image_t and rgba8_image_t |
---|---|
Write: | rgb8_image_t and rgba8_image_t |
The lack of having an indexed image type in gil restricts the current interface to only write out non-indexed images. This is subject to change soon.
TIFF
For a general overview of the TIFF image file format go to the following TIFF_Wiki.
This tiff extension is based on the libtiff, which can be found, TIFF_Lib.
All versions starting from 3.9.x are supported.
The user has to make sure this library is properly installed. I strongly recommend the user to build the library yourself. It could potentially save you a lot of trouble.
TIFF images can virtually encode all kinds of channel sizes representing
various color spaces. Even planar images are possible.
For instance, rbg323
or gray7
. The channels also can have specific
formats, like integer values or floating point values.
For a complete set of options please consult the following websites:
The author of this extension is not claiming all tiff formats are supported. This extension is likely to be a moving target adding new features with each new milestone. Here is an incomplete lists:
- Multi-page TIFF - read only
- Strip TIFF - read and write support
- Tiled TIFF - read and write support with user defined tiled sizes
- Bit images TIFF - fully supported, like
gray1_image_t
(minisblack) - Planar TIFF - fully supported
- Floating-point TIFF - fully supported
- Palette TIFF - supported but no indexed image type is available as of now
This gil extension uses two different test image suites to test read and
write capabilities. See test_image
folder.
It’s advisable to use ImageMagick test viewer to display images.
Extending GIL::IO with new Formats
Extending the gil::io with new formats is meant to be simple and straightforward. Before adding I would recommend to have a look at existing implementations and then trying to follow a couple of guidelines:
- Create the following files for your new xxx format
xxx_read.hpp
- Only includes read codexxx_write.hpp
- Only includes write codexxx_all.hpp
- includes xxx_read.hpp and xxx_write.hpp
Add the code to the
boost::gil::detail
namespaceCreate a tag type for the new format. Like this:
struct xxx_tag : format_tag {};
Create the image_read_info for the new format. It contains all the information that are necessary to read an image. It should be filled and returned by the
get_info
member of the reader class. See below:template<> struct image_read_info< xxx_tag > {};
Create the image_write_info for the new format. It contains all the information that are necessary to write an image:
template<> struct image_write_info< xxx_tag > {};
Use the following reader skeleton as a start:
template< typename Device , typename ConversionPolicy > class reader< Device , xxx_tag , ConversionPolicy > : public reader_base< xxx_tag , ConversionPolicy > { private: typedef typename ConversionPolicy::color_converter_type cc_t; public: reader( Device& device ) : _io_dev( device ) {} reader( Device& device , const cc_t& cc ) : _io_dev( device ) , reader_base< xxx_tag , ConversionPolicy >( cc ) {} image_read_info< xxx_tag > get_info() { // your implementation here } template< typename View > void apply( const View& dst_view ) { // your implementation here } };
The writer skeleton:
template< typename Device > class writer< Device , xxx_tag > { public: writer( Device & file ) : out(file) {} template<typename View> void apply( const View& view ) { // your implementation here } template<typename View> void apply( const View& view , const image_write_info< xxx_tag >& info ) { // your implementation here } };
Running gil::io tests
gil::io comes with a large suite of test cases which reads and writes various file formats. It uses some test image suites which can be found online or which can be demanded from me by sending me an email.
There are some test images created by me in the test folder.
To enable unit tests which make use of them set the following compiler options
BOOST_GIL_IO_TEST_ALLOW_READING_IMAGES
and
BOOST_GIL_IO_TEST_ALLOW_WRITING_IMAGES
.
The following list provides all links to the image suites the compiler symbol to enable the tests:
BMP: | BMP_TEST_FILES – BOOST_GIL_IO_USE_BMP_TEST_SUITE_IMAGES |
---|---|
PNG: | PNG_TEST_FILES – BOOST_GIL_IO_USE_PNG_TEST_SUITE_IMAGES |
PNM: | request files from me – BOOST_GIL_IO_USE_PNM_TEST_SUITE_IMAGES |
TIFF: | TIFF_LIB_TIFF_TEST_FILES – BOOST_GIL_IO_USE_TIFF_LIBTIFF_TEST_SUITE_IMAGES |
TIFF: | TIFF_GRAPHICSMAGICK_TEST_FILES – BOOST_GIL_IO_USE_TIFF_GRAPHICSMAGICK_TEST_SUITE_IMAGES |