pixman

mirror of https://salsa.debian.org/xorg-team/lib/pixman synced 2025-09-01 10:35:16 +00:00

Author	SHA1	Message	Date
Maarten Lankhorst	d6b69d4f63	update symbols file and addd lintian override for hidden symbol	2013-01-08 17:10:12 +01:00
Maarten Lankhorst	0f8c56fe52	new upstream release	2013-01-08 16:12:25 +01:00
Maarten Lankhorst	818af795d4	pixman 0.28.2 release -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) iEYEABECAAYFAlDF0BkACgkQmxfmIW/3waiEegCcCVDzXL2gGouDGCBqJVOmzUcv ZnMAoI50IhP5KXKKEEx2dJlfFkzKVo5N =J62R -----END PGP SIGNATURE----- Merge tag 'pixman-0.28.2' into debian-experimental pixman 0.28.2 release	2013-01-08 16:10:57 +01:00
Søren Sandmann Pedersen	35cc965514	pixman-filter.c: Cope with NULL returns from malloc() v2: Don't return a pointer to uninitialized memory when the allocation of horz and vert fails, but allocation of params doesn't.	2013-01-06 17:38:23 -05:00
Søren Sandmann Pedersen	58526cfc72	Handle solid images in the noop iterator The noop src iterator already has code to handle solid images, but that code never actually runs currently because it is not possible for an image to have both a format code of PIXMAN_solid and a flag of FAST_PATH_BITS_IMAGE. If these two were to be set at the same time, the fast_composite_tiled_repeat() fast path would trigger for solid images (because it triggers for PIXMAN_any formats, which includes PIXMAN_solid), but for solid images we can usually do better than that fast path. So this patch removes _pixman_solid_fill_iter_init() and instead handles such images (along with repeating 1x1 bits images without an alpha map) in pixman-noop.c. When a 1x1R image is involved in the general composite path, before this patch, it would hit this code in repeat() in pixman-inlines.h: while (c >= size) c -= size; while (c < 0) c += size; and those loops could run for a huge number of iteratons (proportional to the composite width). For such cases, the performance improvement is really big: ./test/lowlevel-blt-bench -n add_n_8888: Before: add_n_8888 = L1: 3.86 L2: 3.78 M: 1.40 ( 0.06%) HT: 1.43 VT: 1.41 R: 1.41 RT: 1.38 ( 19Kops/s) After: add_n_8888 = L1:1236.86 L2:2468.49 M:1097.88 ( 49.04%) HT:476.49 VT:429.05 R:417.04 RT:155.12 ( 817Kops/s)	2013-01-06 17:30:12 -05:00
Marko Lindqvist	480dd38fd1	Fix build with automake-1.13 Automake-1.13 has removed long obsolete AM_CONFIG_HEADER macro ( http://lists.gnu.org/archive/html/automake/2012-12/msg00038.html ) and autoreconf errors out upon seeing it. Attached patch replaces obsolete AM_CONFIG_HEADER with now proper AC_CONFIG_HEADERS.	2013-01-04 01:54:10 +02:00
Siarhei Siamashka	1abde88ae6	Use more appropriate types and remove a magic constant	2013-01-04 01:27:06 +02:00
Siarhei Siamashka	c1fd5a4243	Define SIZE_MAX if it is not provided by the standard C headers C++ compilers do not define SIZE_MAX. It is also not available if the code is compiled by some C compilers: http://lists.freedesktop.org/archives/pixman/2012-August/002196.html	2013-01-04 01:26:55 +02:00
Siarhei Siamashka	66c4292822	Rename 'xor' variable to 'filler' (because 'xor' is a C++ keyword)	2012-12-20 03:14:21 +02:00
Søren Sandmann Pedersen	4dfda2adfe	float-combiner.c: Change tests for x == 0.0 tests to - FLT_MIN < x < FLT_MIN pixman-float-combiner.c currently uses checks like these: if (x == 0.0f) ... else ... / x; to prevent division by 0. In theory this is correct: a division-by-zero exception is only supposed to happen when the floating point numerator is exactly equal to a positive or negative zero. However, in practice, the combination of x87 and gcc optimizations causes issues. The x87 registers are 80 bits wide, which means the initial test: if (x == 0.0f) may be false when x is an 80 bit floating point number, but when x is rounded to a 32 bit single precision number, it becomes equal to 0.0. In principle, gcc should compensate for this quirk of x87, and there are some options such as -ffloat-store, -fexcess-precision=standard, and -std=c99 that will make it do so, but these all have a performance cost. It is also possible to set the FPU to a mode that makes it do all computation with single or double precision, but that would require pixman to save the existing mode before doing anything with floating point and restore it afterwards. Instead, this patch side-steps the issue by replacing exact checks for equality with zero with a new macro that checkes whether the value is between -FLT_MIN and FLT_MIN. There is extensive reading material about this issue linked off the infamous gcc bug 323: http://gcc.gnu.org/bugzilla/show_bug.cgi?id=323	2012-12-19 13:49:32 -05:00
Siarhei Siamashka	2734071d7b	ARM: make use of UQADD8 instruction even in generic C code paths ARMv6 has UQADD8 instruction, which implements unsigned saturated addition for 8-bit values packed in 32-bit registers. It is very useful for UN8x4_ADD_UN8x4, UN8_rb_ADD_UN8_rb and ADD_UN8 macros (which would otherwise need a lot of arithmetic operations to simulate this operation). Since most of the major ARM linux distros are built for ARMv7, we are much less dependent on runtime CPU detection and can get practical benefits from conditional compilation here for a lot of users. The results of cairo-perf-trace benchmark on ARM Cortex-A15 with pixman compiled by gcc 4.7.2 and PIXMAN_DISABLE set to "arm-simd arm-neon": Speedups ======== image firefox-talos-gfx (29938.22 0.12%) -> (27814.76 0.51%) : 1.08x speedup image firefox-asteroids (23241.11 0.07%) -> (21795.19 0.07%) : 1.07x speedup image firefox-canvas-alpha (174519.85 0.08%) -> (164788.64 0.20%) : 1.06x speedup image poppler (9464.46 1.61%) -> (8991.53 0.14%) : 1.05x speedup	2012-12-18 20:49:58 +02:00
Siarhei Siamashka	f9a41703b2	Faster conversion from a8r8g8b8 to r5g6b5 in C code This change reduces 3 shifts, 3 ANDs and 2 ORs (total 8 arithmetic operations) to 3 shifts, 2 ANDs and 2 ORs (total 7 arithmetic operations). We get garbage in the high 16 bits of the result, which might need to be cleared when casting to uint16_t (it would bring us back to total 8 arithmetic operations). However in the case if the result of a8r8g8b8->r5g6b5 conversion is immediately stored to memory, no extra instructions for clearing these garbage bits are needed. This allows the a8r8g8b8->r5g6b5 conversion code to be compiled into 4 instructions for ARM instead of 5 (assuming a good optimizing compiler), which has no pipeline stalls on ARM11 as an additional bonus. The change in benchmark results for 'lowlevel-blt-bench src_8888_0565' with PIXMAN_DISABLE="arm-simd arm-neon mips-dspr2 mmx sse2" and pixman compiled by gcc-4.7.2: MIPS 74K 480MHz : 40.44 MPix/s -> 40.13 MPix/s ARM11 700MHz : 50.28 MPix/s -> 62.85 MPix/s ARM Cortex-A8 1000MHz : 124.38 MPix/s -> 141.85 MPix/s ARM Cortex-A15 1700MHz : 281.07 MPix/s -> 303.29 MPix/s Intel Core i7 2800MHz : 515.92 MPix/s -> 531.16 MPix/s The same trick was used in xomap (X server for Nokia N800/N810): http://repository.maemo.org/pool/diablo/free/x/xorg-server/ xorg-server_1.3.99.0~git20070321-0osso20083801.tar.gz	2012-12-18 20:45:57 +02:00
Siarhei Siamashka	3922e90c40	Change CONVERT_XXXX_TO_YYYY macros into inline functions It is easier and safer to modify their code in the case if the calculations need some temporary variables. And the temporary variables will be needed soon.	2012-12-18 20:45:47 +02:00
Siarhei Siamashka	e4519360c1	test: add "src_0565_8888" to lowlevel-blt-bench	2012-12-18 20:43:51 +02:00
Søren Sandmann Pedersen	6a6c8c51ed	pixman_composite_trapezoids(): Check for NULL return from create_bits() A check is needed that the creation of the temporary image in pixman_composite_trapezoids() succeeds. Fixes crash in stress-test -s 0x313c on my system.	2012-12-13 16:13:11 -05:00
Søren Sandmann Pedersen	c2cb303d33	pixman_composite_trapezoids: Return early if mask_format is not of TYPE_ALPHA stress-test -s 0x17ee crashes because pixman_composite_trapezoids() is given a mask_format of PIXMAN_c8, which causes it to create a temporary image with that format but without a palette. This causes crashes later. The only mask_format that we actually support are those of TYPE_ALPHA, so this patch add a return_if_fail() to ensure this. Similarly, although currently it won't crash if given an invalid format, alpha-only formats have always been the only thing that made sense for the pixman_rasterize_edges() functions, so add a return_if_fail() ensuring that the destination format is of type PIXMAN_TYPE_ALPHA.	2012-12-13 16:10:41 -05:00
Søren Sandmann Pedersen	1f0c02811e	Add testing of trapezoids to stress-test The entry points add_trapezoids(), rasterize_trapezoid() and composite_trapezoid() are exercised with random trapezoids. This uncovers crashes with stress-test seeds 0x17ee and 0x313c.	2012-12-13 15:59:18 -05:00
Søren Sandmann Pedersen	526dc06e56	demos/radial-test: Add checkerboard to display the alpha channel	2012-12-11 09:05:58 -05:00
Søren Sandmann Pedersen	6402b2aa0c	demos/conical-test: Use the draw_checkerboard() utility function Instead of having its own copy.	2012-12-11 09:05:58 -05:00
Søren Sandmann Pedersen	e382e52d67	test/utils.[ch]: Add utility function to draw a checkerboard This is useful in demo programs to display the alpha channel.	2012-12-11 09:05:58 -05:00
Søren Sandmann Pedersen	b0a6504122	radial: When comparing t to mindr, use >= rather than > Radial gradients are conceptually rendered as a sequence of circles generated by linearly extrapolating from the two circles given by the gradient specification. Any circles in that sequence that would end up with a negative radius are not drawn, a condition that is enforced by checking that t * dr is bigger than mindr: if (t * dr > mindr) However, it is legitimate for a circle to have radius exactly 0, so the test should use >= rather than >. This gets rid of the dots in demos/radial-test except for when the c2 circle has radius 0 and a repeat mode of either NONE or NORMAL. Both those dots correspond to a t value of 1.0, which is outside the defined interval of [0.0, 1.0) and therefore subject to the repeat algorithm. As a result, in the NONE case, a value of 1.0 turns into transparent black. In the NORMAL case, 1.0 wraps around and becomes 0.0 which is red, unlike 0.99 which is blue. Cc: ranma42@gmail.com	2012-12-11 09:05:38 -05:00
Søren Sandmann Pedersen	54aca22058	demos/radial-test: Add zero-radius circles to demonstrate rendering bugs Add two new gradient columns, one where the start circle is has radius 0 and one where the end circle has radius 0. All the new gradients except for one are rendered with a bright dot in the middle. In most but not all cases this is incorrect. Cc: ranma42@gmail.com	2012-12-11 08:20:45 -05:00
Siarhei Siamashka	fdab3c1b6c	test: Workaround unaligned MOVDQA bug (http://gcc.gnu.org/PR55614 ) Just use SSE2 intrinsics to do unaligned memory accesses as a workaround for this gcc bug related to vector extensions.	2012-12-10 20:05:15 +02:00
Siarhei Siamashka	2bc59006d7	Improve performance of combine_over_u The generic C over_u combiner can be a lot faster with the addition of special shortcuts for 0xFF and 0x00 alpha/mask values. This is already implemented in C and SSE2 fast paths. Profiling the run of cairo-perf-trace benchmarks with PIXMAN_DISABLE environment variable set to "fast mmx sse2" on Intel Core i7: === before === 37.32% cairo-perf-trac libpixman-1.so.0.29.1 [.] combine_over_u 21.37% cairo-perf-trac libpixman-1.so.0.29.1 [.] bits_image_fetch_bilinear_no_repeat_8888 13.51% cairo-perf-trac libpixman-1.so.0.29.1 [.] bits_image_fetch_bilinear_affine_none_a8r8g8b8 2.96% cairo-perf-trac libpixman-1.so.0.29.1 [.] radial_compute_color 2.74% cairo-perf-trac libpixman-1.so.0.29.1 [.] fetch_scanline_a8 2.71% cairo-perf-trac libpixman-1.so.0.29.1 [.] fetch_scanline_x8r8g8b8 2.17% cairo-perf-trac libpixman-1.so.0.29.1 [.] _pixman_gradient_walker_pixel 1.86% cairo-perf-trac libcairo.so.2.11200.0 [.] _cairo_tor_scan_converter_generate 1.57% cairo-perf-trac libpixman-1.so.0.29.1 [.] bits_image_fetch_bilinear_affine_pad_a8r8g8b8 0.97% cairo-perf-trac libpixman-1.so.0.29.1 [.] combine_in_reverse_u 0.96% cairo-perf-trac libpixman-1.so.0.29.1 [.] combine_over_ca === after === 28.79% cairo-perf-trac libpixman-1.so.0.29.1 [.] bits_image_fetch_bilinear_no_repeat_8888 18.44% cairo-perf-trac libpixman-1.so.0.29.1 [.] bits_image_fetch_bilinear_affine_none_a8r8g8b8 15.54% cairo-perf-trac libpixman-1.so.0.29.1 [.] combine_over_u 3.94% cairo-perf-trac libpixman-1.so.0.29.1 [.] radial_compute_color 3.69% cairo-perf-trac libpixman-1.so.0.29.1 [.] fetch_scanline_a8 3.69% cairo-perf-trac libpixman-1.so.0.29.1 [.] fetch_scanline_x8r8g8b8 2.94% cairo-perf-trac libpixman-1.so.0.29.1 [.] _pixman_gradient_walker_pixel 2.52% cairo-perf-trac libcairo.so.2.11200.0 [.] _cairo_tor_scan_converter_generate 2.08% cairo-perf-trac libpixman-1.so.0.29.1 [.] bits_image_fetch_bilinear_affine_pad_a8r8g8b8 1.31% cairo-perf-trac libpixman-1.so.0.29.1 [.] combine_in_reverse_u 1.29% cairo-perf-trac libpixman-1.so.0.29.1 [.] combine_over_ca	2012-12-10 20:02:08 +02:00
Søren Sandmann Pedersen	a5e5179b56	Pre-release version bump to 0.28.2	2012-12-10 06:46:36 -05:00
Benjamin Gilbert	6e270a7968	Fix thread safety on mingw-w64 and clang After finding a working TLS storage class specifier, configure was continuing to test other candidates. This caused it to prefer __declspec(thread) over __thread. However, __declspec(thread) is ignored with a warning by mingw-w64 [1] and silently ignored by clang [2]. The resulting binary behaved as if PIXMAN_NO_TLS was defined. Bug introduced by `a069da6c`. [1] https://bugs.freedesktop.org/show_bug.cgi?id=57591 [2] http://lists.freedesktop.org/archives/pixman/2012-October/002320.html	2012-12-10 06:46:36 -05:00
Stefan Weil	d91f550b2a	Always use xmmintrin.h for 64 bit Windows MinGW-w64 uses the GNU compiler and does not define _MSC_VER. Nevertheless, it provides xmmintrin.h and must be handled here like the MS compiler. Otherwise compilation fails due to conflicting declarations. Signed-off-by: Stefan Weil <sw@weilnetz.de>	2012-12-10 06:46:36 -05:00
Joshua Root	2092aa0d92	Fix undeclared variable use and sysctlbyname error handling on ppc Fixes bug 56889.	2012-12-10 06:46:36 -05:00
Søren Sandmann Pedersen	9029026edd	Post-release version bump to 0.28.1	2012-12-10 06:46:36 -05:00
Søren Sandmann Pedersen	8ca4e14472	Add fast paths for separable convolution Similar to the fast paths for general affine access, add some fast paths for the separable filter for all combinations of formats x8r8g8b8, a8r8g8b8, r5g6b5, a8 with the four repeat modes. It is easy to see the speedup in the demos/scale program.	2012-12-08 12:38:58 -05:00
Søren Sandmann Pedersen	4f18ba30ce	Add demo program for conical gradients This new test is derived from radial-test.c and displays conical gradients at various angles. It also demonstrates how PIXMAN_REPEAT_NORMAL is supposed to work when used with a gradient specification where the first stop is not a 0.0: In this case the gradient is supposed to have a smooth transition from the last stop back to the first stop with no sharp transitions. It also shows that the repeat mode is not ignored for conical gradients as one might be tempted to think.	2012-12-08 10:50:51 -05:00
Søren Sandmann Pedersen	3a98787bdd	Add demos/zone_plate.png The zone plate image is a useful test case for image scalers because it contains all representable frequencies, so any imperfection in resampling filters will show up as Moire patterns. This version is symmetric around the midpoint of the image, so since rotating it is supposed to be a noop, it can also be used to verify that the resampling filters don't shift the image. V2: Run the file through OptiPNG to cut the size in half, as suggested by Siarhei.	2012-12-08 10:50:51 -05:00
Søren Sandmann Pedersen	97491ed26c	demos: Add new demo program, "scale" This program allows interactively scaling and rotating images with using various filters and repeat modes. It uses pixman_filter_create_separate_convolution() to generate the filters.	2012-12-08 10:50:51 -05:00
Søren Sandmann Pedersen	7f5bb22d17	demos/gtk-utils.[ch]: Add pixman_image_from_file() This function uses GdkPixbuf to load various common formats such as .png and .jpg into a pixman image.	2012-12-08 10:50:51 -05:00
Søren Sandmann Pedersen	6915f3e24f	Add new pixman_filter_create_separable_convolution() API This new API is a helper function to create filter parameters suitable for use with PIXMAN_FILTER_SEPARABLE_CONVOLUTION. For each dimension, given a scale factor, reconstruction and sample filter kernels, and a subsampling resolution, this function will compute a convolution of the two kernels scaled appropriately, then sample that convolution and return the resulting vectors in a form suitable for being used as parameters to PIXMAN_FILTER_SEPARABLE_CONVOLUTION. The filter kernels offered are the following: - IMPULSE: Dirac delta function, ie., point sampling - BOX: Box filter - LINEAR: Linear filter, aka. "Tent" filter - CUBIC: Cubic filter, currently Mitchell-Netravali - GAUSSIAN: Gaussian function, sigma=1, support=3*sigma - LANCZOS2: Two-lobed Lanczos filter - LANCZOS3: Three-lobed Lanczos filter - LANCZOS3_STRETCHED: Three-lobed Lanczos filter, stretched by 4/3.0. This is the "Nice" filter from Dirty Pixels by Jim Blinn. The intended way to use this function is to extract scaling factors from the transformation and then pass those to this function to get a filter suitable for compositing with that transformation. The filter kernels can be chosen according to quality and performance tradeoffs. To get equivalent quality to GdkPixbuf for downscalings, use BOX for both reconstruction and sampling. For upscalings, use LINEAR for reconstruction and IMPULSE for sampling (though note that for upscaling in both X and Y directions, simply using PIXMAN_FILTER_BILINEAR will likely be a better choice).	2012-12-08 10:50:51 -05:00
Søren Sandmann Pedersen	68760d3fe1	rounding.txt: Describe how SEPARABLE_CONVOLUTION filter works Add some notes on how to compute the convolution matrices to be used with the SEPARABLE_CONVOLUTION filter.	2012-12-08 10:50:51 -05:00
Søren Sandmann Pedersen	6fd480b17c	Add new filter PIXMAN_FILTER_SEPARABLE_CONVOLUTION This filter is a new way to use a convolution matrix for filtering. In contrast to the existing CONVOLUTION filter, this new variant is different in two respects: - It is subsampled: Instead of just one convolution matrix, this filter chooses between a number of matrices based on the subpixel sample location, allowing the convolution kernel to be sampled at a higher resolution. - It is separable: Each matrix is specified as the tensor product of two vectors. This has the advantages that many fewer values have to be stored, and that the filtering can be done separately in the x and y dimensions (although the initial implementation doesn't actually do that). The motivation for this new filter is to improve image downsampling quality. Currently, the best pixman can do is the regular convolution filter which is limited to coarsely sampled convolution kernels. With this new feature, any separable filter can be used at any desired resolution.	2012-12-08 10:50:51 -05:00
Benjamin Gilbert	7e39861da3	Fix thread safety on mingw-w64 and clang After finding a working TLS storage class specifier, configure was continuing to test other candidates. This caused it to prefer __declspec(thread) over __thread. However, __declspec(thread) is ignored with a warning by mingw-w64 [1] and silently ignored by clang [2]. The resulting binary behaved as if PIXMAN_NO_TLS was defined. Bug introduced by `a069da6c`. [1] https://bugs.freedesktop.org/show_bug.cgi?id=57591 [2] http://lists.freedesktop.org/archives/pixman/2012-October/002320.html	2012-12-08 16:41:10 +02:00
Siarhei Siamashka	ebedd9a2ad	test: Get rid of the obsolete 'prng_rand_N' and 'prng_rand_u32' They are the same as 'prng_rand_n' and 'prng_rand'	2012-12-06 17:20:38 +02:00
Siarhei Siamashka	b31a696263	test: Switch to the new PRNG instead of old LCG Wallclock time for running pixman "make check" (compile time not included): ----------------------------+----------------+-----------------------------+ \| old PRNG (LCG) \| new PRNG (Bob Jenkins) \| Processor type +----------------+------------+----------------+ \| gcc 4.5 \| gcc 4.5 \| gcc 4.7 (simd) \| ----------------------------+----------------+------------+----------------+ quad Intel Core i7 @2.8GHz \| 0m49.494s \| 0m43.722s \| 0m37.560s \| dual ARM Cortex-A15 @1.7GHz \| 5m8.465s \| 4m37.375s \| 3m45.819s \| IBM Cell PPU @3.2GHz \| 23m0.821s \| 20m38.316s \| 16m37.513s \| ----------------------------+----------------+------------+----------------+ But some tests got a particularly large boost. For example benchmarking and profiling blitters-test on Core i7: === before === $ time ./blitters-test real 0m10.907s user 0m55.650s sys 0m0.000s 70.45% blitters-test blitters-test [.] create_random_image 15.81% blitters-test blitters-test [.] compute_crc32_for_image_internal 2.26% blitters-test blitters-test [.] _pixman_implementation_lookup_composite 1.07% blitters-test libc-2.15.so [.] _int_free 0.89% blitters-test libc-2.15.so [.] malloc_consolidate 0.87% blitters-test libc-2.15.so [.] _int_malloc 0.75% blitters-test blitters-test [.] combine_conjoint_general_u 0.61% blitters-test blitters-test [.] combine_disjoint_general_u 0.40% blitters-test blitters-test [.] test_composite 0.31% blitters-test libc-2.15.so [.] _int_memalign 0.31% blitters-test blitters-test [.] _pixman_bits_image_setup_accessors 0.28% blitters-test libc-2.15.so [.] malloc === after === $ time ./blitters-test real 0m3.655s user 0m20.550s sys 0m0.000s 41.77% blitters-test.n blitters-test.new [.] compute_crc32_for_image_internal 15.77% blitters-test.n blitters-test.new [.] prng_randmemset_r 6.15% blitters-test.n blitters-test.new [.] _pixman_implementation_lookup_composite 3.09% blitters-test.n libc-2.15.so [.] _int_free 2.68% blitters-test.n libc-2.15.so [.] malloc_consolidate 2.39% blitters-test.n libc-2.15.so [.] _int_malloc 2.27% blitters-test.n blitters-test.new [.] create_random_image 2.22% blitters-test.n blitters-test.new [.] combine_conjoint_general_u 1.52% blitters-test.n blitters-test.new [.] combine_disjoint_general_u 1.40% blitters-test.n blitters-test.new [.] test_composite 1.02% blitters-test.n blitters-test.new [.] prng_srand_r 1.00% blitters-test.n blitters-test.new [.] _pixman_image_validate 0.96% blitters-test.n blitters-test.new [.] _pixman_bits_image_setup_accessors 0.90% blitters-test.n libc-2.15.so [.] malloc	2012-12-06 17:20:35 +02:00
Siarhei Siamashka	309e66f047	test: Search/replace 'lcg_' -> 'prng_' The 'lcg' prefix is going to be misleading if we replace PRNG algorithm.	2012-12-06 17:20:31 +02:00
Siarhei Siamashka	d6545a2fc6	test: Added a better PRNG (pseudorandom number generator) This adds a fast SIMD-optimized variant of a small noncryptographic PRNG originally developed by Bob Jenkins: http://www.burtleburtle.net/bob/rand/smallprng.html The generated pseudorandom data is good enough to pass "Big Crush" tests from TestU01 (http://en.wikipedia.org/wiki/TestU01). SIMD code uses http://gcc.gnu.org/onlinedocs/gcc/Vector-Extensions.html which is a GCC specific extension. There is also a slower alternative code path, which should work with any C compiler. The performance of filling buffer with random data: Intel Core i7 @2.8GHz (SSE2) : ~5.9 GB/s ARM Cortex-A15 @1.7GHz (NEON) : ~2.2 GB/s IBM Cell PPU @3.2GHz (Altivec) : ~1.7 GB/s	2012-12-06 17:20:27 +02:00
Siarhei Siamashka	41f98a07fc	test: Change is_little_endian() into inline function Also dropped redundant volatile keyword because any object can be accessed via char* pointer without breaking aliasing rules. The compilers are able to optimize this function to either constant 0 or 1.	2012-12-06 17:20:23 +02:00
Cyril Brulebois	97a117ef1d	New upstream release.	2012-11-27 14:00:27 +01:00
Cyril Brulebois	e33dbc6c69	Merge branch 'upstream-experimental' into debian-experimental	2012-11-27 13:59:51 +01:00
Søren Sandmann Pedersen	978bab253d	Add text file rounding.txt describing how rounding works It is not entirely obvious how pixman gets from "location in the source image" to "pixel value stored in the destination". This file describes how the filters work, and in particular how positions are rounded to samples.	2012-11-22 01:16:54 -05:00
Søren Sandmann Pedersen	74319e9d39	Convolution filter: round color values instead of truncating The pixel computed by the convolution filter should be rounded off, not truncated. As a simple example consider a convolution matrix consisting of five times 0x3333. If all five all five input pixels are 0xff, then the result of truncating will be (5 * 0x3333 * 255) >> 16 = 254 But the real value of the computation is (5 * 0x3333 / 65536.0) * 254 = 254.9961, so the error is almost 1. If the user isn't very careful about normalizing the convolution kernel so that it sums to one in fixed point, such error might cause solid images to change color, or opaque images to become translucent. The fix is simply to round instead of truncate.	2012-11-22 01:06:29 -05:00
Søren Sandmann Pedersen	f0816ddaf4	Round fixed-point multiplication After two fixed-point numbers are multiplied, the result is shifted into place, but up until now pixman has simply discarded the low-order bits instead of rounding to the closest number. Fix that by adding 0x8000 (or 0x2 in one place) before shifting and update the test checksums to match.	2012-11-20 03:23:51 -05:00
Stefan Weil	44dd746bb6	test: Fix compiler warnings caused by unused code Signed-off-by: Stefan Weil <sw@weilnetz.de>	2012-11-14 18:02:14 -05:00
Stefan Weil	5f96022d3b	pixman: Use uintptr_t in type casts from pointer to integral value These modifications fix lots of compiler warnings for systems where sizeof(unsigned long) != sizeof(void *). This is especially true for MinGW-w64 (64 bit Windows). Signed-off-by: Stefan Weil <sw@weilnetz.de>	2012-11-14 18:02:14 -05:00

... 3 4 5 6 7 ...

2505 Commits