xz-archive

mirror of https://git.tukaani.org/xz.git synced 2024-04-04 12:36:23 +02:00

Author	SHA1	Message	Date
Jia Tan	61f8ec804a	liblzma: Refactor lzma_mf_is_supported() to use a switch-statement.	2022-07-25 18:30:10 +03:00
Lasse Collin	9595a3119b	liblzma: Add optional autodetection of LZMA end marker. Turns out that this is needed for .lzma files as the spec in LZMA SDK says that end marker may be present even if the size is stored in the header. Such files are rare but exist in the real world. The code in liblzma is so old that the spec didn't exist in LZMA SDK back then and I had understood that such files weren't possible (the lzma tool in LZMA SDK didn't create such files). This modifies the internal API so that LZMA decoder can be told if EOPM is allowed even when the uncompressed size is known. It's allowed with .lzma and not with other uses. Thanks to Karl Beldan for reporting the problem.	2022-07-13 22:24:07 +03:00
Lasse Collin	625f4c7c99	liblzma: Add rough support for output-size-limited encoding in LZMA1. With this it is possible to encode LZMA1 data without EOPM so that the encoder will encode as much input as it can without exceeding the specified output size limit. The resulting LZMA1 stream will be a normal LZMA1 stream without EOPM. The actual uncompressed size will be available to the caller via the uncomp_size pointer. One missing thing is that the LZMA layer doesn't inform the LZ layer when the encoding is finished and thus the LZ may read more input when it won't be used. However, this doesn't matter if encoding is done with a single call (which is the planned use case for now). For proper multi-call encoding this should be improved. This commit only adds the functionality for internal use. Nothing uses it yet.	2021-01-14 18:58:13 +02:00
Lasse Collin	7136f1735c	Rename unaligned_read32ne to read32ne, and similarly for the others.	2019-12-31 00:47:49 +02:00
Lasse Collin	8ce679125d	liblzma: Fix a buggy comment.	2019-06-25 23:15:21 +03:00
Lasse Collin	608517b9b7	liblzma: Remove incorrect uses of lzma_attribute((__unused__)). Caught by clang -Wused-but-marked-unused.	2019-06-24 22:50:36 +03:00
Lasse Collin	c460f6defe	liblzma: Fix one more unaligned read to use unaligned_read16ne().	2019-06-02 00:50:59 +03:00
Lasse Collin	2a22de439e	liblzma: Avoid memcpy(NULL, foo, 0) because it is undefined behavior. I should have always known this but I didn't. Here is an example as a reminder to myself: int mycopy(void dest, void src, size_t n) { memcpy(dest, src, n); return dest == NULL; } In the example, a compiler may assume that dest != NULL because passing NULL to memcpy() would be undefined behavior. Testing with GCC 8.2.1, mycopy(NULL, NULL, 0) returns 1 with -O0 and -O1. With -O2 the return value is 0 because the compiler infers that dest cannot be NULL because it was already used with memcpy() and thus the test for NULL gets optimized out. In liblzma, if a null-pointer was passed to memcpy(), there were no checks for NULL after the memcpy() call, so I cautiously suspect that it shouldn't have caused bad behavior in practice, but it's hard to be sure, and the problematic cases had to be fixed anyway. Thanks to Jeffrey Walton.	2019-05-13 20:05:17 +03:00
Antoine Cœur	2fb0ddaa55	spelling	2019-05-11 20:52:37 +03:00
Lasse Collin	d4a0462abe	liblzma: Avoid multiple definitions of lzma_coder structures. Only one definition was visible in a translation unit. It avoided a few casts and temp variables but seems that this hack doesn't work with link-time optimizations in compilers as it's not C99/C11 compliant. Fixes: http://www.mail-archive.com/xz-devel@tukaani.org/msg00279.html	2016-11-21 20:24:50 +02:00
Lasse Collin	14115f84a3	liblzma: Make Valgrind happier with optimized (gcc -O2) liblzma. When optimizing, GCC can reorder code so that an uninitialized value gets used in a comparison, which makes Valgrind unhappy. It doesn't happen when compiled with -O0, which I tend to use when running Valgrind. Thanks to Rich Prohaska. I remember this being mentioned long ago by someone else but nothing was done back then.	2015-11-04 23:14:00 +02:00
Lasse Collin	f243f5f44c	liblzma: Silence more uint32_t vs. size_t warnings.	2015-03-07 22:01:00 +02:00
Lasse Collin	fec88d41e6	liblzma: Silence harmless Valgrind errors. Thanks to Torsten Rupp for reporting this. I had forgotten to run Valgrind before the 5.2.0 release.	2015-01-26 20:39:28 +02:00
Lasse Collin	71e1437ab5	liblzma: Use lzma_memcmplen() in the BT3 match finder. I had missed this when writing the commit `5db75054e9`. Thanks to Jun I Jin.	2014-08-04 19:25:58 +03:00
Lasse Collin	5db75054e9	liblzma: Use lzma_memcmplen() in the match finders. This doesn't change the match finder output.	2014-07-25 21:15:07 +03:00
Lasse Collin	da1718f266	liblzma: Use lzma_alloc_zero() in LZ encoder initialization. This avoids a memzero() call for a newly-allocated memory, which can be expensive when encoding small streams with an over-sized dictionary. To avoid using lzma_alloc_zero() for memory that doesn't need to be zeroed, lzma_mf.son is now allocated separately, which requires handling it separately in normalize() too. Thanks to Vincenzo Innocente for reporting the problem.	2014-05-25 21:45:56 +03:00
Lasse Collin	3778db1be5	liblzma: Make the use of lzma_allocator const-correct. There is a tiny risk of causing breakage: If an application assigns lzma_stream.allocator to a non-const pointer, such code won't compile anymore. I don't know why anyone would do such a thing though, so in practice this shouldn't cause trouble. Thanks to Jan Kratochvil for the patch.	2012-07-17 18:19:59 +03:00
Lasse Collin	324cde7a86	liblzma: Remove unneeded semicolon.	2011-06-16 12:15:29 +03:00
Lasse Collin	4c6e146df9	Add underscores to attributes (__attribute((__foo__))).	2011-05-17 11:54:38 +03:00
Lasse Collin	77fe5954cd	liblzma: Adjust default depth calculation for HC3 and HC4. It was 8 + nice_len / 4, now it is 4 + nice_len / 4. This allows faster settings at lower nice_len values, even though it seems that I won't use automatic depth calcuation with HC3 and HC4 in the presets.	2010-09-03 12:28:41 +03:00
Lasse Collin	b5fbab6123	Silence a bogus Valgrind warning. When using -O2 with GCC, it liked to swap two comparisons in one "if" statement. It's otherwise fine except that the latter part, which is seemingly never executed, got executed (nothing wrong with that) and then triggered warning in Valgrind about conditional jump depending on uninitialized variable. A few people find this annoying so do things a bit differently to avoid the warning.	2010-06-02 23:09:22 +03:00
Lasse Collin	920a69a8d8	Rename MIN() and MAX() to my_min() and my_max(). This should avoid some minor portability issues.	2010-05-26 10:36:46 +03:00
Lasse Collin	eb7d51a3fa	Collection of language fixes to comments and docs. Thanks to Jonathan Nieder.	2010-02-12 13:16:15 +02:00
Lasse Collin	e330fb7e6b	Fix wrong indentation caused by incorrect settings in the text editor.	2009-11-15 12:54:45 +02:00
Lasse Collin	418d64a32e	Fix a design error in liblzma API. Originally the idea was that using LZMA_FULL_FLUSH with Stream encoder would read the filter chain from the same array that was used to intialize the Stream encoder. Since most apps wouldn't use LZMA_FULL_FLUSH, most apps wouldn't need to keep the filter chain available after initializing the Stream encoder. However, due to my mistake, it actually required keeping the array always available. Since setting the new filter chain via the array used at initialization time is not a nice way to do it for a couple of reasons, this commit ditches it and introduces lzma_filters_update(). This new function replaces also the "persistent" flag used by LZMA2 (and to-be-designed Subblock filter), which was also an ugly thing to do. Thanks to Alexey Tourbin for reminding me about the problem that Stream encoder used to require keeping the filter chain allocated.	2009-11-14 18:59:19 +02:00
Lasse Collin	ebfb2c5e1f	Use a tuklib module for integer handling. This replaces bswap.h and integer.h. The tuklib module uses <byteswap.h> on GNU, <sys/endian.h> on *BSDs and <sys/byteorder.h> on Solaris, which may contain optimized code like inline assembly.	2009-10-04 22:57:12 +03:00
Lasse Collin	3782b3fee4	Use unaligned access (if possible) on both endiannesses in lz_encoder_hash.h.	2009-10-02 11:28:17 +03:00
Lasse Collin	c5f68b5cc7	Make liblzma produce the same output on both endiannesses. Seems that it is a problem in some cases if the same version of XZ Utils produces different output on different endiannesses, so this commit fixes that problem. The output will still vary between different XZ Utils versions, but I cannot avoid that for now. This commit bloatens the code on big endian systems by 1 KiB, which should be OK since liblzma is bloated already. ;-)	2009-10-02 11:03:26 +03:00
Lasse Collin	4ab7b16b95	A few grammar fixes. Thanks to Christian Weisgerber for pointing out some of these.	2009-09-12 14:07:36 +03:00
Lasse Collin	18a4233a53	Fix a couple of warnings.	2009-09-11 09:25:09 +03:00
Lasse Collin	3ce1916c83	Fix data corruption in LZ/LZMA2 encoder. Thanks to Jonathan Stott for the bug report.	2009-08-16 22:15:13 +03:00
Lasse Collin	f42ee98166	Build system fixes Don't use libtool convenience libraries to avoid recently discovered long-standing subtle but somewhat severe bugs in libtool (at least 1.5.22 and 2.2.6 are affected). It was found when porting XZ Utils to Windows <http://lists.gnu.org/archive/html/libtool/2009-06/msg00070.html> but the problem is significant also e.g. on GNU/Linux. Unless --disable-shared is passed to configure, static library built from a set of convenience libraries will contain PIC objects. That is, while libtool builds non-PIC objects too, only PIC objects will be used from the convenience libraries. On 32-bit x86 (tested on mobile XP2400+), using PIC instead of non-PIC makes the decompressor 10 % slower with the default CFLAGS. So while xz was linked against static liblzma by default, it got the slower PIC objects unless --disable-shared was used. I tend develop and benchmark with --disable-shared due to faster build time, so I hadn't noticed the problem in benchmarks earlier. This commit also adds support for building Windows resources into liblzma and executables.	2009-06-30 17:09:57 +03:00
Lasse Collin	1c9360b7d1	Fix @variables@ to $(variables) in Makefile.am files. Fix the ordering of libgnu.a and LTLIBINTL on the linker command line and added missing LTLIBINTL to tests/Makefile.am.	2009-06-26 14:47:31 +03:00
Lasse Collin	02ddf09bc3	Put the interesting parts of XZ Utils into the public domain. Some minor documentation cleanups were made at the same time.	2009-04-13 11:27:40 +03:00
Lasse Collin	e79c42d854	Fix off-by-one in LZ decoder. Fortunately, this bug had no security risk other than accepting some corrupt files as valid.	2009-04-10 11:17:02 +03:00
Lasse Collin	0e27028d74	Add a separate internal function to initialize the CRC32 table, which is used also by LZ encoder. This was needed because calling lzma_crc32() and ignoring the result is a no-op due to lzma_attr_pure.	2009-02-08 18:24:50 +02:00
Lasse Collin	22a0c6dd94	Modify LZMA_API macro so that it works on Windows with other compilers than MinGW. This may hurt readability of the API headers slightly, but I don't know any better way to do this.	2009-02-02 20:14:03 +02:00
Lasse Collin	f76e39cf93	Added initial support for preset dictionary for raw LZMA1 and LZMA2. It is not supported by the .xz format or the xz command line tool yet.	2009-01-27 18:36:05 +02:00
Lasse Collin	7ed9d943b3	Remove lzma_init() and other init functions from liblzma API. Half of developers were already forgetting to use these functions, which could have caused total breakage in some future liblzma version or even now if --enable-small was used. Now liblzma uses pthread_once() to do the initializations unless it has been built with --disable-threads which make these initializations thread-unsafe. When --enable-small isn't used, liblzma currently gets needlessly linked against libpthread (on systems that have it). While it is stupid for now, liblzma will need threads in future anyway, so this stupidity will be temporary only. When --enable-small is used, different code CRC32 and CRC64 is now used than without --enable-small. This made the resulting binary slightly smaller, but the main reason was to clean it up and to handle the lack of lzma_init_check(). The pkg-config file lzma.pc was renamed to liblzma.pc. I'm not sure if it works correctly and portably for static linking (Libs.private includes -pthread or other operating system specific flags). Hopefully someone complains if it is bad. lzma_rc_prices[] is now included as a precomputed array even with --enable-small. It's just 128 bytes now that it uses uint8_t instead of uint32_t. Smaller array seemed to be at least as fast as the more bloated uint32_t array on x86; hopefully it's not bad on other architectures.	2008-12-31 00:30:49 +02:00
Lasse Collin	17781c2c20	The LZMA2 decoder fix introduced a bug to LZ decoder, which made LZ decoder return too early after dictionary reset. This fixes it.	2008-12-15 14:26:52 +02:00
Lasse Collin	ff7fb2c605	Fix data corruption in LZMA2 decoder.	2008-12-15 10:01:59 +02:00
Lasse Collin	e114502b2b	Oh well, big messy commit again. Some highlights: - Updated to the latest, probably final file format version. - Command line tool reworked to not use threads anymore. Threading will probably go into liblzma anyway. - Memory usage limit is now about 30 % for uncompression and about 90 % for compression. - Progress indicator with --verbose - Simplified --help and full --long-help - Upgraded to the last LGPLv2.1+ getopt_long from gnulib. - Some bug fixes	2008-11-19 20:46:52 +02:00
Lasse Collin	1dcecfb09b	Some API changes, bug fixes, cleanups etc.	2008-09-27 19:09:21 +03:00
Lasse Collin	f147666a5c	Miscellaneous LZ and LZMA encoder cleanups	2008-09-17 22:11:39 +03:00
Lasse Collin	13d68b0698	LZ decoder cleanup	2008-09-13 13:54:00 +03:00
Lasse Collin	13a74b78e3	Renamed constants: - LZMA_VLI_VALUE_MAX -> LZMA_VLI_MAX - LZMA_VLI_VALUE_UNKNOWN -> LZMA_VLI_UNKNOWN - LZMA_HEADER_ERRRO -> LZMA_OPTIONS_ERROR	2008-09-13 12:10:43 +03:00
Lasse Collin	32fe5fa541	Comments	2008-09-06 23:42:50 +03:00
Lasse Collin	fc68165745	Some fixes to LZ encoder.	2008-09-02 11:45:39 +03:00
Lasse Collin	3b34851de1	Sort of garbage collection commit. :-\| Many things are still broken. API has changed a lot and it will still change a little more here and there. The command line tool doesn't have all the required changes to reflect the API changes, so it's easy to get "internal error" or trigger assertions.	2008-08-28 22:53:15 +03:00
Lasse Collin	7d17818cec	Update the code to mostly match the new simpler file format specification. Simplify things by removing most of the support for known uncompressed size in most places. There are some miscellaneous changes here and there too. The API of liblzma has got many changes and still some more will be done soon. While most of the code has been updated, some things are not fixed (the command line tool will choke with invalid filter chain, if nothing else). Subblock filter is somewhat broken for now. It will be updated once the encoded format of the Subblock filter has been decided.	2008-06-18 18:02:10 +03:00

1 2

62 commits