townforge/zstd - zstd - Townforge git

Author	SHA1	Message	Date
Yann Collet	aba88fa996	Merge pull request #2829 from facebook/ZSTD_DECODER_INTERNAL_BUFFER minor : change build macro to ZSTD_DECODER_INTERNAL_BUFFER	2021-10-26 10:48:16 -07:00
Yann Collet	2b2a5c449a	fix minor cast warning	2021-10-26 08:38:17 -07:00
Yann Collet	518f06b281	added minimum for decoder buffer also : introduced macro BOUNDED()	2021-10-26 08:21:31 -07:00
Yann Collet	082d6c6775	separate compression level tables into their own files that's clearer than finding the tables somewhere in the middle of `compress.c`. Also, down the line, it may potentially allows zstd to feature adjusted tables depending on target cpu.	2021-10-25 08:49:54 -07:00
Nick Terrell	1bbb372e3e	[ldm] Fix ZSTD_c_ldmHashRateLog bounds check There is no minimum value check, so the parameter could be negative. Switch to the standard pattern of using `BOUNDCHECK()`. The bug was reported by Dan Carpenter and found by Smatch static checker. https://lore.kernel.org/all/20211008063704.GA5370@kili/	2021-10-08 11:17:40 -07:00
Nick Terrell	14772d97be	Merge pull request #2796 from terrelln/linux-fixes [lib] Make lib compatible with `-Wfall-through` excepting legacy	2021-09-23 16:11:53 -07:00
Nick Terrell	189e87bcbe	[lib] Make lib compatible with `-Wfall-through` excepting legacy Switch to a macro `ZSTD_FALLTHROUGH;` instead of a comment. On supported compilers this uses an attribute, otherwise it becomes a comment. This is necessary to be compatible with clang's `-Wfall-through`, and gcc's `-Wfall-through=2` which don't support comments. Without this the linux build emits a bunch of warnings. Also add a test to CI to ensure that we don't regress.	2021-09-23 10:51:18 -07:00
senhuang42	1d8143c84f	Move block splitter from stack to CCtx	2021-09-23 00:02:31 -04:00
senhuang42	06f42c3bfd	Use new paramSwitch enum for LDM	2021-09-21 14:22:09 -04:00
senhuang42	b5c35d7ea3	Use new paramSwitch enum for LCM, row matchfinder, and block splitter	2021-09-21 14:22:02 -04:00
Yann Collet	70d89e5a12	minor rebalancing of level 13 This new setup is slighly better on `silesia.tar` : Ratio : 3.649 -> 3.655 Speed : 11.9 MB/s -> 12.2 MB/s At the cost of more memory : 24 MB -> 32 MB The new memory budget is a reasonable interpolation between neighboring levels 12 and 14: level 12 : 24 MB level 13 : 32 MB (increased from 24 MB) level 14 : 48 MB Window size remains unaffected (4 MB)	2021-09-01 13:05:10 -07:00
Yann Collet	2de42174bb	make ZSTD_HASHLOG3_MAX private This is an implementation detail, it doesn't belong to public space (zstd.h).	2021-08-20 09:52:42 -07:00
senhuang42	e411040ea1	Add 64 row entry support for lazy	2021-08-04 16:19:12 -04:00
senhuang42	31820e032c	Rebalance clevels for lazy	2021-08-04 16:18:52 -04:00
Sen Huang	5ec7897a26	Fix static analyzer warnings	2021-07-29 09:11:12 -07:00
Binh Vo	dc5b693f1e	Proactively skip huffman compression based on sampling where non-compressibility is suspected	2021-06-30 11:02:47 -04:00
sen	45d707e908	Merge pull request #2715 from senhuang42/sequence_api_3 [RFC] Add internal API for converting ZSTD_Sequence into seqStore	2021-06-24 13:02:11 -04:00
senhuang42	76466dfadf	Add simple API for converting ZSTD_Sequence into seqStore	2021-06-23 12:10:48 -04:00
Nick Terrell	05b6773fbc	[fix] Add missing bounds checks during compression * The block splitter missed a bounds check, so when the buffer is too small it passes an erroneously large size to `ZSTD_entropyCompressSeqStore()`, which can then write the compressed data past the end of the buffer. This is a new regression in v1.5.0 when the block splitter is enabled. It is either enabled explicitly, or implicitly when using the optimal parser and `ZSTD_compress2()` or `ZSTD_compressStream()`. `HUF_writeCTable_wksp()` omits a bounds check when calling `HUF_compressWeights()`. If it is called with `dstCapacity == 0` it will pass an erroneously large size to `HUF_compressWeights()`, which can then write past the end of the buffer. This bug has been present for ages. However, I believe that zstd cannot trigger the bug, because it never calls `HUF_compress*()` with `dstCapacity == 0` because of [this check][1]. Credit to: Oss-Fuzz [1]: `89127e5ee2/lib/compress/zstd_compress_literals.c (L100)`	2021-06-14 11:35:33 -07:00
sen	d5f3568c4b	Merge pull request #2697 from senhuang42/entropy_repeat_fix [bug] Fix entropy repeat mode bug	2021-06-10 16:39:17 +03:00
aqrit	dd4f6aa9e6	Flatten ZSTD_row_getMatchMask (#2681 ) * Flatten ZSTD_row_getMatchMask * Remove the SIMD abstraction layer. * Add big endian support. * Align `hashTags` within `tagRow` to a 16-byte boundary. * Switch SSE2 to use aligned reads. * Optimize scalar path using SWAR. * Optimize neon path for `n == 32` * Work around minor clang issue for NEON (https://bugs.llvm.org/show_bug.cgi?id=49577) * replace memcpy with MEM_readST * silence alignment warnings * fix neon casts * Update zstd_lazy.c * unify simd preprocessor detection (#3) * remove duplicate asserts * tweak rotates * improve endian detection * add cast there is a fun little catch-22 with gcc: result from pmovmskb has to be cast to uint32_t to avoid a zero-extension but must be uint16_t to get gcc to generate a rotate instruction.. * more casts * fix casts better work-around for the (bogus) warning: unary minus on unsigned	2021-06-09 08:50:25 +03:00
Sen Huang	923e5ad3f5	Fix entropy repeat mode bug	2021-06-07 00:32:03 -07:00
senhuang42	939276cd0c	Add ldm and block splitter auto-enable to old api	2021-05-24 13:09:32 -04:00
Yann Collet	02ece5d59f	Merge pull request #2653 from TrianglesPCT/dev Enable SSE2 compression path to work on MSVC	2021-05-17 11:20:50 -07:00
TrianglesPCT	77d54eb3b3	Add files via upload	2021-05-14 16:40:32 -06:00
TrianglesPCT	52f44bb365	Add files via upload msvc	2021-05-14 16:33:07 -06:00
Stephen Kitt	e81d567547	Distinguish static symbols, allow hiding them Even with -fvisibility=hidden added to CFLAGS, any symbol which is given a default visibility attribute ends up exported in the dynamic library. This happens through zstd_internal.h which defines ..._STATIC_LINKING_ONLY before including various header files, and is included for example in lib/common/pool.c. To avoid this, this patch distinguishes static and non-static APIs, by using ZSTDLIB_API only for the latter, and introducing ZSTDLIB_STATIC_API for the former. For now, both are exported, but non-static APIs can be hidden by overriding the definition ZSTDLIB_STATIC_API. lib/Makefile is modified to allow this using make CPPFLAGS_DYNLIB=-DZSTDLIB_STATIC_API=ZSTDLIB_HIDDEN In addition, API declarations are dropped from zstd_compress.c (they aren't needed there). Signed-off-by: Stephen Kitt <steve@sk2.org>	2021-05-14 19:41:59 +02:00
Nick Terrell	03c4111299	[lib] Fix dictionary invalidation logic Call `ZSTD_enforceMaxDist()` before each block with the beginning of the block. This ensures that `lowLimit` is updated to `dictLimit` whenever the ext-dict is out of range, so we can use prefix mode for speed. This can cause non-determinism because prefix mode and ext-dict mode match finders can return different results. It can also hurt speed because ext-dict match finders are slower. The scenario is: 1. Compress large data with a dictionary. 2. The dictionary goes out of bounds, so we invalidate it. 3. However, we still have `lowLimit < dictLimit`, since it is never updated. 4. We will call the ext-dict match finder instead of the prefix one.	2021-05-13 17:05:59 -07:00
sen	9e94b7cac5	Assert no divison by 0, correct superblocks 0 sequences case (#2592 )	2021-05-07 13:26:56 -04:00
sen	698f261b35	[1.5.0] Deprecate some functions (#2582 ) * Add deprecated macro to zstd.h, mark certain functions as deprecated * Remove ZSTD_compress.c dependencies on deprecated functions	2021-05-06 17:59:32 -04:00
Nick Terrell	207e33bb61	Merge pull request #2616 from terrelln/deterministic-dict [lib] Add ZSTD_c_deterministicRefPrefix	2021-05-06 11:09:22 -07:00
Nick Terrell	172b4b6ac4	[lib] Add ZSTD_c_deterministicRefPrefix This flag forces zstd to always load the prefix in ext-dict mode, even if it happens to be contiguous, to force determinism. It also applies to dictionaries that are re-processed. A determinism test case is also added, which fails without `ZSTD_c_deterministicRefPrefix` and passes with it set. Question: Should this be the default behavior? It isn't in this PR.	2021-05-05 18:49:56 -07:00
Nick Terrell	eb7e74ccb7	[tests] Set `DEBUGLEVEL=2` by default This allows us to quickly check for compile errors in debug log messages, which are compiled out when `DEBUGLEVEL < 2`.	2021-05-05 13:29:06 -07:00
Nick Terrell	c2183d7cdf	[lib] Move some ZSTD_CCtx_params off the stack * Take `params` by const reference in `ZSTD_resetCCtx_internal()`. * Add `simpleApiParams` to the CCtx and use them in the simple API functions, instead of creating those parameters on the stack. I think this is a good direction to move in, because we shouldn't need to worry about adding parameters to `ZSTD_CCtx_params`, since it should always be on the heap (unless they become absoultely gigantic). Some `ZSTD_CCtx_params` are still on the stack in the CDict functions, but I've left them for now, because it was a little more complex, and we don't use those functions in stack-constrained currently.	2021-05-05 13:25:16 -07:00
Nick Terrell	0b88c2582c	[test] Add large dict/data --patch-from test Dictionary size must be > `ZSTD_CHUNKSIZE_MAX`.	2021-05-04 17:31:32 -07:00
Nick Terrell	94db4398a0	[lib] Always load the dictionary in one go Dictionaries larger than `ZSTD_CHUNKSIZE_MAX` used to have to be loaded in multiple segments. Instead, when we detect large dictionaries, ensure that we reset the context's indicies. Then, for dictionaries larger than `ZSTD_CURRENT_MAX - 1`, only load the suffix of the dictionary. Finally, enable DDS for large dictionaries, since we no longer load in multiple segments. This simplifes the dictionary loading code, and reduces opportunities for non-determinism to slip in.	2021-05-04 16:45:25 -07:00
Nick Terrell	1ffa80a09e	[easy] Rewrite rowHashLog computation `ZSTD_highbit32(1u << x) == x` when it isn't undefined behavior.	2021-05-04 11:43:20 -07:00
Nick Terrell	34aff7ea06	Bug fix & run overflow correction much more frequently in tests * Fix overflow correction when `windowLog < cycleLog`. Previously, we got the correction wrong in this case, and our chain tables and binary trees would be corrupted. Now, we work as long as `maxDist` is a power of two, by adding `MAX(maxDist, cycleSize)` to our indices. * When `ZSTD_WINDOW_OVERFLOW_CORRECT_FREQUENTLY` is defined to non-zero run overflow correction as frequently as allowed without impacting compression ratio. * Enable `ZSTD_WINDOW_OVERFLOW_CORRECT_FREQUENTLY` in `fuzzer` and `zstreamtest` as well as all the OSS-Fuzz fuzzers. This has a 5-10% speed penalty at most, which seems reasonable.	2021-05-03 15:21:47 -07:00
senhuang42	61fe571af6	Fix chaintable check to include rowhash in ZSTD_reduceIndex()	2021-04-30 19:52:04 -04:00
Nick Terrell	6cee3c2c4f	[trace] Remove default definitions of weak symbols Instead of providing a default no-op implementation, check the symbols for `NULL` before accessing them. Providing a default implementation doesn't reliably work with dynamic linking. Depending on link order the default implementations may not be overridden. By skipping the default implementation, all link order issues are resolved. If the symbols aren't provided the weak function will be `NULL`.	2021-04-26 16:05:39 -07:00
felixhandte	efa6dfa729	Apply DDS adjustments to avoid assert failures	2021-04-23 16:41:00 -04:00
sen	12c045f74d	Merge pull request #2574 from senhuang42/repcode_mismatch_detector_fix Correct the block splitter mismatched repcodes detection.	2021-04-12 23:27:43 -04:00
Sen Huang	550f76f131	Correct the detection of mismatched repcodes	2021-04-09 09:08:51 -07:00
Nick Terrell	4694423c4f	Add and integrate lazy row hash strategy	2021-04-07 09:53:34 -07:00
sen	f71aabb5b5	Move clevel override to after initLocalDict() (#2571 )	2021-04-06 21:05:37 -04:00
sen	f1e8b565c2	Maintain two repcode histories for block splitting, replace invalid repcodes (#2569 )	2021-04-06 17:25:55 -04:00
sen	e38124555e	Fix dictionary force reloading clevel selection (#2570 ) * Move cdict clevel override to before localdict init * Update results.csv after dict load changes	2021-04-06 15:35:09 -04:00
sen	980f3bbf83	[cwksp] Align all allocated "tables" and "aligneds" to 64 bytes (#2546 ) * Perform 64-byte alignment of wksp tables and aligneds internally * Clean up cwskp_finalize() function to only do two allocs * Refactor aligned/buffer reservation code, remove ASAN req for alignment reservations * Change from allocating 128 bytes always to allocating only buffer space as needed for tables/aligned * Back out aligned/table reservation order restriction * Add stricter bounds for new/resized wksps, fix comment in zstd_cwksp.h	2021-04-01 20:07:19 -04:00
sen	255925c231	Fix repcode-related OSS-fuzz issues in block splitter (#2560 ) * Do not emit last partitions of blocks as RLE/uncompressed * Fix repcode updates within block splitter * Add a entropytables confirm function, redo ZSTD_confirmRepcodesAndEntropyTables() for better function signature * Add a repcode updater to block splitter, no longer need to force emit compressed blocks	2021-03-31 15:14:59 -04:00
Nick Terrell	a494308ae9	[copyright][license] Switch to yearless copyright and some cleanup in the linux-kernel files * Switch to yearless copyright per FB policy * Fix up SPDX-License-Identifier lines in `contrib/linux-kernel` sources * Add zstd copyright/license header to the `contrib/linux-kernel` sources * Update the `tests/test-license.py` to check for yearless copyright * Improvements to `tests/test-license.py` * Check `contrib/linux-kernel` in `tests/test-license.py`	2021-03-30 10:30:43 -07:00
sen	84ccb81e7c	Merge pull request #2561 from senhuang42/longlength_enum Add enum for representing long length ID	2021-03-26 15:55:12 -04:00
Sen Huang	b1a43455f8	Add enum for representing long length ID	2021-03-26 10:41:09 -07:00
sen	4fe2e7ae14	Merge pull request #2558 from senhuang42/msan_block_splitter_fix Fix block splitter minor MSAN warning.	2021-03-25 13:51:43 -04:00
sen	b0407b9f0e	Merge pull request #2555 from senhuang42/default_clevel_func Add ZSTD_defaultCLevel() function to public API	2021-03-25 13:07:28 -04:00
Sen Huang	2a907bf4aa	Move lastCountSize into a returned struct, fix MSAN error	2021-03-25 09:11:15 -07:00
Sen Huang	e398744a35	Add ZSTD_defaultCLevel() function to public API	2021-03-25 08:04:00 -07:00
Nick Terrell	f8ac0ea7ef	Merge pull request #2539 from terrelln/linux-kernel-fixes Fixes for the next linux kernel patch version	2021-03-24 10:34:29 -07:00
sen	bf542c8a8d	Merge pull request #2447 from senhuang42/block_splitter_v2 Recursive block splitting	2021-03-24 12:27:22 -04:00
Sen Huang	5b566ebe08	Rename compressSequences() functions for clarity	2021-03-24 08:21:29 -07:00
Sen Huang	0ef1f935b7	Add a fallback in case the total blocksize of split blocks exceeds raw block size	2021-03-24 08:21:29 -07:00
Sen Huang	c90e81a692	Enable block splitter by default when applicable	2021-03-24 08:21:29 -07:00
Sen Huang	e34332834a	Clean up various functions, add debuglogging for estimate vs. actual sizes	2021-03-24 08:21:29 -07:00
Sen Huang	41c3eae6d9	Fix various fuzzer failures: repcode history, superblocks	2021-03-24 08:21:29 -07:00
senhuang42	0633bf17c3	Change 1.3.4 bugfix to be cross-compatible with superblocks and normal compression	2021-03-24 08:21:29 -07:00
senhuang42	eb1ee8686d	Refactor buildSequencesStatistics() to avoid pointer increment for superblocks	2021-03-24 08:21:29 -07:00
senhuang42	e2bb215117	Add unit tests and fuzzer param	2021-03-24 08:21:09 -07:00
senhuang42	de52de1347	Add recursive block split algorithm	2021-03-24 08:21:09 -07:00
senhuang42	f06f6626ed	Update function names for consistency	2021-03-24 08:20:54 -07:00
senhuang42	c56d6e49e8	Add block splitter to experimental params	2021-03-24 08:20:54 -07:00
senhuang42	2949a95224	Refactor block compression logic into single function	2021-03-24 08:20:54 -07:00
senhuang42	c05c090cc2	Centralize entropy statistics calculations to zstd_compress.c	2021-03-24 08:20:29 -07:00
sen	c48889f097	Merge pull request #2538 from senhuang42/monotonicity_test Add memory monotonicity test over srcSize	2021-03-22 16:54:34 -04:00
Sen Huang	dff4a0e867	Make ZSTD_estimateCCtxSize_internal() loop through all srcSize parameter sets as well	2021-03-21 16:15:31 -07:00
Sen Huang	77ae664ba6	Fix ZSTD_dedicatedDictSearch_isSupported() requirements	2021-03-16 17:36:05 -07:00
senhuang42	386111adec	Add a nbSeq argument to compressSequences() Refactor ZSTD_compressBlock_internal() to do the block header write within and add nbSeq argument to compressSequences()	2021-03-16 14:04:22 -07:00
senhuang42	98764493cf	Move block header write into compressBlock_internal()	2021-03-16 14:04:22 -07:00
Nick Terrell	cd1551d261	[lib][tracing] Add ZSTD_NO_TRACE macro When defined, it disables tracing, and avoids including the header.	2021-03-16 11:47:27 -07:00
Nick Terrell	7736549bea	[bug-fix] Make simple single-pass functions ignore advanced parameters The simple compression functions are intended to ignore the advanced parameters, but they were accidentally using them. All the `ZSTD_parameters` were set correctly, but any extra parameters were used as-is. E.g. `ZSTD_c_format`. This PR makes all the simple single-pass functions listed below ignore the advanced parameters, as intended. * `ZSTD_compressCCtx()` * `ZSTD_compress_usingDict()` * `ZSTD_compress_usingCDict()` * `ZSTD_compress_advanced()` * `ZSTD_compress_usingCDict_advanced()` It also adds a test case that ensures that each of these functions ignore the advanced parameters.	2021-02-12 19:11:23 -08:00
Nick Terrell	c62eb05964	[lib] Set appliedParams.compressionLevel correctly Forward the correct compressionLevel to the appliedParams in all cases. It was already correct for the advanced API, so only the old single-pass functions needed to be fixed. This compression level is unused by the library, but is set so that the tracing framework can consume it.	2021-02-12 15:00:14 -08:00
Nick Terrell	f520f6dfbe	[trace] Minor fixes found during integration * Mark `ZSTD_CCtx_getParameter()` as const * Add `extern "C"` guards to `zstd_trace.h`	2021-02-11 16:20:04 -08:00
Yann Collet	8884cb887d	Merge pull request #2483 from mpu/ldmgear New algorithms for the long distance matcher	2021-02-11 08:38:23 -08:00
Quentin Carbonneaux	552efcac2d	relocate large arrays from the stack to ldmState_t	2021-02-10 16:16:54 +01:00
Nick Terrell	e59c9459a5	[trace] Keep track of a uint64_t tracing context The most common information that you want to track between begin() and end() is the timestamp of the begin function, so you can measure the duration of the (de)compression call. Allow the tracing library to put this information inside the `ZSTD_TraceCtx`, so it doesn't need to keep a global map in this case. If a single uint64_t is not enough, the tracing library can return a unique identifier (like the context pointer) instead, and use it as a key in a map. This keeps the simple case simple.	2021-02-09 11:37:05 -08:00
Nick Terrell	54a4998a80	Add basic tracing functionality	2021-02-05 16:28:52 -08:00
Quentin Carbonneaux	1e65711ca5	a couple performance improvement changes for ldm	2021-01-20 00:54:20 -08:00
Nick Terrell	58476bcf7f	Don't shrink window log in ZSTD_getCParams() Treat ZSTD_getCParams() and ZSTD_adjustCParams() in the same way we treat streaming compression. Choose parameters based on the dictionary size + source size, and assume the source size is small if unkown. But, don't shrink the window log down in ZSTD_adjustCParams_internal().	2021-01-04 15:54:09 -08:00
Nick Terrell	9d31c704d5	Don't shrink window log when streaming with a dictionary Fixes #2442. 1. When creating a dictionary keep the same behavior as before. Assume the source size is 513 bytes when adjusting parameters. 2. When calling ZSTD_getCParams() or ZSTD_adjustCParams() keep the same behavior as before. 3. When attaching a dictionary keep the same behavior of ignoring the dictionary size. When streaming this will select the largest parameters and not adjust them down. But, the CDict will use the correctly sized parameters, which seems like the right tradeoff. 4. When not attaching a dictionary (either forced not to, or using a prefix dictionary) we select parameters based on the dictionary size + source size, and assume the source size is small, which is the same behavior as before. But, now we don't adjust the window log (and hash and chain log) down when the source size is unknown. When the source size is unknown all cdicts should attach, except when the user disables attaching, or `forceWindow` is used. This means that when streaming with a CDict we end up in the good case where we get small CDict parameters, and large source parameters. TODO: Add a streaming + dictionary regression test case.	2021-01-04 15:54:09 -08:00
Nick Terrell	66e811d782	[license] Update year to 2021	2021-01-04 17:53:52 -05:00
senhuang42	5c41490bfe	Use pre-defined constants	2020-12-21 11:52:05 -05:00
senhuang42	7e11bd012b	Implement skippable frame function	2020-12-21 11:13:22 -05:00
Yann Collet	0b39531d75	moving all references to `release` branch was previously `master`	2020-12-16 23:00:35 -08:00
W. Felix Handte	9dab03db90	Create Enum to Represent Static/Dynamic Allocation Distinction in cwksp	2020-12-09 14:57:37 -05:00
W. Felix Handte	db9e73cb07	Don't ASAN-Poison Statically-Allocated Workspaces Addresses #2286.	2020-12-09 13:00:47 -05:00
Nick Terrell	c238db046f	Merge pull request #2414 from terrelln/mt-progress [lib] Ensure that multithreaded compression always makes some progress	2020-12-04 16:30:08 -08:00
Nick Terrell	4c58cb8383	[lib] Ensure that multithreaded compression always makes some progress	2020-12-03 20:25:14 -08:00
Nick Terrell	6672689e7e	Merge pull request #2406 from terrelln/linux-wrapper-api [linux] Add the linux wrapper API	2020-12-02 16:49:03 -08:00
Nick Terrell	894ae36675	Merge pull request #2390 from animalize/clamp_level Clamp compression level	2020-12-02 14:35:58 -08:00
senhuang42	2cbd038528	Move max nb seq check to per-block	2020-12-02 12:11:32 -05:00
Nick Terrell	3cda5fae77	[minor][lib] Remove double semicolon	2020-12-02 01:08:08 -08:00
senhuang42	3efe9c902b	Add sequence nb validation to compressSequences(), adjust minMatch comparisons	2020-12-01 10:54:45 -05:00

1 2 3 4 5 ...

1312 Commits