townforge/zstd - zstd - Townforge git

Author	SHA1	Message	Date
Jennifer Liu	21721b75a3	Change default f to 20	2018-09-04 17:15:14 -07:00
Jennifer Liu	944c9986e0	Update comment on default steps of cover and fastcover	2018-08-30 15:37:29 -07:00
Jennifer Liu	16db0337b1	Always use splitPoint=1.0 for non-optimize cover and fastcover	2018-08-30 14:59:22 -07:00
Yann Collet	31ebb26945	Merge pull request #1301 from terrelln/lit-size [zstd] Fix seqStore growth	2018-08-28 17:10:25 -07:00
Nick Terrell	5e580de6da	[zstd] Fix seqStore growth We could undersize the literals buffer by up to 11 bytes, due to a combination of 2 bugs: * The literals buffer didn't have `WILDCOPY_OVERLENGTH` extra space, like it is supposed to. * We didn't check the literals buffer size in `ZSTD_sufficientBuff()`.	2018-08-28 13:24:44 -07:00
Yann Collet	b37a0a6bde	Merge pull request #1298 from facebook/bench Refactored bench.c	2018-08-28 12:25:02 -07:00
modbw	d14edf259f	Fixed memory leak detected by cppcheck cppcheck (which is run regularly in our CI environment) detected a possible memory leak.	2018-08-28 07:25:05 +02:00
Yann Collet	6782725155	first sketch for largeNbDicts test program	2018-08-26 19:29:12 -07:00
Yann Collet	af23d39eb8	Merge pull request #1297 from felixhandte/check-offset-table Fix Missing Offset Table Check	2018-08-24 17:36:44 -07:00
W. Felix Handte	37f17ee237	Mark Repeated Offset Table as Needing Check	2018-08-24 14:33:34 -07:00
Nick Terrell	e34e917655	Fix compiler warning	2018-08-23 17:48:06 -07:00
Nick Terrell	5ee5e71be3	[zstd] Add note about empty ZSTD_CDict	2018-08-23 17:48:06 -07:00
Nick Terrell	924944e471	[zstd] Reuse the ZSTD_CCtx more often with small data.	2018-08-23 17:48:06 -07:00
Yann Collet	2e45badff4	refactored bench.c for clarity and safety, especially at interface level	2018-08-23 14:21:18 -07:00
Jennifer Liu	9d6ed9def3	Merge fastCover into DictBuilder (#1274 ) * Minor fix * Run non-optimize FASTCOVER 5 times in benchmark * Merge fastCover into dictBuilder * Fix mixed declaration issue * Add fastcover to symbol.c * Add fastCover.c and cover.h to build * Change fastCover.c to fastcover.c * Update benchmark to run FASTCOVER in dictBuilder * Undo spliting fastcover_param into cover_param and f * Remove convert param functions * Assign f to parameter * Add zdict.h to Makefile in lib * Add cover.h to BUCK * Cast 1 to U64 before shifting * Remove trimming of zero freq head and tail in selectSegment and rebenchmark * Remove f as a separate parameter of tryParam * Read 8 bytes when d is 6 * Add trimming off zero frequency head and tail * Use best functions from COVER and remove trimming part(which leads to worse compression ratio after previous bugs were fixed) * Add finalize= argument to FASTCOVER to specify percentage of training samples passed to ZDICT_finalizeDictionary * Change nbDmer to always read 8 bytes even when d=6 * Add skip=# argument to allow skipping dmers in computeFrequency in FASTCOVER * Update comments and benchmarking result * Change default method of ZDICT_trainFromBuffer to ZDICT_optimizeTrainFromBuffer_fastCover * Add dictType enum and fix bug about passing zParam when converting to coverParam * Combine finalize and skip into a single parameter * Update acceleration parameters and benchmark on 3 sample sets * Change default splitPoint of FASTCOVER to 0.75 and benchmark first 3 sample sets * Initialize variables outside of for loop in benchmark.c * Update benchmark result for hg-manifest * Remove cover.h from install-includes * Add explanation of f * Set default compression level for trainFromBuffer to 3 * Add assertion of fastCoverParams in DiB_trainFromFiles * Add checkTotalCompressedSize function + some minor fixes * Add test for multithreading fastCovr * Initialize segmentFreqs in every FASTCOVER_selectSegment and move mutex_unnlock to end of COVER_best_finish * Free segmentFreqs * Initialize segmentFreqs before calling FASTCOVER_buildDictionary instead of in FASTCOVER_selectSegment * Add FASTCOVER_MEMMULT * Minor fix * Update benchmarking result	2018-08-23 12:06:20 -07:00
W. Felix Handte	e589ac6276	Reformat Introduction Comment and Mention Negative Levels	2018-08-22 17:07:34 -07:00
Yann Collet	c71c4f23d7	fix "unused parameter" in single-thread mode within newly added ZSD_toFlushNow()	2018-08-20 11:40:10 -07:00
Yann Collet	105677c6db	created ZSTDMT_toFlushNow() tells in a non-blocking way if there is something ready to flush right now. only works with multi-threading for the time being. Useful to know if flush speed will be limited by lack of production.	2018-08-17 18:11:54 -07:00
Yann Collet	36d6165a2d	Makefile: added variable SCANBUILD so that a different version of scan-build can be selected	2018-08-16 16:44:13 -07:00
Yann Collet	1515f0bb0d	fixed more issues detected by recent version of scan-build test run on Linux	2018-08-16 15:20:25 -07:00
Yann Collet	5291d9ac31	fix scope of scan-build tests exclude zlib code	2018-08-15 17:41:44 -07:00
Yann Collet	42a02ab745	fixed minor warnings issued by scan-build	2018-08-15 14:36:02 -07:00
Yann Collet	3692c31598	Merge branch 'dev' into scanbuild	2018-08-15 13:50:49 -07:00
Yann Collet	6e66bbf5dd	fixed several minor issues detected by scan-build only notable one : writeNCount() resists better vs invalid distributions (though it should never happen within zstd anyway)	2018-08-14 16:55:35 -07:00
Yann Collet	3e4617ef54	frameProgression reports nbActiveWorkers and output flushed	2018-08-14 11:49:25 -07:00
Yann Collet	e7a49c6683	introduced command --adapt	2018-08-11 20:48:06 -07:00
Yann Collet	2dd76037be	zstd cli can increase level when input is too slow	2018-08-09 15:51:30 -07:00
Yann Collet	79a35ac20d	minor code comments improvements	2018-08-09 15:16:31 -07:00
W. Felix Handte	2ca7c69167	Fix CDict Attachment to Handle CDicts with Non-Zero Starts CDicts were previously guaranteed to be generated with `lowLimit=dictLimit=0`. This is no longer true, and so the old length and index calculations are no longer valid. This diff fixes them to handle non-zero start indices in CDicts.	2018-08-07 18:14:14 -07:00
Yann Collet	5808027abf	Merge branch 'dev' into fix1241	2018-08-03 16:08:33 -07:00
Yann Collet	5892dd5da4	Merge pull request #1255 from terrelln/norm-fix [FSE] Fix division by zero	2018-08-02 11:48:56 -07:00
Nick Terrell	dc5a67cb7b	Disallow tableLog == srcLog	2018-08-02 11:12:17 -07:00
Jennifer Liu	f5228f2c44	Refactoring	2018-07-31 13:58:54 -07:00
Jennifer Liu	4e29bc2469	Use CDict instead of CCtx in analyzeEntropy	2018-07-31 10:36:45 -07:00
cyan4973	3f535007e4	fix %zu support under minGW and relevant test on Appveyor	2018-07-30 16:56:18 +02:00
cyan4973	aade1e5904	Merge branch 'dev' into fix1241	2018-07-30 16:30:35 +02:00
Nick Terrell	9889bca530	[FSE] Fix division by zero When the primary normalization method fails, and `(1 << tableLog) == (maxSymbolValue + 1)`, and every symbol gets assigned normalized weight 1 or -1 in the first loop, then the next division can raise `SIGFPE`.	2018-07-27 17:30:03 -07:00
Yann Collet	6e490a2f09	Merge pull request #1237 from terrelln/init-cstream-adv Set requestedParams in ZSTD_initCStream*()	2018-07-18 16:33:30 +02:00
cyan4973	9597b438e9	fix #1241 Ensure that first input position is valid for a match even during first usage of context by starting reference at 1 (avoiding the problematic 0).	2018-07-17 18:52:57 +02:00
cyan4973	53e1f0504e	zstdmt debug traces compatibles with mingw since mingw does not have `sys/times.h`, remove this path when detecting mingw compilation.	2018-07-17 14:39:44 +02:00
Nick Terrell	45821fac0c	Merge pull request #1225 from jennifermliu/dev Split samples when building dictionary for COVER	2018-07-13 13:26:15 -07:00
Nick Terrell	6d222c437c	Set requestedParams in ZSTD_initCStream() The correct parameters are used once, but once `ZSTD_resetCStream()` is called the default parameters (level 3) are used. Fix this by setting `requestedParams` in the `ZSTD_initCStream()` functions. The added tests both fail before this patch and pass after.	2018-07-12 18:35:55 -07:00
Jennifer Liu	612b346ed5	Add explanation for split=100	2018-07-11 15:50:28 -07:00
Jennifer Liu	5021441d86	Change default splitPoint to 100	2018-07-10 11:19:33 -07:00
Jennifer Liu	456f290e31	Change back to splitPoint<=0	2018-07-09 13:53:25 -07:00
Jennifer Liu	7efabb2cf6	Only make 0.0 default splitPoint	2018-07-09 12:26:53 -07:00
Yann Collet	bbd78df59b	add build macro NO_PREFETCH prevent usage of prefetch intrinsic commands which are not supported by c2rust (see https://github.com/immunant/c2rust/issues/13)	2018-07-06 17:06:04 -07:00
Jennifer Liu	015a00af0f	Change cover_sum back to 2 parameters and fix splitPoint issues	2018-07-06 14:24:18 -07:00
Jennifer Liu	0bbff01211	Fix testing parameter	2018-07-05 22:40:32 -07:00
Jennifer Liu	a085d1aae1	Allow splitPoint==1.0 (using all samples for both training and testing)	2018-07-05 10:38:45 -07:00
Jennifer Liu	0881184c89	Some edits based on pull request comments	2018-07-03 17:53:27 -07:00
Jennifer Liu	16e75e8804	Update minimal training sample size	2018-07-03 12:07:06 -07:00
Jennifer Liu	348e5f77a9	Add split=# to cli	2018-06-29 17:54:41 -07:00
Jennifer Liu	52fbbbcb6b	Explicitly cast double to unsigned	2018-06-29 16:17:20 -07:00
Jennifer Liu	f9d19b83fb	Fix variable declaration problem	2018-06-29 15:46:56 -07:00
Jennifer Liu	e061d84016	Another fix to comparator	2018-06-29 15:38:08 -07:00
Jennifer Liu	59797d3328	Fix splitPoint floating point comparison problem	2018-06-29 12:47:03 -07:00
Jennifer Liu	0ef06f2e8a	Split samples into train and test sets	2018-06-29 12:33:34 -07:00
Yann Collet	121aa2c388	Merge pull request #1211 from facebook/staticAssert updated DEBUG_STATIC_ASSERT()	2018-06-27 12:19:17 -07:00
Yann Collet	4489daec09	slightly adjusted default-distribution threshold depending on strategy. fast favors faster compression and decompression speeds.	2018-06-26 20:10:45 -07:00
Yann Collet	ff773bfcde	zeroise freq table with memset() improves decoding speed by ~5% in github_users sample set	2018-06-26 17:24:41 -07:00
Yann Collet	7b9bbf77c9	switched to a sizeof() version avoid -Werror=unused-variable issue	2018-06-26 14:08:35 -07:00
Yann Collet	f98ec46979	updated DEBUG_STATIC_ASSERT() following suggestion from #1209	2018-06-26 12:04:59 -07:00
Nick Terrell	b426bcc097	[zstdmt] Fix jobsize bugs (#1205 ) [zstdmt] Fix jobsize bugs * `ZSTDMT_serialState_reset()` should use `targetSectionSize`, not `jobSize` when sizing the seqstore. Add an assert that checks that we sized the seqstore using the right job size. * `ZSTDMT_compressionJob()` should check if `rawSeqStore.seq == NULL`. * `ZSTDMT_initCStream_internal()` should not adjust `mtctx->params.jobSize` (clamping to MIN/MAX is okay).	2018-06-25 15:21:08 -07:00
Yann Collet	3b53bfe4f3	Merge pull request #1200 from felixhandte/zstd-attach-dict-pref Add CCtx Param Controlling Dict Attachment Behavior	2018-06-25 12:42:31 -07:00
Yann Collet	31769ce702	error on no forward progress streaming decoders, such as ZSTD_decompressStream() or ZSTD_decompress_generic(), may end up making no forward progress, (aka no byte read from input __and__ no byte written to output), due to unusual parameters conditions, such as providing an output buffer already full. In such case, the caller may be caught in an infinite loop, calling the streaming decompression function again and again, without making any progress. This version detects such situation, and generates an error instead : ZSTD_error_dstSize_tooSmall when output buffer is full, ZSTD_error_srcSize_wrong when input buffer is empty. The detection tolerates a number of attempts before triggering an error, controlled by ZSTD_NO_FORWARD_PROGRESS_MAX macro constant, which is set to 16 by default, and can be re-defined at compilation time. This behavior tolerates potentially existing implementations where such cases happen sporadically, like once or twice, which is not dangerous (only infinite loops are), without generating an error, hence without breaking these implementations.	2018-06-22 17:58:21 -07:00
Yann Collet	3934e010a2	Merge pull request #1197 from facebook/poolResize Thread Pool resize	2018-06-22 14:20:07 -07:00
Yann Collet	fbd5dfc1b1	changed POOL_resize() return type to int return is now just en error code. This guarantee that `ctx` remains valid after POOL_resize(). Gets rid of internal POOL_free() operation.	2018-06-22 12:14:59 -07:00
Yann Collet	1d5648ca10	Merge pull request #1196 from felixhandte/zstd-btopt-in-place-dict ZSTD_btopt: Support Searching the Dictionary Context In-Place	2018-06-22 11:53:23 -07:00
Yann Collet	f6242d30b7	Merge pull request #1202 from facebook/barelyCompressible Increase threshold detection of poorly compressible data	2018-06-22 11:52:52 -07:00
Yann Collet	698fd00afb	huf: increase threshold detection of poorly compressible data	2018-06-21 18:32:38 -07:00
Yann Collet	243cd9d8bb	add a cond_broadcast after resize to make sure all threads (notably newly available threads) get awaken to immediately process potential items in the queue.	2018-06-21 18:04:58 -07:00
Yann Collet	818e72b4d5	added extended POOL test abrupt end + downsizing with running jobs remaining in queue. also : POOL_resize() requires numThreads >= 1	2018-06-21 14:58:59 -07:00
W. Felix Handte	01bb1c1016	Add CCtx Param Controlling Dict Attachment Behavior	2018-06-21 17:29:25 -04:00
W. Felix Handte	3e91dc4d6a	Add Repcode Bounds Check	2018-06-21 15:54:41 -04:00
W. Felix Handte	5bd3d4b7d2	Add Debug Log Statement	2018-06-21 15:54:07 -04:00
W. Felix Handte	3caba150c6	Fix `dmsBtLow` Test	2018-06-21 15:53:40 -04:00
W. Felix Handte	5da9bbc38e	Conceivably Dedup ZSTD_noDict and ZSTD_dictMatchState _insertBt1 Impls By reverting to the bool extDict flag, we call ZSTD_insertBt1 with the same const args in both non-extDict dictModes.	2018-06-21 11:20:01 -04:00
Yann Collet	6de249c1c6	fixed: bug when counting nb of active threads when queueSize > 1 also : added a test in testpool.c verifying resizing is effective.	2018-06-20 18:28:49 -07:00
Yann Collet	6b48eb12c0	change control of threadLimit now limits maximum nb of active threads even when queueSize > 1.	2018-06-20 14:35:39 -07:00
W. Felix Handte	5d81f71e83	Consistency in Guarding DMS-Only Variable Initializations	2018-06-20 16:54:53 -04:00
W. Felix Handte	9c14eafe3d	Also Use `matchLow` for HC3 Match	2018-06-20 15:51:14 -04:00
W. Felix Handte	0a6cf7cd1d	Minor Changes	2018-06-20 15:27:23 -04:00
W. Felix Handte	ae1f3898a2	Remove Dead(!) HC3 DMS Lookup	2018-06-20 15:27:12 -04:00
Yann Collet	93702a7a62	Merge pull request #1198 from facebook/msdebug made Visual Studio compatible with DEBUGLEVEL >= 2	2018-06-20 12:26:31 -07:00
cyan4973	ae0b7ffa0a	made Visual Studio compatible with DEBUGLEVEL >= 2	2018-06-20 09:45:02 -07:00
Yann Collet	62469c9f41	fixed wrong size in pthread struct transfer	2018-06-19 20:14:03 -07:00
Yann Collet	166901dc72	reduced POOL_resize() restriction It's not necessary to ensure that no job is ongoing. The pool is only expanded, existing threads are preserved. In case of error, the only option is to return NULL and terminate the thread pool anyway.	2018-06-19 18:07:18 -07:00
Yann Collet	066fbbfe1c	make zstdmt resize its context when nbThreads change. Technically, it only expands. But when instructed to use less threads, the thread pool will limit nb of concurrent threads.	2018-06-19 17:28:56 -07:00
Yann Collet	4567c57199	finalized POOL_resize() POOL_ctx* POOL_resize(POOL_ctx* ctx, size_t numThreads) The function may fail, and returns a NULL pointer in this case.	2018-06-19 16:03:12 -07:00
Yann Collet	6768cf53fd	Merge pull request #1190 from terrelln/ldm-adjust Adjust advanced parameters to source size	2018-06-19 14:40:56 -07:00
W. Felix Handte	03c39c540b	Fix Incorrect Param	2018-06-19 15:36:33 -04:00
W. Felix Handte	de639502aa	Update Dict Attachment Cut-Offs	2018-06-19 15:36:13 -04:00
W. Felix Handte	f0a13bcd68	Make Sure Position 0 Gets Into the Tree	2018-06-19 15:10:06 -04:00
W. Felix Handte	87fe4788a3	Fix Compression Ratio Regression #1	2018-06-19 13:01:21 -04:00
W. Felix Handte	4bb79f9c55	Misc Changes	2018-06-19 13:01:21 -04:00
W. Felix Handte	2091f34e9e	Find Proper Matches	2018-06-19 13:01:21 -04:00
W. Felix Handte	64348a15f1	Misc Fixes	2018-06-19 13:01:21 -04:00
W. Felix Handte	ade8586ce6	Find `mls == 3` Matches	2018-06-19 13:01:21 -04:00
W. Felix Handte	ce743312e2	Fix Typo	2018-06-19 13:01:21 -04:00
W. Felix Handte	a075864756	Switch `!= ZSTD_extDict` to `== ZSTD_noDict`	2018-06-19 13:01:21 -04:00
W. Felix Handte	1e03377bde	Implement RepCode Check	2018-06-19 13:01:21 -04:00
W. Felix Handte	ccbf067973	Add _dictMatchState Functions	2018-06-19 13:01:21 -04:00
W. Felix Handte	d5d8240967	Convert `extDict` Flag to `dictMode` Enum	2018-06-19 13:01:21 -04:00
W. Felix Handte	93c3184d44	Attach Dicts when Using ZSTD_btopt and ZSTD_btultra	2018-06-19 13:01:21 -04:00
Yann Collet	1c714fda3f	introduced POOL_resize() not complete yet : finalize behavior in case of unfinished expansion	2018-06-18 20:46:39 -07:00
Nick Terrell	3841dbac84	Adjust advanced parameters to source size In the new advanced API, adjust the parameters even if they are explicitly set. This mainly applies to the `windowLog`, and accordingly the `hashLog` and `chainLog`, when the source size is known.	2018-06-18 15:49:31 -07:00
Yann Collet	e30f13bde0	Merge pull request #1185 from felixhandte/zstd-btlazy-in-place-dict ZSTD_btlazy2: Support Searching the Dictionary Context In-Place	2018-06-18 13:29:44 -07:00
Yann Collet	d8462ecba2	Merge branch 'dev' into huf_rename	2018-06-14 20:42:10 -04:00
Yann Collet	b7e5ebef2a	grouped X2 function together	2018-06-14 20:41:50 -04:00
Yann Collet	9698d2fb72	Merge pull request #1189 from facebook/hist histogram module	2018-06-14 20:39:52 -04:00
Yann Collet	6901c94cd6	avoid duplicate code comments when a function is decribed in hist.h, do not describe it again in hist.c to avoid future doc synchronization issues.	2018-06-14 19:47:05 -04:00
Yann Collet	f70f829ff5	Merge pull request #1187 from facebook/fix1186 fix dctx initialization within ZSTD_decompress in stack mode	2018-06-14 16:22:22 -04:00
Yann Collet	a71513bec6	Merge pull request #1184 from facebook/debug Grouped debug functions into debug.h	2018-06-14 16:21:53 -04:00
Yann Collet	1adf84ccb7	renamed all HUF_decompressX4() functions into X2 to underline they generate up to 2 symbols per decoding, in preparation for a future X3 variant.	2018-06-14 15:17:03 -04:00
Yann Collet	a09af5eb6b	renamed all HUF_decompressX2() functions into X1 to underline they generate one symbol per decoding operation. The new naming scheme will make it easier to introduce an X3 variant.	2018-06-14 15:08:43 -04:00
W. Felix Handte	0c654d22c8	Force Inline BtFindBestMatch	2018-06-14 14:54:39 -04:00
Yann Collet	7fee966f02	fix dctx initialization within ZSTD_decompress in stack mode when ZSTD_HEAPMODE=0 (which is not default). Also : added an associated test (test-fuzzer-stackmode) run on travis CI fix #1186	2018-06-14 10:22:24 -04:00
Yann Collet	fc682263d0	fixed g_debuglevel variable name in debug.h	2018-06-13 20:02:33 -04:00
Yann Collet	2d76defbfe	grouped all histogram functions into hist.c renamed functions with HIST_* prefix	2018-06-13 19:49:31 -04:00
W. Felix Handte	0551de4b5a	Search Dict for Matches	2018-06-13 16:06:28 -04:00
W. Felix Handte	ace9cfa950	Attach Dicts when Using ZSTD_btlazy2	2018-06-13 16:06:28 -04:00
Yann Collet	fa41bcc2c2	grouped debug functions into debug.h There were 2 competing set of debug functions within zstd_internal.h and bitstream.h. They were mostly duplicate, and required care to avoid messing with each other. There is now a single implementation, shared by both. Significant change : The macro variable ZSTD_DEBUG does no longer exist, it has been replaced by DEBUGLEVEL, which required modifying several source files.	2018-06-13 15:43:09 -04:00
W. Felix Handte	d53200a846	Fix Cast Warning	2018-06-13 14:58:36 -04:00
W. Felix Handte	b82063b266	Extend Dictionary Matches Backwards	2018-06-13 14:58:36 -04:00
W. Felix Handte	d53a04211c	Update Dictionary Attachment Cutoff Values Again	2018-06-13 14:58:36 -04:00
W. Felix Handte	2162aa9f18	Do Not Inline DMS Search Function	2018-06-13 14:58:36 -04:00
W. Felix Handte	338bede9b5	Also Implement Depth Repcode Checks	2018-06-13 14:58:36 -04:00
W. Felix Handte	555ab9f8cf	Apply Match Continuation Bug Fix	2018-06-13 14:58:36 -04:00
W. Felix Handte	c87dd2121d	Update Dictionary Attachment Cutoff Values	2018-06-13 14:58:36 -04:00
W. Felix Handte	6204b6d592	Check Dict Match State in ZSTD_HcFindBestMatch_generic	2018-06-13 14:58:36 -04:00
W. Felix Handte	211a61b69b	Focus on Non-BT Impls for the Moment	2018-06-13 14:58:36 -04:00
W. Felix Handte	2e93736a77	Remove Pre-Existing Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	3b82a23a35	Second Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	a2a24bebec	First Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	f74c2cd673	Disallow Too-Long Repcodes When Using an Attached Dict	2018-06-13 14:58:36 -04:00
W. Felix Handte	c14db94450	Rename `base` -> `prefixLowest`	2018-06-13 14:58:36 -04:00
W. Felix Handte	5d90708a0a	Go Back to Separate Intermediate Functions for Different Dict Modes	2018-06-13 14:58:36 -04:00
W. Felix Handte	f84fc63a43	Further Templatize Intermediate Functions on dictMode	2018-06-13 14:58:36 -04:00
W. Felix Handte	529d3a5acd	Convert Existing U32 extDict Vars to ZSTD_dictMode Enums	2018-06-13 14:58:36 -04:00
W. Felix Handte	33e2240fac	Attach Dict When Using ZSTD_lazy Strategies	2018-06-13 14:58:36 -04:00
W. Felix Handte	90cfc799e5	Add _dictMatchState Stubs for ZSTD_lazy Functions	2018-06-13 14:58:36 -04:00
W. Felix Handte	a85ecb32bd	Add dictMode Param to ZSTD_compressBlock_lazy_generic	2018-06-13 14:58:36 -04:00
Yann Collet	750ee87a92	Merge pull request #1175 from ryandesign/macos Fix name of macOS	2018-06-13 11:32:06 -04:00
Yann Collet	b2632bcf6c	Merge pull request #1174 from duc0/document_default_level Expose ZSTD_CLEVEL_DEFAULT and update documentation	2018-06-12 12:09:01 -07:00
Duc Ngo	869e2718f6	Line break	2018-06-11 10:02:15 -07:00
Duc Ngo	e8ef725e13	Address comments	2018-06-11 10:01:35 -07:00
Ryan Schmidt	b567ce9d68	Fix name of macOS	2018-06-09 14:31:17 -05:00
Duc Ngo	e34c000e44	Expose ZSTD_CLEVEL_DEFAULT and update documentation	2018-06-08 11:33:44 -07:00
Yann Collet	3050733042	Merge branch 'dev' into negLevels	2018-06-07 15:51:35 -07:00
Yann Collet	c2c47e24e0	support targetlen==0 with strategy==ZSTD_fast to mean "normal compression", targetlen >= 1 now means "disable huffman compression of literals"	2018-06-07 15:49:01 -07:00
Yann Collet	a57b4df85f	removed literalCompression directive in this version, literal compression is always disabled for ZSTD_fast strategy. Performance parity between ZSTD_compress_advanced() and ZSTD_compress_generic()	2018-06-07 15:24:12 -07:00
Yann Collet	8537bfd85c	fuzzer: make negative compression level fail result of ZSTD_compress_advanced() is different from ZSTD_compress_generic() when using negative compression levels because the disabling of huffman compression is not passed in parameters.	2018-06-07 15:12:13 -07:00
Yann Collet	8ef75547ef	Merge pull request #1165 from facebook/ctxSizeDown Dynamic context downsize	2018-06-07 14:44:32 -07:00
Yann Collet	e3c42c739b	clean ZSTD_compress() initialization The (pretty old) code inside ZSTD_compress() was making some pretty bold assumptions on what's inside a CCtx and how to init it. This is pretty fragile by design. CCtx content evolve. Knowledge of how to handle that should be concentrate in one place. A side effect of this strategy is that ZSTD_compress() wouldn't check for BMI2 capability, and is therefore missing out some potential speed opportunity. This patch makes ZSTD_compress() use the same initialization and release functions as the normal creator / destructor ones. Measured on my laptop, with a custom version of bench manually modified to use ZSTD_compress() (instead of the advanced API) : This patch : 1#silesia.tar : 211984896 -> 73651053 (2.878), 312.2 MB/s , 723.8 MB/s 2#silesia.tar : 211984896 -> 70163650 (3.021), 226.2 MB/s , 649.8 MB/s 3#silesia.tar : 211984896 -> 66996749 (3.164), 169.4 MB/s , 636.7 MB/s 4#silesia.tar : 211984896 -> 65998319 (3.212), 136.7 MB/s , 619.2 MB/s dev branch : 1#silesia.tar : 211984896 -> 73651053 (2.878), 291.7 MB/s , 727.5 MB/s 2#silesia.tar : 211984896 -> 70163650 (3.021), 216.2 MB/s , 655.7 MB/s 3#silesia.tar : 211984896 -> 66996749 (3.164), 162.2 MB/s , 633.1 MB/s 4#silesia.tar : 211984896 -> 65998319 (3.212), 130.6 MB/s , 618.6 MB/s	2018-06-07 14:05:25 -07:00
Yann Collet	b27c7389e3	Merge pull request #1164 from GeorgeLu97/CustomMacros Partial Compilation Macros	2018-06-06 16:47:42 -07:00
Yann Collet	24319975b6	bumped version number to v1.3.5	2018-06-06 15:51:55 -07:00
Yann Collet	f1ea383f45	context can be sized down even with constant parameters when parameters are "equivalent", the context is re-used in continue mode, hence needed workspace size is not recalculated. This incidentally also evades the size-down check and action. This patch intercepts the "continue mode" so that the size-down check and action is actually triggered.	2018-06-06 15:04:12 -07:00
Yann Collet	e5e17d009f	changed member name to workSpaceOversizedDuration	2018-06-06 15:00:27 -07:00
Yann Collet	f7392f3dc9	added test case	2018-06-05 14:53:28 -07:00
George Lu	11d5bfdaa9	Revert "Partial compilation test?" This reverts commit `b2496ab606`.	2018-06-05 13:55:36 -07:00
George Lu	b2496ab606	Partial compilation test?	2018-06-05 13:24:00 -07:00
Yann Collet	3d523c741b	added workSpaceTooLarge and workSpaceWasteful also : slightly increased speed of test fuzzer.16	2018-06-05 11:42:48 -07:00
George Lu	b3ef314830	Fix Typos	2018-06-04 17:19:06 -07:00
Yann Collet	357c648c3f	changed a few variable names to unify naming convention	2018-06-04 17:10:50 -07:00
George Lu	609d72b0ca	Added Deprecated Dependencies	2018-06-04 14:33:21 -07:00
George Lu	9437021d2f	Remove old file declaration	2018-06-04 13:32:41 -07:00
George Lu	6a617d70ed	Documentation	2018-06-04 09:56:37 -07:00
George Lu	65de25a463	Created Macros	2018-06-04 09:56:29 -07:00
Yann Collet	2108decb41	Fixed a nasty corruption bug recently introduce into the new dictionary mode. The bug could be reproduced with this command : ./zstreamtest -v --opaqueapi --no-big-tests -s4092 -t639 error was in function ZSTD_count_2segments() : the beginning of the 2nd segment corresponds to prefixStart and not the beginning of the current block (istart == src). This would result in comparing the wrong byte.	2018-06-01 18:54:34 -07:00
Yann Collet	143fc9ff6c	Merge pull request #1157 from facebook/decompressedSize minor : improved zstd.h API code comment	2018-06-01 10:28:17 -07:00
Yann Collet	7c33b48221	Merge pull request #1151 from felixhandte/zstd-dfast-in-place-dict-goto ZSTD_dfast: Support Searching the Dictionary Context In-Place (Alternate `goto` Implementation)	2018-05-31 17:37:09 -07:00
W. Felix Handte	48deab92de	Allow Different Dict Attachment Cut-Offs for Different Strategies	2018-05-31 17:37:44 -04:00
W. Felix Handte	f86796639e	Remove Incorrect and Extraneous Repcode Bounds Check	2018-05-31 17:02:29 -04:00
Yann Collet	9b979d0e33	minor : improved API code comment Extend guarantee that ZSTD_getFrameContentSize() will delivering the decompressed size to any single-pass compression function. Answer #1156	2018-05-31 11:12:18 -07:00
Yann Collet	809f2f9322	minor update of literal cost function just assert() there is no negative cost evaluation for literals	2018-05-29 15:34:50 -07:00
Yann Collet	463a0fe38b	simplified optimal parser removed "cached" structure. prices are now saved in the optimal table. Primarily done for simplification. Might improve speed by a little. But actually, and surprisingly, also improves ratio in some circumstances.	2018-05-29 14:07:25 -07:00
Yann Collet	bb6eaf6495	Merge pull request #1153 from facebook/dynThreshold changed dynamic fse threshold for offset	2018-05-26 08:43:45 -07:00
Yann Collet	e916c365a1	fixed minor visual warning	2018-05-25 20:43:09 -07:00
Yann Collet	a7fdceeccd	changed dynamic fse threshold for offset recent experienced showed that default distribution table for offset can get it wrong pretty quickly with the nb of symbols, while it remains a reasonable choice much longer for lengths symbols. Changed the formula, so that dynamic threshold is now 32 symbols for offsets. It remains at 64 symbols for lengths. Detection based on defaultNormLog	2018-05-25 17:41:16 -07:00
Yann Collet	4b3a36d5d8	Merge branch 'dev' into lowCompression	2018-05-25 15:45:03 -07:00
Yann Collet	5f177f1c53	btultra accepts blocks with poorer compression ratio zstd rejects blocks which do not compress by at least a certain amount. In which case, such block is simply emitted uncompressed (even if a little bit of compression could be achieved). This is better for decompression speed, hence for energy. The logic is controlled by ZSTD_minGain(). The rule is applied uniformly, at all compression levels. This change makes btultra accepts blocks with poor compression ratios. We presume that users of btultra mode prefers compression ratio over some decompress speed gains. The threshold for minimum gain is lowered for btultra from s>>6 (~1.5% minimum gain) to s>>7 (~0.8% minimum gain). This is a prudent change. Not sure if it's large enough.	2018-05-25 15:19:52 -07:00
Yann Collet	e2c0e3d437	slightly nudge choices towards less sequences also slightly improve some strange detrimental corner cases.	2018-05-25 14:52:21 -07:00
W. Felix Handte	5b292b5685	Check Long + 1 Matches in Both Prefix and Dict in Bothe Short Match Paths	2018-05-25 13:13:57 -04:00
W. Felix Handte	88b733b380	Interleave Prefix and Dict Searches	2018-05-25 13:13:57 -04:00
W. Felix Handte	1850025156	Refactor ZSTD_dfast to Use `goto`s	2018-05-25 13:13:57 -04:00
W. Felix Handte	43606f9c83	... When I Said "HashTable", I Meant "ChainTable"	2018-05-25 13:13:28 -04:00
W. Felix Handte	ec7efe88f5	Fix Off-By-One Error	2018-05-25 13:13:28 -04:00
W. Felix Handte	2bfe43267e	Disallow Too-Long Repcodes When Using an Attached Dict	2018-05-25 13:13:28 -04:00
W. Felix Handte	b97ad3f457	Port Changes Made to ZSTD_fast to ZSTD_dfast	2018-05-25 13:13:28 -04:00
W. Felix Handte	2313cca1b7	Implement Second Repcode Check	2018-05-25 13:13:28 -04:00
W. Felix Handte	0998f10813	Implement First Repcode Check	2018-05-25 13:13:28 -04:00
W. Felix Handte	50c5b2bb90	Find Dict Hash Table Matches	2018-05-25 13:13:28 -04:00
W. Felix Handte	7a25f7ef5b	Existing Repcode Check Only Applies to noDict Case	2018-05-25 13:13:28 -04:00
W. Felix Handte	8b241da4df	Properly Initialize Repcode Values	2018-05-25 13:13:28 -04:00
W. Felix Handte	7097a03749	Add Necessary Dict Variables	2018-05-25 13:13:28 -04:00
W. Felix Handte	aacbbf4f9a	Rename 'lowest' to 'localLowest' to Prepare to Introduce Dict Indices	2018-05-25 13:13:28 -04:00
W. Felix Handte	c10d1b4011	Skeleton for In-Place Impl for ZSTD_dfast	2018-05-25 13:13:28 -04:00
Yann Collet	f6ad59ab5c	Merge branch 'dev' into staticDictCost	2018-05-24 16:21:02 -07:00
Yann Collet	b5ef32fea7	Merge branch 'dev' into fracFse	2018-05-24 14:09:49 -07:00
Yann Collet	776128d16f	fix corner case when requiring cost of an FSE symbol ensure that, when frequency[symbol]==0, result is (tableLog + 1) bits with both upper-bit and fractional-bit estimates. Also : enable BIT_DEBUG in /tests	2018-05-24 13:59:11 -07:00
Yann Collet	08c5be5db3	Merge pull request #1117 from felixhandte/zstd-fast-in-place-dict ZSTD_fast: Support Searching the Dictionary Context In-Place	2018-05-23 19:32:25 -07:00
Nick Terrell	06b70179da	Work around bug in zstd decoder (#1147 ) Work around bug in zstd decoder Pull request #1144 exercised a new path in the zstd decoder that proved to be buggy. Avoid the extremely rare bug by emitting an uncompressed block.	2018-05-23 18:02:30 -07:00
Nick Terrell	f2d0924b87	Variable declarations	2018-05-23 14:58:58 -07:00
W. Felix Handte	d9c7e67125	Assert that Dict and Current Window are Adjacent in Index Space	2018-05-23 17:53:03 -04:00
W. Felix Handte	298d24fa57	Make loadedDictEnd an Index, not the Dict Len	2018-05-23 17:53:03 -04:00
W. Felix Handte	7ef85e0618	Fixes in re Comments	2018-05-23 17:53:03 -04:00
W. Felix Handte	582b7f85ed	Don't Attach Empty Dict Contents In weird corner cases, they produce unexpected results...	2018-05-23 17:53:03 -04:00
W. Felix Handte	9c92223468	Avoid Undefined Behavior in Match Ptr Calculation	2018-05-23 17:53:03 -04:00
W. Felix Handte	a44ab3b475	Remove Out-of-Date Comment	2018-05-23 17:53:03 -04:00
W. Felix Handte	95bdf20a87	Moar Renames	2018-05-23 17:53:03 -04:00
W. Felix Handte	7e0402e738	Also Attach Dict When Source Size is Unknown	2018-05-23 17:53:03 -04:00
W. Felix Handte	3ba70cc759	Clear the Dictionary When Sliding the Window	2018-05-23 17:53:03 -04:00
W. Felix Handte	b05ae9b608	Refine ip Initialization to Avoid ARM Weirdness	2018-05-23 17:53:03 -04:00
W. Felix Handte	1a7b34ef28	Use New Index Invariant to Simplify Conditionals	2018-05-23 17:53:03 -04:00
W. Felix Handte	2d598e6fed	Force Working Context Indices Greater than Dict Indices	2018-05-23 17:53:03 -04:00
W. Felix Handte	d005e5daf4	Whitespace Fix	2018-05-23 17:53:03 -04:00
W. Felix Handte	154eb09419	Switch to Original Match Calc for noDict Repcode Check	2018-05-23 17:53:03 -04:00
W. Felix Handte	191fc74a51	Rename 'hasDict' to 'dictMode'	2018-05-23 17:53:03 -04:00
W. Felix Handte	ae4fcf7816	Respond to PR Comments; Formatting/Style/Lint Fixes	2018-05-23 17:53:03 -04:00
W. Felix Handte	ca26cecc7a	Rename and Reformat	2018-05-23 17:53:03 -04:00
W. Felix Handte	66bc1ca641	Change Cut-Off to 8 KB	2018-05-23 17:53:03 -04:00
W. Felix Handte	c31ee3c7f8	Fix Rep Code Initialization	2018-05-23 17:53:03 -04:00
W. Felix Handte	b67196f30d	Coalesce hasDictMatchState and extDict Checks into One Enum and Rename Stuff	2018-05-23 17:53:03 -04:00
W. Felix Handte	265c2869d1	Split Wrapper Functions to Cause Inlining	2018-05-23 17:53:03 -04:00
W. Felix Handte	6929964d65	Add bounds check in repcode tests	2018-05-23 17:53:03 -04:00
W. Felix Handte	70a537d1d7	Initial Repcode Check Support for Ext Dict Ctx	2018-05-23 17:53:03 -04:00
W. Felix Handte	8d24ff0353	Preliminary Support in ZSTD_compressBlock_fast_generic() for Ext Dict Ctx	2018-05-23 17:53:03 -04:00
W. Felix Handte	d18a405779	Refer to the Dictionary Match State In-Place (Sometimes)	2018-05-23 17:53:03 -04:00
Nick Terrell	c92dd11940	Error if reported size is too large in edge case	2018-05-23 14:47:20 -07:00
Nick Terrell	a97e9a627a	[zstd] Fix decompression edge case This edge case is only possible with the new optimal encoding selector, since before zstd would always choose `set_basic` for small numbers of sequences. Fix `FSE_readNCount()` to support buffers < 4 bytes. Credit to OSS-Fuzz	2018-05-23 12:16:00 -07:00
Nick Terrell	e3959d5eba	Fixes	2018-05-22 16:06:33 -07:00
Yann Collet	7a8b3496b4	Merge branch 'dev' into staticDictCost	2018-05-22 15:10:05 -07:00
Yann Collet	a8ddf1d370	disable 2-passes strategy	2018-05-22 15:06:36 -07:00
Nick Terrell	49cf880513	Approximate FSE encoding costs for selection Estimate the cost for using FSE modes `set_basic`, `set_compressed`, and `set_repeat`, and select the one with the lowest cost. * The cost of `set_basic` is computed using the cross-entropy cost function `ZSTD_crossEntropyCost()`, using the normalized default count and the count. * The cost of `set_repeat` is computed using `FSE_bitCost()`. We check the previous table to see if it is able to represent the distribution. * The cost of `set_compressed` is computed with the entropy cost function `ZSTD_entropyCost()`, together with the cost of writing the normalized count `ZSTD_NCountCost()`.	2018-05-22 14:33:22 -07:00
Yann Collet	27af35c110	Merge pull request #1143 from facebook/tableLevels Update table of compression levels	2018-05-19 14:40:37 -07:00
Yann Collet	5381369cb1	Merge branch 'dev' into tableLevels	2018-05-18 18:23:27 -07:00
Yann Collet	b0b3fb517d	updated compression levels for blocks of 256KB	2018-05-18 17:17:12 -07:00
Nick Terrell	7cbb8bbbbf	[cover] Small compression ratio improvement The cover algorithm selects one segment per epoch, and it selects the epoch size such that `epochs * segmentSize ~= dictSize`. Selecting less epochs gives the algorithm more candidates to choose from for each segment it selects, and then it will loop back to the first epoch when it hits the last one. The trade off is that now it takes longer to select each segment, since it has to look at more data before making a choice. I benchmarked on the following data sets using this command: ```sh $ZSTD -T0 -3 --train-cover=d=8,steps=256 $DIR -r -o dict && $ZSTD -3 -D dict -rc $DIR \| wc -c ``` \| Data set \| k (approx) \| Before \| After \| % difference \| \|--------------\|------------\|----------\|----------\|--------------\| \| GitHub \| ~1000 \| 738138 \| 746610 \| +1.14% \| \| hg-changelog \| ~90 \| 4295156 \| 4285336 \| -0.23% \| \| hg-commands \| ~500 \| 1095580 \| 1079814 \| -1.44% \| \| hg-manifest \| ~400 \| 16559892 \| 16504346 \| -0.34% \| There is some noise in the measurements, since small changes to `k` can have large differences, which is why I'm using `steps=256`, to try to minimize the noise. However, the GitHub data set still has some noise. If I run the GitHub data set on my Mac, which presumably lists directory entries in a different order, so the dictionary builder sees the files in a different order, or I use `steps=1024` I see these results. \| Run \| Before \| After \| % difference \| \|------------\|--------\|--------\|--------------\| \| steps=1024 \| 738138 \| 734470 \| -0.50% \| \| MacBook \| 738451 \| 737132 \| -0.18% \| Question: Should we expose this as a parameter? I don't think it is necessary. Someone might want to turn it up to exchange a much longer dictionary building time in exchange for a slightly better dictionary. I tested `2`, `4`, and `16`, and `4` got most of the benefit of `16` with a faster running time.	2018-05-18 16:15:27 -07:00
Yann Collet	5cbef6e094	Merge branch 'dev' into staticDictCost	2018-05-18 16:03:06 -07:00
Yann Collet	a95e9e80d1	adding some debug functions to observe statistics	2018-05-18 14:09:42 -07:00
fbrosson	291824f49d	__builtin_prefetch did probably not exist before gcc 3.1.	2018-05-18 18:40:11 +00:00
fbrosson	16bb8f1f9e	Drop colon in asm snippet to make old versions of gcc happy.	2018-05-18 17:05:36 +00:00
Yann Collet	af3da079d1	fixed minor conversion warning	2018-05-17 17:27:27 -07:00
Yann Collet	8572b4d09f	fixed a pretty complex bug when combining ldm + btultra	2018-05-17 16:13:53 -07:00
Yann Collet	134388ba6b	collect statistics for first block in ultra mode this patch makes btultra do 2 passes on the first block, the first one being dedicated to collecting statistics so that the 2nd pass is more accurate. It translates into a very small compression ratio gain : enwik7, level 20: blocks 4K : 2.142 -> 2.153 blocks 16K : 2.447 -> 2.457 blocks 64K : 2.716 -> 2.726 On the other hand, the cpu cost is doubled. The trade off looks bad. Though, that's ultimately a price to pay to reach better compression ratio. So it's only enabled when setting btultra.	2018-05-17 12:24:30 -07:00
Yann Collet	a243020d37	slightly improved weight calculation translating into a tiny compression ratio improvement	2018-05-17 11:19:44 -07:00
Yann Collet	63eeeaa1dd	update table levels for blocks <= 16K also : allow hlog to be slighly larger than windowlog, as it's apparently good for both speed and compression ratio.	2018-05-16 16:13:37 -07:00
Yann Collet	18fc3d3cd5	introduced bit-fractional cost evaluation this improves compression ratio by a tiny amount. It also reduces speed by a small amount. Consequently, bit-fractional evaluation is only turned on for btultra.	2018-05-16 14:53:35 -07:00
Yann Collet	9938b17d4c	Merge pull request #1135 from facebook/frameCSize decompress: changed error code when input is too large	2018-05-15 11:02:53 -07:00
Nick Terrell	30d9c84b1a	Fix failing Travis tests	2018-05-15 09:46:20 -07:00
Yann Collet	0b31304c8d	Merge branch 'dev' into staticDictCost	2018-05-14 18:09:26 -07:00
Yann Collet	2c26df0e13	opt: removed static prices after testing, it's actually always better to use dynamic prices albeit initialised from dictionary.	2018-05-14 18:04:08 -07:00
Yann Collet	f372ffc64d	Merge pull request #1127 from facebook/staticDictCost Improved optimal parser with dictionary	2018-05-14 17:45:50 -07:00
Yann Collet	d59cf02df0	decompress: changed error code when input is too large ZSTD_decompress() can decompress multiple frames sent as a single input. But the input size must be the exact sum of all compressed frames, no more. In the case of a mistake on srcSize, being larger than required, ZSTD_decompress() will try to decompress a new frame after current one, and fail. As a consequence, it will issue an error code, ERROR(prefix_unknown). While the error is technically correct (the decoder could not recognise the header of _next_ frame), it's confusing, as users will believe that the first header of the first frame is wrong, which is not the case (it's correct). It makes it more difficult to understand that the error is in the source size, which is too large. This patch changes the error code provided in such a scenario. If (at least) a first frame was successfully decoded, and then following bytes are garbage values, the decoder assumes the provided input size is wrong (too large), and issue the error code ERROR(srcSize_wrong).	2018-05-14 15:32:28 -07:00
Yann Collet	c9227ee16b	update table for 128 KB blocks	2018-05-13 17:15:07 -07:00
Yann Collet	b4250489cf	update compression levels for large inputs	2018-05-13 01:53:38 -07:00
Yann Collet	761758982e	replaced FSE_count by FSE_count_simple to reduce usage of stack memory. Also : tweaked a few comments, as suggested by @terrelln	2018-05-11 16:03:37 -07:00
Yann Collet	3193d692c2	minor patch, ensuring LIBDIR is created before installation follow-up from #1123	2018-05-11 11:31:48 -07:00
Yann Collet	99ddca43a6	fixed wrong assertion base can actually overflow	2018-05-10 19:48:09 -07:00
Yann Collet	0d7626672d	fixed c++ conversion warning	2018-05-10 18:17:21 -07:00
Yann Collet	09d0fa29ee	minor adjusting of weights	2018-05-10 18:13:48 -07:00
Yann Collet	1a26ec6e8d	opt: init statistics from dictionary instead of starting from fake "default" statistics.	2018-05-10 17:59:12 -07:00
Yann Collet	74b1c75d64	btopt : minor adjustment of update frequencies	2018-05-10 16:32:36 -07:00
Yann Collet	ac6105463a	opt: minor improvements to log traces slight improvement when using fractional-bit evaluation (opt:dictionay)	2018-05-09 15:46:11 -07:00
Yann Collet	c39061cb7b	fixed declaration-after-statement warning	2018-05-09 12:07:25 -07:00
Yann Collet	4d5bd32a00	added traces to look at symbol costs evaluation looks correct.	2018-05-09 12:00:12 -07:00
Yann Collet	c0da0f5e9e	switchable bit-approximation / fractional-bit accuracy modes also : makes it possible to select nb of fractional bits.	2018-05-09 10:48:09 -07:00
Yann Collet	ba2ad9b6b9	implemented fractional bit cost evaluation for FSE symbols. While it seems to work, the gains are negligible compared to rough maxNbBits evaluation. There are even a few losses sometimes, that still need to be explained. Furthermode, there are still cases where btlazy2 does a better job than btopt, which seems rather strange too.	2018-05-08 17:43:13 -07:00
Yann Collet	1aff63b114	opt: shift all costs by 8 bits (* 256) making it possible to represent fractional bit costs.	2018-05-08 16:19:04 -07:00
Yann Collet	6a3c34aa58	opt: estimate cost of both Hufman and FSE symbols For FSE symbols : provide an upper bound, in nb of bits, since cost function is not able to store fractional bit costs.	2018-05-08 16:11:21 -07:00
Yann Collet	338f738c24	pass entropy tables to optimal parser for proper estimation of symbol's weights when using dictionary compression. Note : using only huffman costs is not good enough, presumably because sequence symbol costs are incorrect.	2018-05-08 15:37:06 -07:00
Yann Collet	a155061328	minor code refactor for readability removed some useless operations from optimal parser (should not change performance, too small a difference)	2018-05-08 12:32:44 -07:00
Baruch Siach	9a0643b633	lib/Makefile: create include directory before headers installation Make sure that $(INCLUDEDIR) exists before copying the headers there. Otherwise, the contest of header files is copied over $(DESTDIR)$(INCLUDEDIR), making it a regular file. While at it, remove $(DESTDIR)$(INCLUDEDIR) from the list of directories to create in the install-pc target. The install-pc target does not need this directory.	2018-05-08 20:59:44 +03:00
Yann Collet	ad4524d605	fix ZSTD_compressBlock() associated with CDict reported by @let-def. It's actually a bug in ZSTD_compressBegin_usingCDict() which would pass a wrong pledgedSrcSize value (0 instead of ZSTD_CONTENTSIZE_UNKNOWN) resulting in wrong window size, resulting in downsized seqStore, resulting in segfault when writing into the seqStore later in the process. Added a test in fuzzer to cover this use case (fails before the patch).	2018-05-07 12:54:13 -07:00
Peter Seiderer	64bfdca5b9	Split library install target into pc, static, shared and include only target Signed-off-by: Peter Seiderer <ps.report@gmx.net>	2018-04-30 20:32:32 +02:00
Nick Terrell	ca77822ddf	Fix parameter adjustment with dictionary The new advanced API basically set `requestedParams = appliedParams` when using a dictionary. This halted all parameter adjustment, which can hurt compression ratio if, for example, the window log is small for the first call, but the rest of the files are large. This patch fixes the bug, and checks that the `requestedParams` don't change in the new advanced API when using a dictionary, and generally in the fuzzer.	2018-04-25 16:32:29 -07:00
Yann Collet	12f60b8c98	clarified documentation related to refPrefix()	2018-04-25 10:17:06 -07:00
Yann Collet	ace856a835	updated documentation of streaming compression api	2018-04-24 14:44:27 -07:00
taigacon	2c3ad05812	Fix the problem that enables DYNAMIC_BMI2 macro by mistake on ARM architecture with Clang (#1110 )	2018-04-23 15:41:50 -07:00
Nick Terrell	e8c9dc5cea	Fix documentation	2018-04-13 12:43:38 -07:00
Nick Terrell	c0987986e5	Only reset CDict in ZSTD_CCtx_resetParameters()	2018-04-13 11:26:40 -07:00
Nick Terrell	9f76eebd17	Add ZSTD_CCtx_resetParameters() function * Fix docs for `ZSTD_CCtx_reset()`. * Add `ZSTD_CCtx_resetParameters()`. Fixes #1094.	2018-04-12 16:54:07 -07:00
Nick Terrell	3c3f59e68f	Enforce pledgeSrcSize whenever known (#1106 ) The test fails before the patch and passes after. Fixes #1095.	2018-04-12 16:02:03 -07:00
Nick Terrell	280a236e9e	Add ZSTD_CCtx(Param)?_getParameter() function Closes #1096.	2018-04-12 11:50:12 -07:00
Yann Collet	04212178b5	doc : clarified advanced API usage sticky parameters only work with `ZSTD_compress_generic()`	2018-04-10 11:40:36 -07:00
Yann Collet	ad5ba6cdcf	updated comment on parameters that can be changed during compression	2018-04-09 17:39:07 -07:00
Yann Collet	1da629f2ad	Merge pull request #1104 from terrelln/fast-train Allow negative compression levels in training	2018-04-09 14:16:20 -07:00
Nick Terrell	569e2abccd	Allow negative compression levels in training * Set `dictCLevel` in `zstdcli.c`. * Only set to default level if the compression level `== 0`, not `<= 0`.	2018-04-09 12:12:03 -07:00
Yann Collet	4195b36dd7	Merge pull request #1100 from bket/stable_sort zstd requires a stable sort.	2018-04-05 11:39:27 -07:00
Yann Collet	f35b8ba9da	updated ZSTD_p_chainLog description	2018-04-05 11:05:11 -07:00
Björn Ketelaars	462aed6811	zstd requires a stable sort. On OpenBSD qsort() is not guaranteed to be stable, their mergesort() is. This fixes issue #1088. All the hard work has been done by @terrelln.	2018-04-05 07:59:16 +02:00
Yann Collet	55f67502f4	Merge pull request #1098 from terrelln/nd-mt Only load extra table positions for CDicts	2018-04-02 15:38:20 -07:00
Nick Terrell	295ab0dbfa	Only load extra table positions for CDicts Zstdmt uses prefixes to load the overlap between segments. Loading extra positions makes compression non-deterministic, depending on the previous job the context was used for. Since loading extra position takes extra time as well, only do it when creating a `ZSTD_CDict`. Fixes #1077.	2018-04-02 14:41:30 -07:00
Yann Collet	5b616fa269	Merge pull request #1090 from bket/openbsd Fix building zstd on OpenBSD.	2018-04-02 14:15:26 -07:00
Björn Ketelaars	9d3048346d	Fix building zstd on OpenBSD.	2018-03-31 10:46:20 +02:00
Yann Collet	8be984ec45	fixed comments as suggested by @terrelln	2018-03-30 20:09:27 -07:00
Yann Collet	e6e848bfe9	added ZSTD_getFrameHeader_advanced() makes it possible to request frame header from a magicless frame	2018-03-29 17:51:08 -06:00
Yann Collet	a6694838e1	added more code documentation for ZSTD_getFrameHeader()	2018-03-29 15:24:17 -06:00
René Rebe	21eb26d664	fixed legacy/zstd_v* with older gcc version, by guarding builtin_* like in other files	2018-03-25 20:35:15 +02:00
Yann Collet	ad15c1b724	added __has_attribute() define for non-clang compilers	2018-03-23 19:04:48 -07:00
Yann Collet	52ca7c6c56	make DYNAMIC_BMI2 support of clang conditional to __has_attribute() to support older clang versions such as 3.4	2018-03-23 18:45:42 -07:00
Yann Collet	29b021f9a0	Merge pull request #1067 from facebook/targetLength removed limit ZSTD_TARGETLENGTH_MAX	2018-03-22 10:38:33 -07:00
Nick Terrell	ad344033df	Fix broken assertion The `avgJobSize` must not be lower than 256 KB for single-pass mode. In `zstd.h` we say the minimum value for `ZSTD_p_jobSize` is 1 MB, so ensure that we always pick a size >= 1 MB. Found by libFuzzer fuzzer tests with large input limits.	2018-03-21 16:20:30 -07:00
Yann Collet	153bc1c004	removed limit ZSTD_TARGETLENGTH_MAX this makes it possible to specify extremely large negative compression levels, achieving the side effect as "no compression". It will also be possible to define larger targetlength for ultra compression mode. There is no adverse side effect due to removing this limit.	2018-03-21 15:50:05 -07:00
Yann Collet	a99c4a3621	Merge branch 'dev' into advancedDecompress	2018-03-21 06:08:28 -07:00
Yann Collet	87b0cf05bd	Merge pull request #1057 from facebook/lrmSettings LRM parameters	2018-03-21 05:59:39 -07:00
Yann Collet	d1bf609abf	Merge pull request #1059 from terrelln/mt-ldm Integrate ldm with zstdmt	2018-03-20 17:50:20 -07:00
Yann Collet	e0cb8d19c6	fixed legacy test case	2018-03-20 17:48:22 -07:00
Yann Collet	878728dc26	fixed several comments by @terrelln	2018-03-20 16:35:14 -07:00
Yann Collet	e1c52faace	Merge pull request #1060 from facebook/compressImpl merge bmi2 implementation of encodeSequence into zstd_compress.c	2018-03-20 16:19:42 -07:00
Yann Collet	6cda8c932c	added test with ZSTD_decompress_generic() + ZSTD_DCtx_refPrefix() also : clarified stage condition to accept new parameters, fixed initializers correspondingly.	2018-03-20 16:16:13 -07:00
Yann Collet	0dadb6b70d	implemented ZSTD_DCtx_refPrefix*()	2018-03-20 15:45:56 -07:00
Yann Collet	569b8ba4d9	implemented ZSTD_DCtx_refDDict()	2018-03-20 15:43:49 -07:00
Nick Terrell	a3b76a77ef	Quiet appveyor warnings	2018-03-20 15:34:40 -07:00
Yann Collet	6873fec658	changed dictMore for dictContentType which seems clearer to describe what the variable/argument is about.	2018-03-20 15:13:14 -07:00
Yann Collet	31b54b6eea	updated ZSTD_initStaticDDict() prototype can also specify dictContentType.	2018-03-20 14:52:02 -07:00
Nick Terrell	136b9e2392	Fix external sequence corner cases * Clear external sequences when we reset the `ZSTD_CCtx`. * Skip external sequences when a block is too small to compress.	2018-03-20 14:50:28 -07:00
Yann Collet	353117c5d7	implemented ZSTD_DCtx_loadDictionary*() this required updating ZSTD_createDDict_advanced() to accept a dictContentType parameter (raw, full, auto).	2018-03-20 13:40:29 -07:00
Yann Collet	451357f37f	Merge pull request #1058 from facebook/cctxParams updated CCtxParams API	2018-03-20 12:36:12 -07:00
Yann Collet	2ed5af0766	merge bmi2 implementation of encodeSequence into zstd_compress.c	2018-03-19 19:10:31 -07:00
Nick Terrell	d19f803a3b	Fix window size for 1 worker + flushing	2018-03-19 18:56:39 -07:00
Nick Terrell	24d9edbdd8	Set ldmParams to 0 when disabled	2018-03-19 18:23:54 -07:00
Nick Terrell	4b92574feb	Fix corner cases exposed by zstreamtest	2018-03-19 17:54:04 -07:00
Nick Terrell	94c77710a9	Integrate ldm with zstdmt Integrate ldm into zstdmt by running it in serial and in order in the first step of each job, in the same place as the hash gets updated. The input buffer is sized to fit the whole LDM window and 2 full buffers of slack. Input buffers cannot be reused until the LDM step is done with them. After the LDM step is finished, the jobs don't actually have access to the full window, only the overlap. Tested on a few different multi-GB files with and without sanitizers, and with different numbers of threads.	2018-03-19 16:29:03 -07:00
Nick Terrell	aa4dbd09a1	Pull job/overlap log logic into common function (#1055 ) Prepares for LDM integration by separating the job size and overlap logic into helper functions.	2018-03-19 15:56:36 -07:00
Yann Collet	c8b3d389fd	updated CCtxParams API to respect naming convention : ZSTD_CCtxParams_*()	2018-03-19 15:07:26 -07:00
Yann Collet	6f4d0778a5	make it possible to express compression parameters in any order	2018-03-19 14:41:23 -07:00
Nick Terrell	2253d01b27	Move XXH64_update() into worker threads * Computes the XXH hash in the worker threads. * Workers get a sequence number and wait until ther number shows up. On error, ensures that its sequence is finished, so future threads don't get blocked. * Sets up for ldm integration, which will go in the same spot.	2018-03-19 11:08:27 -07:00
Yann Collet	9618c0c804	make it possible to specify LDM parameters in any order	2018-03-19 11:07:04 -07:00
Yann Collet	ec0959e701	Merge branch 'dev' into mt-single	2018-03-18 01:06:31 -07:00
Nick Terrell	4af1fafeb8	Restore setting loadedDictEnd Setting `loadedDictEnd` was accidently removed from `ZSTD_loadDictionaryContent()`, which means that dictionary compression will only be able to reference the parts of the dictionary within the window. The spec allows us to reference the entire dictionary so long as even one byte is in the window. `ZSTD_enforceMaxDist()` incorrectly always allowed offsets up to `loadedDictEnd` beyond the window, even once the dictionary was out of range. When overflow protection kicked in, the check `current > loadedDictEnd + maxDist` is incorrect if `loadedDictEnd` isn't reset back to zero. `current` could be reset below the value, which would incorrectly allow references beyond the window. This bug is present in `master`, but is very hard to trigger, since it requires both dictionaries and data which triggers overflow correction.	2018-03-16 14:54:06 -07:00
Yann Collet	cbc71e40f6	moving LRM parameters out of experimental section into "normal" range, start pinned at 160.	2018-03-15 17:22:40 -07:00
Nick Terrell	f15a17e19f	Use a single buffer in zstdmt Summary: Allocate a single input buffer large enough to house each job, as well as enough space for the IO thread to write 2 extra buffers. One goes in the `POOL` queue, and one to fill, and then block on a full `POOL` queue. Since we can't overlap with the prefix, we allocate space for 3 extra input buffers. Test Plan: * CI * With and without ASAN/UBSAN run zstdmt with different number of threads on two large binaries, and verify that their checksums match. * Test on the tip of the zstdmt ldm integration. Reviewers: cyan Differential Revision: https://phabricator.intern.facebook.com/D7284007 Tasks: T25664120	2018-03-15 16:21:33 -07:00
Yann Collet	192542b63c	Merge pull request #1047 from facebook/hufCompress removed huf_compress_impl.h	2018-03-15 14:14:03 -07:00
Nick Terrell	a271399c97	Expose reference external sequence API Summary: * Expose the reference external sequences API for zstdmt. Allows external sequences of any length, which get split when necessary. * Reset the LDM window when the context is reset. * Store the maximum number of LDM sequences. * Sequence generation now returns the number of last literals. * Fix sequence generation to not throw out the last literals when blocks of more than 1 MB are encountered. Expose reference external sequence API * Expose the reference external sequences API for zstdmt. * Allows external sequences of any length, which get split when necessary. * Reset the LDM window when the context is reset. * Store the maximum number of LDM sequences. * Sequence generation now returns the number of last literals. * Fix sequence generation to not throw out the last literals when blocks of more than 1 MB are encountered. Test Plan: * CI * Test the zstdmt ldm integration stacked on top of this diff Reviewers: cyan Differential Revision: https://phabricator.intern.facebook.com/D7283968 Tasks: T25664120	2018-03-14 18:07:53 -07:00
Nick Terrell	1908c92c46	Merge remote-tracking branch 'upstream/dev' into extern-seq * upstream/dev: Fix overflow protection with wlog=31	2018-03-14 17:26:31 -07:00
Yann Collet	a909c293c6	Merge branch 'dev' into hufCompress	2018-03-14 16:11:25 -07:00
Nick Terrell	a9a6dcba63	Expose reference external sequence API * Expose the reference external sequences API for zstdmt. Allows external sequences of any length, which get split when necessary. * Reset the LDM window when the context is reset. * Store the maximum number of LDM sequences. * Sequence generation now returns the number of last literals. * Fix sequence generation to not throw out the last literals when blocks of more than 1 MB are encountered.	2018-03-14 12:29:31 -07:00
Nick Terrell	33fb966e56	Fix overflow protection with wlog=31 The overflow protection is broken when the window log is `> (3U << 29)`, so 31. It doesn't work when `current` isn't around `1U << windowLog` ahead of `lowLimit`, and the the assertion `current > newCurrent` fails. This happens when the same context is used many times over, but with a large window log, like in zstdmt. Fix it by triggering correction based on `nextSrc - base` instead of `lowLimit`. The added test fails before the patch, and passes after.	2018-03-14 11:45:44 -07:00
Yann Collet	4c5cbac179	Merge pull request #1041 from facebook/fasterFast Negative compression levels	2018-03-13 21:32:46 -07:00
Yann Collet	50f763ec44	fixed several comments are underlined by @terrelln	2018-03-13 14:23:14 -07:00
Yann Collet	a95a88af57	removed huf_compress_impl.h re-imported all functions inside huf_compress.c for easier source editing. Also updated a bunch of code comments for clarification.	2018-03-13 14:14:05 -07:00
Yann Collet	bd7bb94361	Merge pull request #1044 from baldurk/remove-utf8-characters Remove non-ASCII characters in header file comments	2018-03-13 13:22:07 -07:00
Baldur Karlsson	430a2fec19	Remove non-ASCII characters in header file comments * Replaced a non-breaking space and an en dash with a plain space and a hyphen. * This means the files are simple ASCII and less likely to run into codepage issues.	2018-03-13 20:05:53 +00:00
Yann Collet	530eeb41a7	Merge pull request #1039 from facebook/zstd_decompress Removed zstd_decompress_impl.h	2018-03-12 18:21:46 -07:00
Yann Collet	2291b85a1e	changed ZSTD_p_literalCompression into ZSTD_p_compressLiterals prefer verb+object construction	2018-03-12 11:44:10 -07:00
Yann Collet	a57d43d4d4	updated documentation of targetLength	2018-03-12 11:35:01 -07:00
Yann Collet	6a9b41b731	create command --fast[=#] access negative compression levels from command line for both compression and benchmark modes. also : ensure proper propagation of parameters through ZSTD_compress_generic() interface. added relevant cli tests.	2018-03-11 20:01:23 -07:00
Yann Collet	a146ee04ae	added negative compression levels negative compression level trade compression ratio for more compression speed. They turn off huffman compression of literals, and use row 0 as baseline with a stepSize = -cLevel. added associated test in fuzzer also added : new advanced parameter ZSTD_p_literalCompression	2018-03-11 05:21:53 -07:00
Yann Collet	facc09aa03	minor compression level adaptation level 12 compresses slightly more and faster due to better btlazy2 mode	2018-03-11 03:06:52 -07:00
Yann Collet	fe321f9e2a	re-integrate ZSTD_decompressSequencesLong() into zstd_decompress.c removed zstd_decompress_impl.h	2018-03-09 19:48:06 -08:00
Yann Collet	89a2ebb971	incorporated ZSTD_decompressSequences() into zstd_decompress()	2018-03-09 19:35:57 -08:00
Yann Collet	cdb1f1433e	incorporated ZSTD_initFseState() inside zstd_decompress.c	2018-03-09 18:16:10 -08:00
Yann Collet	a166eae1ba	incorporate ZSTD_decodeSequenceLong() within zstd_decompress.c	2018-03-09 18:11:14 -08:00
Yann Collet	17626ba56e	restored ZSTD_decodeSequence() into zstd_decompress.c	2018-03-09 18:03:25 -08:00
Yann Collet	51169575a8	Merge pull request #1036 from terrelln/thread-void [threading] Cast unused arguments to void	2018-03-07 12:14:05 -08:00
Nick Terrell	7e103cdaf5	[threading] Cast unused arguments to void	2018-03-06 18:36:40 -08:00
Yann Collet	db147ea620	improved comments following @terrelln suggestions	2018-03-06 18:15:26 -08:00
Yann Collet	06ca9c7d7c	fixed 0-seq blocks in block-decompression mode	2018-03-06 01:50:19 -08:00
Yann Collet	9a91afe6ef	long offset mode : new default threshold for 32-bit	2018-03-05 16:41:08 -08:00
Yann Collet	7bd7a3ad43	long offset mode : new default threshold for 64-bits mode	2018-03-05 16:16:49 -08:00
Yann Collet	c0393a538f	fixed counting long distance weights	2018-03-05 15:12:10 -08:00
Yann Collet	41bd10446e	Merge branch 'dev' into longOffsetMode	2018-03-05 13:10:10 -08:00
Yann Collet	cb789d2df8	re-inserted offset evaluation	2018-03-05 13:08:59 -08:00
Yann Collet	b91ddf0ae6	Merge branch 'dev' into longOffsetMode	2018-03-05 11:59:54 -08:00
Yann Collet	d02b44cf55	DYNAMIC_BMI2 enabled for clang clang only claims compatibility with gcc 4.2. Consequently, recent patch which reserved DYNAMIC_BMI2 for gcc >= 4.8 also disabled it for clang. fix : __clang__ is now enough to enable DYNAMIC_BMI2 (associated with other existing conditions : x64/x64, !bmi2)	2018-03-04 16:05:59 -08:00
Yann Collet	45b09e7625	limit DYNAMIC_BMI2 to gcc >= 4.8 attribute bmi2 not supported by gcc 4.4	2018-03-01 15:02:18 -08:00
Yann Collet	b01552a07a	force inlining of HUF_decodeSymbol*() functions which was not done properly by gcc 4.8 resulting in major performance difference. ex : zstd -b1 silesia.tar before : dec 680 MB/s after : dec 710 MB/s (without bmi2) after : dec 770 MB/s (with DYNAMIC_BMI2)	2018-03-01 11:31:45 -08:00
Yann Collet	ccb7184a76	Merge pull request #1026 from terrelln/lrm-window LDM manages its own window round buffer	2018-02-27 17:09:10 -08:00
Nick Terrell	0a0e64c641	LDM manages its own window round buffer	2018-02-27 12:13:23 -08:00
Yann Collet	2c4d3f339a	Merge pull request #1025 from facebook/huf Huf	2018-02-27 09:57:01 -08:00
Yann Collet	33a3f18848	fixed wrong size test	2018-02-26 18:27:51 -08:00
Yann Collet	89741653ab	added error code workSpace_tooSmall	2018-02-26 15:11:50 -08:00
Yann Collet	6cdf690441	minor cleaning of huff0 Update code documentation, and properly names a few "magic constants". Also, HUF_compress_internal() gets a cleaner way to determine size of tables inside workspace.	2018-02-26 14:52:23 -08:00
Nick Terrell	6b88d592fd	Reduce ZSTD_CHAINLOG_MAX to 29 in 32-bit mode	2018-02-26 13:30:24 -08:00
Nick Terrell	7e5e226cbf	Split the window state into substructure	2018-02-26 13:29:57 -08:00
Yann Collet	50bc2ce95e	Merge pull request #1021 from terrelln/lrm-split Split block compresser out of long range matcher	2018-02-23 17:36:51 -08:00
Yann Collet	653383f74a	minor nit from Mac XCode	2018-02-22 15:44:26 -08:00
Nick Terrell	7e2bf4ebad	Remove long range matcher immediate repcode check The compression ratio gets about 0.01% worse on the files I tested, but the code is much simpler.	2018-02-22 15:18:47 -08:00
Nick Terrell	af866b3a58	Split block compresser out of long range matcher * `ZSTD_ldm_generateSequences()` generates the LDM sequences and stores them in a table. It should work with any chunk size, but is currently only called one block at a time. * `ZSTD_ldm_blockCompress()` emits the pre-defined sequences, and instead of encoding the literals directly, it passes them to a secondary block compressor. The code to handle chunk sizes greater than the block size is currently commented out, since it is unused. The next PR will uncomment exercise this code. * During optimal parsing, ensure LDM `minMatchLength` is at least `targetLength`. Also don't emit repcode matches in the LDM block compressor. Enabling the LDM with the optimal parser now actually improves the compression ratio. * The compression ratio is very similar to before. It is very slightly different, because the repcode handling is slightly different. If I remove immediate repcode checking in both branches the compressed size is exactly the same. * The speed looks to be the same or better than before. Up Next (in a separate PR) -------------------------- Allow sequence generation to happen prior to compression, and produce more than a block worth of sequences. Expose some API for zstdmt to consume. This will test out some currently untested code in `ZSTD_ldm_blockCompress()`.	2018-02-22 15:18:41 -08:00
Yann Collet	0fd4df6ed3	Implemented BMI2 functions directly within huf_decompress.c This makes it easier to edit for maintenance and evolutions (I plan to experiment modifications in huffman decompression functions). The methology followed seems broadly applicable to other BMI2 modules. Performance was tracked rigorously at each step, there is no noticeable loss (nor win) of performance compared to `#include` version. Note however that 4X decoder variants tend to be extremely sensitive to code alignment. This source code resulted in pretty good performance for gcc 7.2 and 7.3, but future changes (even in other parts of the code) might trigger the issue again.	2018-02-22 10:51:47 -08:00
Yann Collet	9c5a8040a9	fixed huf_compress workspace size	2018-02-21 11:34:49 -08:00
Yann Collet	010ba5f71f	Merge pull request #1017 from terrelln/c-bmi2 [compress] Support BMI2	2018-02-20 15:34:59 -08:00
Nick Terrell	6e128d3534	[BMI2] Add comments to the bmi2 variable in the contexts	2018-02-20 14:12:11 -08:00
Yann Collet	70163bf0d3	added clarification comments in zstd_errors.h answering some points in #1018	2018-02-20 12:54:49 -08:00
Yann Collet	7117ea8bec	Merge pull request #1011 from terrelln/bmi2 [decompress] Support BMI2	2018-02-15 11:40:34 -08:00
Nick Terrell	b58f01537e	[compress] Support BMI2	2018-02-14 19:20:32 -08:00
Nick Terrell	4319132312	[decompress] Support BMI2	2018-02-13 17:00:15 -08:00
Yann Collet	5cb1144872	fixed --single-thread was incorrectly set to -T0 (use as many cores as possible) previously	2018-02-13 14:56:35 -08:00
Yann Collet	2524cbd847	added code comment on how to generate default tables as suggested by @terrelln	2018-02-13 10:02:25 -08:00
Yann Collet	71c07966bb	added SEQSYMBOL_TABLE_SIZE() as suggested by @terrelln's comment	2018-02-12 16:52:15 -08:00
Yann Collet	5f7495371e	Merge branch 'dev' into fasterDec	2018-02-10 14:24:44 -08:00
Yann Collet	9945e60ac4	Merge branch 'dev' into flexibleLevel	2018-02-10 11:54:49 -08:00
Yann Collet	04a3f85ce7	fixed gcc warning on a switch code path	2018-02-09 16:16:27 -08:00
Yann Collet	af48f0b62b	fix : offset table pointer when using default table	2018-02-09 15:15:46 -08:00
Yann Collet	426944c3e3	fixed strict aliasing issue tuned threshold	2018-02-09 13:24:11 -08:00
Yann Collet	64ee732694	decide long-offset mode based on offcode statistics threshold vaguely estimated	2018-02-09 12:33:28 -08:00
Yann Collet	c72091556b	fixed minor nit as per @terrelln's comments	2018-02-09 09:46:08 -08:00
Yann Collet	4beaeaace5	Merge branch 'dev' into flexibleLevel	2018-02-09 09:15:05 -08:00
Yann Collet	6bfe50ad48	re-enabled ZSTD_decompressSequencesLong()	2018-02-09 09:14:25 -08:00
Yann Collet	1850597eaa	pre-calculated default decoding tables	2018-02-09 06:01:02 -08:00
Yann Collet	ab75df21ed	fixed mono-symbol distribution	2018-02-09 05:12:13 -08:00
Yann Collet	421a2716d8	fixed default fse distributions but would be better to pre-calculate tables, for speed	2018-02-09 04:50:58 -08:00
Yann Collet	95424409ea	addBits and baseline into FSE decoding table note : unfinished - need new default tables - need modify long mode	2018-02-09 04:25:15 -08:00
Yann Collet	de68c2ff10	Merged ZSTD_preserveUnsortedMark() into ZSTD_reduceIndex() as it's faster, due to one memory scan instead of two (confirmed by microbenchmark). Note : as ZSTD_reduceIndex() is rarely invoked, it does not translate into a visible gain. Consider it an exercise in auto-vectorization and micro-benchmarking.	2018-02-07 14:22:35 -08:00
Yann Collet	0170cf9a7a	minor : modified ZSTD_preserveUnsortedMark() to be more vectorization friendly	2018-02-05 11:46:02 -08:00
Yann Collet	94efb1749d	faster decoding in 32-bits mode for long offsets (tentative) On my laptop: Before: ./zstd32 -b --zstd=wlog=27 silesia.tar enwik8 -S 3#silesia.tar : 211984896 -> 66683478 (3.179), 97.6 MB/s , 400.7 MB/s 3#enwik8 : 100000000 -> 35643153 (2.806), 76.5 MB/s , 303.2 MB/s After: ./zstd32 -b --zstd=wlog=27 silesia.tar enwik8 -S 3#silesia.tar : 211984896 -> 66683478 (3.179), 97.4 MB/s , 435.0 MB/s 3#enwik8 : 100000000 -> 35643153 (2.806), 76.2 MB/s , 338.1 MB/s Mileage vary, depending on file, and cpu type. But a generic rule is : x86 benefits less from "long-offset mode" than x64, maybe due to register pressure. On "entropy", long-mode is _never_ a win for x86. On my laptop though, it may, depending on file and compression level (enwik8 benefits more from "long-mode" than silesia).	2018-02-04 01:49:31 -08:00
Yann Collet	5188749e1c	ensure compression parameters are updated when only compression level is changed	2018-02-02 16:31:20 -08:00
Yann Collet	4b525af53a	zstdmt: applies new parameters on the fly when invoked from ZSTD_compress_generic()	2018-02-02 15:58:13 -08:00
Yann Collet	90eca318a7	fileio: create dedicated function to generate zstd frames like other formats	2018-02-02 14:24:56 -08:00
Yann Collet	1291d9d7cf	Merge pull request #1006 from systemcrash/patch-2 Update README.md	2018-02-02 10:04:55 -08:00
Yann Collet	209df52ba2	Changed nbThreads for nbWorkers This makes it easier to explain that nbWorkers=0 --> single-threaded mode, while nbWorkers=1 --> asynchronous mode (one mode thread on top of the "main" caller thread). No need for an additional asynchronous mode flag. nbWorkers>=2 works the same as nbThreads>=2 previously.	2018-02-01 19:29:30 -08:00
Yann Collet	4b6a94f0cc	clarified comments on LDM parameters	2018-02-01 17:07:27 -08:00
Yann Collet	60fa90b6c0	zstdmt: added ability to change compression parameters during compression	2018-02-01 16:13:31 -08:00
Nick Terrell	48acaddff9	Test for incorrect pledgeSrcSize earlier	2018-02-01 12:04:05 -08:00
Yann Collet	727bb7f090	Merge pull request #1008 from terrelln/hlog3 Fix hashLog3 size when copying cdict tables	2018-01-31 12:49:07 -08:00
Nick Terrell	ab3346af07	Fix hashLog3 size when copying cdict tables	2018-01-31 11:12:17 -08:00
Yann Collet	823a28a1f4	Merge pull request #1000 from facebook/progressiveFlush Progressive flush	2018-01-30 22:49:47 -08:00
Yann Collet	a2ba629971	fixed function declaration ZSTD_getBlockSize()	2018-01-30 15:03:39 -08:00
Yann Collet	2cb0740b6b	zstdmt: changed naming convention to avoid confusion with blocks. also: - jobs are cut into chunks of 512KB now, to reduce nb of mutex calls. - fix function declaration ZSTD_getBlockSizeMax() - fix outdated comment	2018-01-30 14:43:36 -08:00
systemcrash	6b57387728	Update README.md spelling	2018-01-29 18:42:20 +01:00
Yann Collet	9f8ed23b5b	bumped version number to v1.3.4 also added a paragraph on using compression level with training mode as this is a recurrent question (see for example #1004)	2018-01-27 22:23:26 -08:00
Yann Collet	ba0cd8cf78	fixed minor conversion warning for C++ compilation mode	2018-01-26 18:18:42 -08:00
Yann Collet	caf9e96dc3	job mutex creation is checked	2018-01-26 18:09:25 -08:00
Yann Collet	9c40ae7ff1	zstdmt: there is now one mutex/cond per job	2018-01-26 17:55:08 -08:00
Yann Collet	77e36273de	zstdmt: minor code refactor for clarity	2018-01-26 17:08:58 -08:00
Yann Collet	27c5853c42	zstdmt: job table correctly cleaned after synchronous ZSTDMT_compress()	2018-01-26 14:35:54 -08:00
Yann Collet	0d426f6b83	zstdmt : refactor a few member names for clarity	2018-01-26 13:00:14 -08:00
Yann Collet	79b6e28b0a	zstdmt : flush() only lock to read shared job members Other job members are accessed directly. This avoids a full job copy, which would access everything, including a few members that are supposed to be used by worker only, uselessly requiring additional locks to avoid race conditions.	2018-01-26 12:15:43 -08:00
Yann Collet	d2b62b6fa5	minor : ZSTDMT_writeLastEmptyBlock() is a void function because it cannot fail	2018-01-26 11:06:34 -08:00
Yann Collet	fca13c6855	zstdmt : fixed memory leak writeLastEmptyBlock() must release srcBuffer as mtctx assumes it's done by job worker. minor : changed 2 job member names (src->srcBuffer, srcStart->prefixStart) for clarity	2018-01-26 10:44:09 -08:00
Yann Collet	8e128eaf05	zstdmt : refactor job members grouped by sharing properties	2018-01-26 10:20:38 -08:00
Yann Collet	777d3c1559	fixed minor declaration-after-statement warning	2018-01-25 17:45:18 -08:00
Yann Collet	a1d4041e69	zstdmt: removed job->jobCompleted replaced by equivalent signal job->consumer == job->srcSize. created additional functions ZSTD_writeLastEmptyBlock() and ZSTDMT_writeLastEmptyBlock() required when it's necessary to finish a frame with a last empty job, to create an "end of frame" marker. It avoids creating a job with srcSize==0.	2018-01-25 17:35:49 -08:00
Yann Collet	1272d8e760	zstdmt:: renamed mutex and cond to underline they are context-global	2018-01-25 14:52:34 -08:00
Yann Collet	5f349b129c	zstdmt : correctly set end of frame	2018-01-23 15:52:40 -08:00
Yann Collet	c1cc57f270	zstdmt : fix end condition (ZSTD_e_end) When ZSTD_e_end directive is provided, the question is not only "are internal buffers completely flushed", it is also "is current frame completed". In some rare cases, it was possible for internal buffers to be completely flushed, triggering a @return == 0, but frame was not completed as it needed a last null-size block to mark the end, resulting in an unfinished frame.	2018-01-23 15:19:11 -08:00
Yann Collet	de5e38a7a6	zstdmt: fixed minor race condition no real consequence, but pollute tsan tests : job->dstBuff is being modified inside worker, while main thread might read it accidentally because it copies whole job. But since it doesn't used dstBuff, there is no real consequence. Other potential solution : only copy useful data, instead of whole job	2018-01-23 14:03:07 -08:00
Yann Collet	ebd955e26a	zstdmt : fixed ending frame with 0-size block	2018-01-23 13:12:40 -08:00
Yann Collet	6711396d97	zstreamtest : fixed test 32 : multi-thread compression using ZSTD_compress_generic(,,ZSTD_e_end) Since it already provides ZSTD_e_end as directive, it should not be followed by ZSTDMT_endStream().	2018-01-19 22:20:53 -08:00
Yann Collet	a7ef3a219c	zstdmt : fixed last job size	2018-01-19 18:19:09 -08:00
Yann Collet	3ad7d4951c	zstdmt : finally vanquished an elusive and rare race condition	2018-01-19 17:35:08 -08:00
Yann Collet	940634a610	zstdmt : simplify job creation job will not be created when not enough room within job Table	2018-01-19 13:25:06 -08:00
Yann Collet	dc69623453	zstdmt: fixed corruption issue in ZSTDMT_endStream() when invoked directly.	2018-01-19 12:41:56 -08:00
Yann Collet	70f81d6030	zstdmt uses POOL_tryAdd() to call a new worker so that it's no longer a blocking call. This makes it possible to stream out data gradually, while waiting for a worker to become available.	2018-01-19 10:01:40 -08:00
Yann Collet	d19dc1903c	Merge pull request #995 from facebook/progressiveMT Progressive mt	2018-01-18 17:59:49 -08:00
Yann Collet	6f7280fb33	fixed frame checksum issue and race conditions	2018-01-18 16:20:26 -08:00
Yann Collet	997e4d0ccd	added POOL_tryAdd()	2018-01-18 14:39:51 -08:00
Yann Collet	4f43ef731d	Merge branch 'dev' into constCDict	2018-01-18 13:36:43 -08:00
Yann Collet	ef97d5a287	Merge branch 'progressiveMT' into progressiveFlush	2018-01-18 13:35:24 -08:00
Yann Collet	b6ab232f2d	Merge branch 'dev' into progressiveMT	2018-01-18 13:34:56 -08:00
Nick Terrell	9d96761520	Set repcodes for empty ZSTD_CDict When the dictionary is <= 8 bytes, no data is loaded from the dictionary. In this case the repcodes weren't set, because they were inserted after the size check. Fix this problem in general by first setting the cdict state to a clean state of an empty dictionary, then filling the state from there.	2018-01-18 13:28:30 -08:00
Yann Collet	c7190c69cc	fixes for @terrelln comments	2018-01-18 11:15:23 -08:00
Yann Collet	1b5d80d633	zstdmt: added ability to flush current job before it's completed however, zstdmt may still wait on next available worker, so it's not smooth yet.	2018-01-18 11:03:27 -08:00
Yann Collet	aa79c18e3f	fixed a few access contention passes thread sanitizer test	2018-01-17 17:18:19 -08:00
Yann Collet	394eec697b	Introduce ZSTD_getFrameProgression() Produces 3 statistics for ongoing frame compression : - ingested - consumed (effectively compressed) - produced Ingested can be larger than consumed due to buffering effect. For the time being, this patch mostly fixes the % ratio issue, since it computes consumed / produced, instead of ingested / produced. That being said, update is not "smooth", because on a slow enough setting, fileio spends most of its time waiting for a worker to complete its job. This could be improved thanks to more granular flushing i.e. start flushing before ongoing job is fully completed.	2018-01-17 16:39:02 -08:00
Yann Collet	f3b8f90b6d	changed initStatic?Dict() return type to const ZSTD_?Dict* ZSTD_create?Dict() is required to produce a ?Dict* return type because `free()` does not accept a `const type` argument. If it wasn't for this restriction, I would have preferred to create a `const ?Dict` object to emphasize the fact that, once created, a dictionary never changes (hence can be shared concurrently until the end of its lifetime). There is no such limitation with initStatic?Dict() : as stated in the doc, there is no corresponding free() function, since `workspace` is provided, hence allocated, externally, it can only be free() externally. Which means, ZSTD_initStatic?Dict() can return a `const ZSTD_?Dict*` pointer. Tested with `make all`, to catch initStatic's users, which, incidentally, also updated zstd.h documentation.	2018-01-17 14:08:48 -08:00
Yann Collet	b86865323a	Merge branch 'dev' into progressiveMT fixed minor conflict on cdict	2018-01-17 13:51:03 -08:00
Yann Collet	d14cc881b0	zstdmt : fixed very large window sizes would create too large buffers, since default job size == window size * 4. This would crash on 32-bit systems. Also : jobSize being a 32-bit unsigned, it cannot be >= 4 GB, so the formula was failing for large window sizes >= 1 GB. Fixed now : max job Size is 2 GB, whatever the window size.	2018-01-17 12:39:58 -08:00
Yann Collet	58dd7de640	zstdmt: fixed an endless loop on allocation failure this happened on 32-bits build when requiring a too large input buffer, typically on wlog=29, creating jobs of 2 GB size. also : zstd32 now compiles with multithread support enabled by default (can be disabled with HAVE_THREAD=0)	2018-01-17 12:10:15 -08:00
Nick Terrell	16bd0fd4df	Reduce size of ZSTD_CDict Shaves 492,076 B off of the `ZSTD_CDict`. The size of a `ZSTD_CDict` created from a 112,640 B dictionary is: \| Level \| Before (B) \| After (B) \| \|-------\|------------\|-----------\| \| 1 \| 648,448 \| 156,412 \| \| 3 \| 1,140,008 \| 647,932 \|	2018-01-17 11:50:49 -08:00
Yann Collet	cb57c107ff	zstdmt: minor variable renaming, for clarity	2018-01-17 11:39:07 -08:00
Yann Collet	1dba98d563	introduced parameter ZSTD_p_nonBlockingMode This new parameter makes it possible to call streaming ZSTDMT with a single thread set which is non blocking. It makes it possible for the main thread to do other tasks in parallel while the worker thread does compression. Typically, for zstd cli, it means it can do I/O stuff. Applied within fileio.c, this patch provides non-negligible gains during compression. Tested on my laptop, with enwik9 (1000000000 bytes) : time zstd -f enwik9 With traditional single-thread blocking mode : real 0m9.557s user 0m8.861s sys 0m0.538s With new single-worker non blocking mode : real 0m7.938s user 0m8.049s sys 0m0.514s => 20% faster	2018-01-16 16:15:47 -08:00
Yann Collet	6025465e42	ZSTDMT : minor CCtx memory optimization can be useful when a compression job only has small amount of data to compress.	2018-01-16 15:34:41 -08:00
Yann Collet	2e23333094	ZSTDMT can now work in non-blocking mode with 1 thread it still fallbacks to single-thread blocking invocation when input is small (<1job) or when invoking ZSTDMT_compress(), which is blocking. Also : fixed a bug in new block-granular compression routine.	2018-01-16 15:28:43 -08:00
Yann Collet	8e83c5c910	Merge branch 'dev' into progressiveMT	2018-01-16 12:54:33 -08:00
Nick Terrell	aae267a2e1	Reorganize block state	2018-01-16 11:17:50 -08:00
Nick Terrell	887cd4e35e	Split ZSTD_CCtx into smaller sub-structures	2018-01-16 11:17:50 -08:00
Yann Collet	9477f6529d	Merge pull request #984 from terrelln/dict-load Load more dictionary positions into table if empty	2018-01-13 13:20:42 -08:00
Yann Collet	58ecf13e02	zstdmt : can compress at block granularity offering perspective of more accurate progression report.	2018-01-13 13:18:57 -08:00
Nick Terrell	9a211d1f05	Load more dictionary positions into table if empty If the hash table is empty load positions into the hash table that we would otherwise skip. \| Level \| Data Set \| Improvement \| \|-------\|--------------\|-------------\| \| 1 \| github \| 0.44% \| \| 1 \| hg-changelog \| 0.13% \| \| 1 \| hg-commands \| 1.28% \| \| 1 \| hg-manifest \| 0.70% \| \| 3 \| github \| 0.74% \| \| 3 \| hg-changelog \| 0.87% \| \| 3 \| hg-commands \| 1.74% \| \| 3 \| hg-manifest \| 0.23% \|	2018-01-12 16:17:22 -08:00
Yann Collet	863b2f8db4	Merge pull request #983 from terrelln/dict-wlog Increase windowLog from CDict based on the srcSize when known	2018-01-12 07:47:43 -08:00
Nick Terrell	b610b777d3	Increase windowLog from CDict based on the srcSize when known	2018-01-11 16:23:21 -08:00
Yann Collet	cacf47cbee	Merge branch 'dev' into dubtlazy and fixed conflicts	2018-01-11 13:25:08 -08:00
Yann Collet	04c00f9388	Merge pull request #982 from facebook/fix304 Fix for #304 and #977 : error during dictionary creation	2018-01-11 13:20:59 -08:00
Yann Collet	b9a14900ff	changed function name to ZSTD_DUBT_findBestMatch()	2018-01-11 12:38:31 -08:00
Yann Collet	752bae4a48	added warning message when pathological dataset is detected (note : cover_optimize needs -v to display the warning)	2018-01-11 11:29:28 -08:00
Yann Collet	e8093dde09	fixed #304 Pathological samples may result in literal section being incompressible. This case is now detected, and literal distribution is replaced by one that can be written into the dictionary.	2018-01-11 11:16:32 -08:00
Yann Collet	218e9fe0fc	added a test case for dictBuilder failure cyclic data set makes the entropy stage fails now, onto a fix for #304 ...	2018-01-11 09:42:38 -08:00
Yann Collet	ff795580f2	fixed bug #976 , reported by @indygreg constants in zstd.h should not depend on MIN() macro which existence is not guaranteed. Added a test to check the specific constants. The test is a bit too specific. But I have found no way to control a more generic "are all macro already defined" condition, especially as this is a valid construction (the missing macro might be defined later, intentionnally).	2018-01-10 20:33:45 -08:00
Yann Collet	292eeb672f	api doc : grouped all ZSTD_create*_advanced() functions together in a new "custom memory allocator" paragraph which is itself part of "memory management" category. This makes it simpler to see the relation between the type and its usages.	2018-01-10 09:07:47 -08:00
Yann Collet	3ea156368c	API doc : grouped ZSTD_initStatic*() together within "memory management" category.	2018-01-10 08:49:50 -08:00
Yann Collet	b17fb488b0	fixed msan test a pointer calculation was wrong in a corner case	2018-01-06 20:50:36 +01:00
Yann Collet	658d6b8588	Merge branch 'dev' into dubtlazy	2018-01-06 12:40:58 +01:00
Yann Collet	a927fae2a1	fixed ZSTD_reduceIndex() following suggestions from @terrelln. Also added some comments to present logic behind ZSTD_preserveUnsortedMark().	2018-01-06 12:31:26 +01:00
Yann Collet	2eff217136	updated /lib documentation	2017-12-31 15:50:00 +01:00
Yann Collet	00db4dbbb3	fixed minor argument property for Visual	2017-12-30 15:42:28 +01:00
Yann Collet	f597f55675	improved btlazy2 : list of unsorted candidates can reach extDict It used to stop on reaching extDict, for simplification. As a consequence, there was a small loss of performance each time the round buffer would restart from beginning. It's not a large difference though, just several hundreds of bytes on silesia. This patch fixes it.	2017-12-30 15:12:59 +01:00
Yann Collet	a68b76afef	updated compression level table for btlazy2 now selected for levels 13, 14 and 15. Also : dropped the requirement for monotonic memory budget increase of compression levels,, which was required for ZSTD_estimateCCtxSize() in order to ensure that a memory budget for level L is large enough for any level <= L. This condition is now ensured at run time inside ZSTD_estimateCCtxSize().	2017-12-30 11:40:35 +01:00
Yann Collet	eb52e2f45e	simplify ZSTD_preserveUnsortedMark() implementation since no compiler attempts to auto-vectorize it.	2017-12-30 11:13:52 +01:00
Yann Collet	d228b6b0d0	btlazy2 : optimization for dictionary compression we want the dictionary table to be fully sorted, not just lazily filled. Dictionary loading is a bit more intensive, but it saves cpu cycles for match search during compression.	2017-12-29 19:14:18 +01:00
Yann Collet	02f64ef955	btlazy2: fixed interaction between unsortedMark and reduceTable	2017-12-29 19:08:51 +01:00
Yann Collet	64482c2c97	fixed bug in dubt the chain of unsorted candidates could grow beyond lowLimit.	2017-12-29 17:04:37 +01:00
Yann Collet	f36da5b4d9	minor speed optimization : index overflow prevention new code supposed to be easier to auto-vectorize	2017-12-29 14:40:33 +01:00
Yann Collet	5235d8d6ba	first implementation of delayed update for btlazy2 This is a pretty nice speed win. The new strategy consists in stacking new candidates as if it was a hash chain. Then, only if there is a need to actually consult the chain, they are batch-updated, before starting the match search itself. This is supposed to be beneficial when skipping positions, which happens a lot when using lazy strategy. The baseline performance for btlazy2 on my laptop is : 15#calgary.tar : 3265536 -> 955985 (3.416), 7.06 MB/s , 618.0 MB/s 15#enwik7 : 10000000 -> 3067341 (3.260), 4.65 MB/s , 521.2 MB/s 15#silesia.tar : 211984896 -> 58095131 (3.649), 6.20 MB/s , 682.4 MB/s (only level 15 remains for btlazy2, as this strategy is squeezed between lazy2 and btopt) After this patch, and keeping all parameters identical, speed is increased by a pretty good margin (+30-50%), but compression ratio suffers a bit : 15#calgary.tar : 3265536 -> 958060 (3.408), 9.12 MB/s , 621.1 MB/s 15#enwik7 : 10000000 -> 3078318 (3.249), 6.37 MB/s , 525.1 MB/s 15#silesia.tar : 211984896 -> 58444111 (3.627), 9.89 MB/s , 680.4 MB/s That's because I kept `1<<searchLog` as a maximum number of candidates to update. But for a hash chain, this represents the total number of candidates in the chain, while for the binary, it represents the maximum depth of searches. Keep in mind that a lot of candidates won't even be visited in the btree, since they are filtered out by the binary sort. As a consequence, in the new implementation, the effective depth of the binary tree is substantially shorter. To compensate, it's enough to increase `searchLog` value. Here is the result after adding just +1 to searchLog (level 15 setting in this patch): 15#calgary.tar : 3265536 -> 956311 (3.415), 8.32 MB/s , 611.4 MB/s 15#enwik7 : 10000000 -> 3067655 (3.260), 5.43 MB/s , 535.5 MB/s 15#silesia.tar : 211984896 -> 58113144 (3.648), 8.35 MB/s , 679.3 MB/s aka, almost the same compression ratio as before, but with a noticeable speed increase (+20-30%). This modification makes btlazy2 more competitive. A new round of paramgrill will be necessary to determine which levels are impacted and could adopt the new strategy.	2017-12-28 16:58:57 +01:00
Yann Collet	473362e922	Merge pull request #958 from facebook/continueCCtx fix a subtle issue in continue mode	2017-12-20 00:12:50 +01:00
Yann Collet	cafedcbbe4	ZSTD_resetCCtx_internal: fixed order of arguments params1 was swapped with params2. This used to be a non-issue when testing for strict equality, but now that some tests look for "sufficient size" `<=`, order matters.	2017-12-19 21:49:04 +01:00
Yann Collet	9096088f45	changed variable name for clarity, suggested by @terrelln	2017-12-19 21:20:46 +01:00
Yann Collet	f299fa39ac	fix a subtle issue in continue mode The deep fuzzer tests caught a subtle bug that was probably there for a long time. The impact of the bug is not a crash, or any other clear error signal, rather, it reduces performance, by cutting data into smaller blocks. Eventually, the following test would fail because it produces too many 1-byte blocks, requiring more space than buffer can provide : `./zstreamtest_asan --mt -s3514 -t1678312 -i1678314` The root scenario is as follows : - Create context, initialize it using explicit parameters or a `cdict` to pin them down, set `pledgedSrcSize=1` - The compression parameters will not be adapted, but `windowSize` and `blockSize` will be automatically set to `1`. `windowSize` and `blockSize` are dynamic values, set within `ZSTD_resetCCtx_internal()`. The automatic adaptation makes it possible to generate smaller contexts for smaller input sizes. - Complete compression - New compression with same context, using same parameters, but `pledgedSrcSize=ZSTD_CONTENTSIZE_UNKNOWN` trigger "continue mode" - Continue mode doesn't modify blockSize, because it used to depend on `windowLog` only, but in fact, it also depends on `pledgedSrcSize`. - The "old" blocksize (1) is still there, next compression will use this value to cut input into blocks, resulting in more blocks and worse performance than necessary performance. Given the scenario, and its possible variants, I'm surprised it did not show up before. But I suspect it did show up, it's just that it never triggered an error, because "worse performance" is not a trigger. The above test is a special corner case, where performance is so impacted that it reaches an error case. The fix works, but I'm not completely pleased. I think the current code relies too much on implied relations between variables. This will likely break again in the future when some related part of the code change. Unfortunately, no time to make larger changes if we want to keep the release target for zstd v1.3.3. So a longer term fix will have to be considered after the release. To do : create a reliable test case which triggers this scenario for CI tests.	2017-12-19 09:43:03 +01:00
Yann Collet	5c2f2ebfdb	zstdmt via compress_generic: reduce opportunity to free/create mtctx `zstreamtest --newapi` (and `--opaqueapi`) create and destroy way too many threads resulting in failure of tsan tests, and potentially connected to the qemu flaky tests. This is because, at each test, the nb of threads can be changed (random). The `--no-big-tests` directive reduce this choice to 1/2 threads, in order to limit memory usage, especially for qemu and 32-bits builds. Unfortunately, swapping between 1 and 2 threads is enough to constantly create/destroy new mtctx. This patch takes advantage of the following property : via compress_generic, no internal mtctx is needed for nbThreads < 2. As a consequence, when nbThreads == 2, the currently active mtctx is necessarily good. This dramatically reduces the nb of thread creations when invoking `zstreamtest --newapi --no-big-tests` (only when parent cctx itself is created, which is randomized to 1/256 tests). Expected outcome : - at a minimum : tsan tests shall now work continuously without exploding the thread counter - at best : flaky qemu tests on `zstreamtest --newapi --no-big-tests` may stop being flaky, due to less stress from constant thread creation/destruction Real world impact : minimal, I don't expect users to constantly change `nbThreads` between each invocation. If `nbThreads` remains stable, existing implementation re-uses existing mtctx. Also : `zstreamtest --newapi` but without `--no-big-tests` doesn't benefit as much, since this test can select a random `nbThreads` value between 1 and 4. The current patch only reduces opportunity to free/create mtctx (for example : 2->1->2 doesn't need a new mtctx) but doesn't completely eliminate it, since `nbThreads` can still change between 2/3/4. A more complete solution could be to only use 2 out of 4 allocated threads, thus keeping the pool at a constant size. This would require a larger change to `POOL_*` api though.	2017-12-16 12:48:13 -08:00
Yann Collet	3cbfac1cdb	updated levels 15-20 taking advantage of `btopt` improved speed to tune parameters. Levels 16-19 are stronger than previous release, making the graph more favorable. In theory, I should also update small-size tables, but I got lazy on that one ...	2017-12-14 23:29:00 -08:00
Yann Collet	2cff66b62f	version bump to v1.3.3	2017-12-14 16:11:20 -08:00
Yann Collet	8c41a9cb1e	Merge pull request #951 from facebook/lastBlock saves 3-bytes on small input with streaming API	2017-12-14 15:39:50 -08:00
Yann Collet	a0ac8c895c	Merge pull request #950 from facebook/srcSizeAdaptation fix adaptation on srcSize	2017-12-14 14:48:31 -08:00
Yann Collet	281f06e01f	saves 3-bytes on small input with streaming API zstd streaming API was adding a null-block at end of frame for small input. Reason is : on small input, a single block is enough. ZSTD_CStream would size its input buffer to expect a single block of this size, automatically triggering a flush on reaching this size. Unfortunately, that last byte was generally received before the "end" directive (at least in `fileio`). The later "end" directive would force the creation of a 3-bytes last block to indicate end of frame. The solution is to not flush automatically, which is btw the expected behavior. It happens in this case because blocksize is defined with exactly the same size as input. Just adding one-byte is enough to stop triggering the automatic flush. I initially looked at another solution, solving the problem directly in the compression context. But it felt awkward. Now, the underlying compression API `ZSTD_compressContinue()` would take the decision the close a frame on reaching its expected end (`pledgedSrcSize`). This feels awkward, a responsability over-reach, beyond the definition of this API. ZSTD_compressContinue() is clearly documented as a guaranteed flush, with ZSTD_compressEnd() generating a guaranteed end. I faced similar issue when trying to port a similar mechanism at the higher streaming layer. Having ZSTD_CStream end a frame automatically on reaching `pledgedSrcSize` can surprise the caller, since it did not explicitly requested an end of frame. The only sensible action remaining after that is to end the frame with no additional input. This adds additional logic in the ZSTD_CStream state to check this condition. Plus some potential confusion on the meaning of ZSTD_endStream() with no additional input (ending confirmation ? new 0-size frame ?) In the end, just enlarging input buffer by 1 byte feels the least intrusive change. It's also a contract remaining inside the streaming layer, so the logic is contained in this part of the code. The patch also introduces a new test checking that size of small frame is as expected, without additional 3-bytes null block.	2017-12-14 11:47:02 -08:00
Yann Collet	c005df136f	Merge pull request #947 from facebook/fix944 Fix #944	2017-12-14 10:01:52 -08:00
Yann Collet	2e97a6d464	fixed minor declaration-after-statement warning	2017-12-13 18:50:05 -08:00
Yann Collet	5432ef6921	fixes adaptation on srcSize This patch restores capability for each file to receive adapted compression parameters depending on its size. The bug breaking this feature was relatively silly : setting a parameter with a value "0" is supposed to be a no-op. Unfortunately, it would pin down compression parameters as if they were manually set, preventing later automatic adaptation. Unfortunately, I'm currently short of a test case that could check this situation and trigger an error. Compression parameters selection between tableID 0,1,2,3 is largely internal, leaving no trace to outside world, not even in frame header.	2017-12-13 17:45:26 -08:00
Yann Collet	d23eb9a098	zstreamtest : added missing CHECK_Z()	2017-12-13 15:35:49 -08:00
Nick Terrell	22727a7467	Fix cdict compressor repcodes	2017-12-13 11:31:20 -08:00
Yann Collet	e28305fcca	fix #944 : ZSTDMT with large files and dictionary now works correctly windowLog is now enforced from provided compression parameters, instead of being copied blindly from `cdict` where it could be smaller. also : - fix a minor bug in zstreamtest --mt : advanced parameters must be set before init - changed advanced parameter name to ZSTDMT_jobSize	2017-12-12 18:04:58 -08:00
Yann Collet	03832b7aa5	re-added test case messing with revert ... :(	2017-12-12 14:01:54 -08:00
Yann Collet	8a104fda05	Revert "Created a test case which reliably reproduces bug #944 " This reverts commit `5098d1fbe2`.	2017-12-12 12:51:49 -08:00
Yann Collet	5098d1fbe2	Created a test case which reliably reproduces bug #944 in zstreamtest.	2017-12-12 12:48:31 -08:00
Yann Collet	ac8e022806	Merge pull request #943 from facebook/fix942 Fix #942	2017-12-08 13:53:08 -05:00
Yann Collet	dfc697e967	comment clarification	2017-12-08 12:16:49 -05:00
Yann Collet	c029ee1f0b	ZSTD_initCStream_srcSize() considers "0" to mean "unknown" to not break existing programs relying on this behavior. Might be changed to mean "empty" in the future.	2017-12-07 17:13:10 -05:00
Yann Collet	3aa2b27a89	fix #942 : streaming interface does not compress after ZSTD_initCStream() While the final result is still, technically, a frame, the resulting frame expands initial data instead of compressing it. This is because the streaming API creates a tiny 1-byte buffer for input, because it believes input is empty (0-bytes), because in the past, 0 used to mean "unknown" instead. This patch fixes the issue. Todo : add a test which traps the issue.	2017-12-07 02:52:50 -05:00
Yann Collet	c173dbd6e7	no longer supported starting C++17	2017-12-04 18:00:53 -08:00
Yann Collet	7e05ef851a	Merge branch 'dev' into qemu32panic	2017-12-03 11:14:36 -08:00
Yann Collet	5e1f34b7e4	setParameter : no side-effect on setting a compression parameter last such side-effect was modifying cctx->loadedDictEnd on setting forceWindow. It is no a useless operation, so it's removed. No side-effect left when setting a compression parameter.	2017-12-01 21:17:09 -08:00
Yann Collet	78290874a5	fixed Visual warning on minor interface discrepancy	2017-11-29 17:01:14 -08:00
Yann Collet	d3c59edac9	removed long-range-mode tests from `zstreamtest --no-big-tests`	2017-11-29 16:42:20 -08:00
Yann Collet	998a93b784	simplified ZSTD_CCtx_setParametersUsingCCtxParams() Any ZSTD_CCtx_setParameter() shall just write the requested parameter, without further action. Any action shall be taken at parameter application only (during init). It makes it possible to just copy CCtxParams from external container to internal state, and get rid of the more complex code which was trying to compensate for missing actions.	2017-11-29 16:13:05 -08:00
Yann Collet	f98ee994c4	zstd_opt: added comments, as requested by @terrelln	2017-11-29 15:19:00 -08:00
Yann Collet	bc42bc3b1d	removed one invocation of SET_PRICE() macro	2017-11-28 16:08:56 -08:00
Yann Collet	0a0a212934	zstd_opt: changed cost formula There was a flaw in the formula which compared literal cost with match cost : at a given position, a non-null literal suite is going to be part of next sequence, while if position ends a previous match, to immediately start another match, next sequence will have a litlength of zero. A litlength of zero has a non-null cost. It follows that literals cost should be compared to match cost + litlength==0. Not doing so gave a structural advantage to matches, which would be selected more often. I believe that's what led to the creation of the strange heuristic which added a complex cost to matches. The heuristic was actually compensating. It was probably created through multiple trials, settling for best outcome on a given scenario (I suspect silesia.tar). The problem with this heuristic is that it's hard to understand, and unfortunately, any future change in the parser would impact the way it should be calculated and its effects. The "proper" formula makes it possible to remove this heuristic. Now, the problem is : in a head to head comparison, it's sometimes better, sometimes worse. Note that all differences are small (< 0.01 ratio). In general, the newer formula is better for smaller files (for example, calgary.tar and enwik7). I suspect that's because starting statistics are pretty poor (another area of improvement). However, for silesia.tar specifically, it's worse at level 22 (while being better at level 17, so even compression level has an impact ...). It's a pity that zstd -22 gets worse on silesia.tar. That being said, I like that the new code gets rid of strange variables, which were introducing complexity for any future evolution (faster variants being in mind). Therefore, in spite of this detrimental side effect, I tend to be in favor of it.	2017-11-28 14:07:03 -08:00
Yann Collet	b71405dc51	removed a bunch of code related to cached literal price optState was used both to evaluate price and to cache cost of previously calculated literals. This created a strong dependency, forcing parser to request cost in a strict order. This limitation is forbids future parser with skipping capabilities. After this patch, caching literals price still exists, but is now explicit, in a stack structure.	2017-11-28 12:32:24 -08:00
Yann Collet	03f30d9dcb	separate rawLiterals, fullLiterals and match costs removed one SET_PRICE() macro invocation	2017-11-28 12:14:46 -08:00
Yann Collet	eee87cd6f2	btopt: minor refactor : removed one SET_PRICE() macro invocation direct assignment makes operation cleaner. Also allows some (very minor) optimization (non-measurable)	2017-11-27 17:18:57 -08:00
Yann Collet	e9d1987fd7	btopt: minor speed optimization matchPrice is always right at beginning	2017-11-27 17:01:51 -08:00
Yann Collet	bd88f633ac	zstreamtest : in `-T#s`, s considered a suffix meaning "seconds" avoid unintentionnally triggering `seedset`, so that seed gets automatically determined when not set.	2017-11-27 12:15:23 -08:00
Yann Collet	f8d5c478af	fixed comment, reported by @gyscos	2017-11-21 10:36:14 -08:00
Yann Collet	4154aec679	fixed comment, as suggested by @terrelln	2017-11-21 10:26:17 -08:00
Yann Collet	899f2a29f6	strategy ZSTD_btopt pinned to (0) variant (faster one)	2017-11-20 11:53:20 -08:00
Yann Collet	3f457264d1	slightly improved compression speed	2017-11-19 14:40:21 -08:00
Yann Collet	42c1e64270	slightly improved ratio at -22 merging of repcode search into btsearch introduced a small compression ratio regressio at max level : 1.3.2 : 52728769 after repMerge patch : 52760789 (+32020) A few minor changes have produced this difference. They can be hard to spot. This patch buys back about half of the difference, by no longer inserting position at hc3 when a long match is found there. It feels strangely counter-intuitive, but works : after this patch : 52742555 (-18234)	2017-11-19 14:00:55 -08:00
Yann Collet	99435dbbab	minor : search early-out on sufficient_len for hc3 and rep very very small speed and ratio increases	2017-11-19 12:58:04 -08:00
Yann Collet	d100670045	btopt0 : a bit faster and weaker	2017-11-19 10:38:02 -08:00
Yann Collet	e6da37c430	created (hidden) new strategy btopt0 about ~+10% faster but losing ~0.01 compression ratio (note : amplitude vary a lot depending on files, but direction remains the same)	2017-11-19 10:21:21 -08:00
Yann Collet	e717a5b0dd	zstd_opt: minor speed optimization Calculate reference log2sums only once per serie of sequence (as opposed to once per sequence) Also: improved code comments	2017-11-18 16:24:02 -08:00
Yann Collet	d11661c3ec	fix ZSTD_COMPRESSBOUND() macro It was using macro `KB`, which is not defined in `zstd.h`.	2017-11-18 11:16:39 -08:00
Yann Collet	a4a20a4b2f	fix un-initialized memory warning harmless, but cleaner	2017-11-17 15:51:52 -08:00
Yann Collet	23767e950a	fix one UB pointer arithmetic in encoder Instead of calculating distance between 2 memory objects, which is UB, we extract the offset from object 1, and transfer it into object 2.	2017-11-17 13:24:51 -08:00
Yann Collet	cdade555ee	fixed one UB pointer arithmetic	2017-11-17 11:40:08 -08:00
Yann Collet	11e58d9ba4	fixed minor warning warning: void function returning a value (even if the return value is void)	2017-11-16 15:21:30 -08:00
Yann Collet	15768cabb5	fixed some complex scenarios Fixed : multithreading to compress some small data with dictionary Fixed : ZSTD_initCStream_usingCDict() Improved streaming memory usage when pledgedSrcSize is known.	2017-11-16 15:18:18 -08:00
Yann Collet	05dffe43a7	Fixed Btree update ZSTD_updateTree() expected to be followed by a Bt match finder, which would update zc->nextToUpdate. With the new optimal match finder, it's not necessarily the case : a match might be found during repcode or hash3, and stops there because it reaches sufficient_len, without even entering the binary tree. Previous policy was to nonetheless update zc->nextToUpdate, but the current position would not be inserted, creating "holes" in the btree, aka positions that will no longer be searched. Now, when current position is not inserted, zc->nextToUpdate is not update, expecting ZSTD_updateTree() to fill the tree later on. Solution selected is that ZSTD_updateTree() takes care of properly setting zc->nextToUpdate, so that it no longer depends on a future function to do this job. It took time to get there, as the issue started with a memory sanitizer error. The pb would have been easier to spot with a proper `assert()`. So this patch add a few of them. Additionnally, I discovered that `make test` does not enable `assert()` during CLI tests. This patch enables them. Unfortunately, these `assert()` triggered other (unrelated) bugs during CLI tests, mostly within zstdmt. So this patch also fixes them. - Changed packed structure for gcc memory access : memory sanitizer would complain that a read "might" reach out-of-bound position on the ground that the `union` is larger than the type accessed. Now, to avoid this issue, each type is independent. - ZSTD_CCtxParams_setParameter() : @return provides the value of parameter, clamped/fixed appropriately. - ZSTDMT : changed constant name to ZSTDMT_JOBSIZE_MIN - ZSTDMT : multithreading is automatically disabled when srcSize <= ZSTDMT_JOBSIZE_MIN, since only one thread will be used in this case (saves memory and runtime). - ZSTDMT : nbThreads is automatically clamped on setting the value.	2017-11-16 12:18:56 -08:00
Yann Collet	dfc14579f5	removed wrong assertion	2017-11-15 15:35:56 -08:00
Yann Collet	c55e35b2fc	removed a few specialized traces	2017-11-15 15:04:53 -08:00
Yann Collet	61c2d70c86	shortened repcode match finder implementation	2017-11-15 14:37:40 -08:00
Yann Collet	d7e9805028	fixed corruption issue	2017-11-15 13:44:24 -08:00
Yann Collet	046ea53bef	still fighting data corruption due to messed up tree. Seems to happen when reaching end of buffer.	2017-11-15 11:29:24 -08:00
Yann Collet	4202b2e8a6	merged rep search into btMatchSearch but there is a tree corruption somewhere ... bug hunt ongoing	2017-11-14 20:38:52 -08:00
Yann Collet	9a11f70dc3	merged repcode search into BT match search this version has same speed as branch `opt` which is itself 5-10% slower than branch `dev` (no identified reason) It does not compress exactly the same as `opt` or `dev`, maybe because it doesn't stop search after repcodes, leading to sometimes better compression, sometimes worse (by a small margin). warning : _extDict path does not work for the time being This means that benchmark module works, but file module will fail with large files (and high compression level). Objective is to fuse _extDict path into current one, in order to have a single parser to maintain.	2017-11-13 02:23:48 -08:00
Yann Collet	eb47705b18	reduced scope of multiple variables renamed some variables for better understanding	2017-11-10 08:31:12 -08:00
Yann Collet	100d8ad6be	lib/compress: created ZSTD_LLcode() and ZSTD_MLcode() transform length into code. Since transformation is needed in several places throughout the code, better write the logic in one place.	2017-11-08 12:43:05 -08:00
Yann Collet	5aa0352742	zstd_opt: simplified ZSTD_getPrice() and ZSTD_updatePrice() interface ZSTD_getPrice() and ZSTD_updatePrice() accept normal matchlength as argument instead of matchlength-MINMATCH, which makes them easier / more logical to use and read. Conversion is simply done internally.	2017-11-08 12:23:27 -08:00
Yann Collet	bf730e2044	zstd_opt: refactor code for improved readability renamed variables to be more meaningful reduced scope of multiple variables removed some useless var attribution	2017-11-08 12:07:39 -08:00
Yann Collet	4191efa993	zstd_opt: ensure sufficient_len < ZSTD_OPT_NUM to simplify some tests	2017-11-08 11:24:00 -08:00
Yann Collet	ee441d5d2b	renamed zstd_compress.h into zstd_compress_internal.h to emphasize the fact that all definitions it contains must remain private, accross lib/compress modules.	2017-11-07 16:15:23 -08:00
Yann Collet	8b6aecf2cb	moved a few structures from `zstd_internal.h` to `zstd_compress.h` which is a more precise scope	2017-11-07 16:03:14 -08:00
Yann Collet	aec56a52fb	Merge pull request #908 from facebook/ubsan Modified one pointer arithmetic expression to a more conformant way.	2017-11-07 11:45:34 -08:00
Yann Collet	d0ffd398d2	Merge pull request #906 from facebook/fixAutoPledge fix : ZSTD_compress_generic(,,,ZSTD_e_end) automatically sets pledgedSrcSize	2017-11-02 10:14:20 -07:00
Yann Collet	150354c5fe	minor refactor added some traces and assert related to hunting a potential ubsan error in 32-bits more (it ends up being a compiler-side issue : https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82802). Modified one pointer arithmetic expression for a more conformant way.	2017-11-01 16:57:48 -07:00
Yann Collet	428e8b3bf4	fix : ZSTD_compress_generic(,,,ZSTD_e_end) automatically sets pledgedSrcSize as per documentation, on ZSTD_setPledgedSrcSize() : > If all data is provided and consumed in a single round, > this value (pledgedSrcSize) is overriden by srcSize instead. This wasn't applied before compression level is transformed into compression parameters. As a consequence, small input missed compression parameters adaptation. It seems to work fine now : compression was compared with ZSTD_compress_advanced(), results were the same.	2017-11-01 13:15:23 -07:00
Nick Terrell	1fc4f593da	Allow skippable frames of any size	2017-11-01 13:07:26 -07:00
Yann Collet	61e5a1adfc	removed direct call to malloc() from pool.c	2017-10-31 17:43:24 -07:00
Yann Collet	f73e15de33	Merge pull request #903 from terrelln/empty-input [libzstd] Fix parameter selection for empty input	2017-10-28 17:28:08 -07:00
Nick Terrell	86b8134cad	[libzstd] Fix parameter selection for empty input ZSTD_compress() and friends would treat an empty input as an unknown size when selecting parameters. Thus, they would drastically overallocate the context. Tell ZSTD_getParams() that the source size is 1 when it is empty.	2017-10-25 17:24:15 -07:00
Nick Terrell	b495140f67	Update BUCK files * Correct XXH namespace (Fixes #901) * Multithreading always enabled * GZIP/LZ4/LZMA always enabled * Legacy support always fully enabled	2017-10-25 12:47:57 -07:00
Yann Collet	97dccbbb2b	fixed zbufftest preserve "pledgedSrcSize=0" means "unknown" in init_advanced()	2017-10-19 14:06:02 -07:00
Yann Collet	ca1a9ebac5	fixed zlib wrapper it was invoking ZSTD_initCStream_advanced() with pledgedSrcSize==0 and contentSizeFlag=1 which means "empty" while the intention was to mean "unknown". The contentSizeFlag==1 is new, it is a consequence of setting this value to 1 by default. The solution selected here is to pass ZSTD_CONTENTSIZE_UNKNOWN to mean "unknown". So contentSizeFlag remains set (it wasn't in previous versions).	2017-10-18 11:22:23 -07:00
Yann Collet	1ff8a8c109	Merge pull request #891 from facebook/contentSize Content size	2017-10-17 17:24:51 -07:00
Yann Collet	32c9f715ae	fixed : Visual build compressing stdin with multi-threading enabled fails It was multiple reasons stacked : - Visual use a different code path, because ZSTD_NEWAPI is not defined - fileio.c sends `0` as `pledgedSrcSize` to mean `ZSTD_CONTENTSIZE_UNKNOWN` (fixed) - ZSTDMT_resetCCtx() interpreted `0` as "empty" instead of "unknown" (fixed)	2017-10-17 14:07:43 -07:00
Yann Collet	13bfe885aa	edited ZSTD_initCStream_advanced() comment	2017-10-16 14:06:22 -07:00
Nick Terrell	7f961ba6cd	Don't allow default tables to repeat It isn't useful in any case to repeat default tables. Saves a few bytes on Silesia, since we don't trigger the dictionary heuristic. Before: 211988480 => 73651998 bytes After: 211988480 => 73651721 bytes	2017-10-16 11:37:56 -07:00
Yann Collet	fc8d293460	dictionary compression use correct file size estimation when determining compression parameters to compress one file only. For multiple files, it still "bets" that files are going to be small. There was also a bug recently added in ZSTD_CCtx_loadDictionary_advanced() making it incapable to use pledgedSrcSize to determine compression parameters.	2017-10-14 01:21:43 -07:00
Yann Collet	5eed8e7a55	changed API comments to invite using macro ZSTD_CONTENTSIZE_UNKNOWN to mean "pledgedSrcSize is not known at init time" instead of `0`. Note that, a few prototypes created and documented with `0` to mean "unknown" still interpret "0" as unknown, to avoid breaking 3rd party applications which depend on this behavior. But this value is no longer recommended to mean "unknown". In some future version, it might be possible to switch "0" to mean "empty", as is already the case for several prototypes. The advantage is that pledgedSrcSize field would have same behavior accross entire API, making it easier to reason about. Note that all concerned prototypes belong to the "experimental" API section. srcSize is controlled at end of compression, so if someone uses "0" to mean "unknown" while it effectively means "empty", this is immediately caught by the compression function, which generates an error code : ZSTD_ERROR_srcSize_wrong	2017-10-14 00:32:06 -07:00
Yann Collet	beb9b4b398	fixed ZSTDMT_initCStream() when contentSizeFlag==1 by default and a wrong test in zstreamtest --mt	2017-10-13 19:09:30 -07:00
Yann Collet	213ef3b510	fixed ZSTD_initCStream_advanced() behavior, which depends on contentSizeFlag, and a stream fuzzer test, which was incorrect (relied on 0 being unconditionnally transformed into `ZSTD_CONTENTSIZE_UNKNOWN`)	2017-10-13 19:01:58 -07:00
Yann Collet	3c1e3f8ec9	contentSizeFlag enabled by default would also fail for streaming and MT operations fixed	2017-10-13 18:32:06 -07:00
Yann Collet	fb44516641	ensure fParams.contentSizeFlag starts at 1 such default was failing for ZSTD_compressBegin/ZSTD_compressContinue fixed too	2017-10-13 17:39:13 -07:00
Yann Collet	dd18d73e7e	fileio: content size is enabled by default	2017-10-13 16:32:18 -07:00
Nick Terrell	ced6e6189c	Add DEBUGLOG() that prints FSE encoding types	2017-10-13 14:55:23 -07:00
Nick Terrell	24ac2dbd2a	Fix invalid use of dictionary offcode table Fixes #888.	2017-10-13 12:47:03 -07:00
Yann Collet	a9e5705077	minor code formatting added a trace during sequence encoding	2017-10-13 02:36:16 -07:00
Yann Collet	7f6a783862	fixed a small error in decodeCorpus a compressed block must be strictly smaller than its decompressed size.	2017-10-07 15:19:52 -07:00
Nick Terrell	a86a7097ec	Ensure dictionary Huff table can encode any symbol * Ensure that the dictionary Huffman CTable has maxSymbolValue 255. * Fix a stack buffer overflow during compression dictionary loading.	2017-10-03 13:22:13 -07:00
Yann Collet	67478f4cb0	fixed minor conversion warnings for printf in debug mode	2017-10-02 17:28:57 -07:00
Yann Collet	9b166d2291	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-10-02 16:34:26 -07:00
Yann Collet	3b27ed41fd	Merge branch 'srcSize' into dev	2017-10-02 16:34:14 -07:00
Yann Collet	7e00df4a49	bumped version number and updated NEWS in anticipation for release	2017-10-02 16:27:25 -07:00
Yann Collet	004fd34fd9	Merge pull request #876 from facebook/srcSize CLI Fix : srcSize written in frame headers when compressing multiple files	2017-10-02 15:02:05 -07:00
Nick Terrell	86e83e926f	[libzstd] Set CLEVEL_CUSTOM correctly In `ZSTD_compressBegin_advanced()`, `ZSTD_parameters` are used to set the compression parameters, but the level didn't get set to `CLEVEL_CUSTOM`, so `ZSTD_compressBlock()` used the wrong parameters when checking the source size.	2017-10-02 13:43:30 -07:00
Yann Collet	5db19b8685	added comment on ZSTD_COMPRESSBOUND() as requested by @terrelln	2017-10-01 11:32:38 -07:00
Yann Collet	6e930c13d1	Merge branch 'dev' into compressBound	2017-10-01 11:24:02 -07:00
Yann Collet	76ac0b2d99	macro compatible with scenario where windowSize = 1024 (minimum)	2017-09-30 15:34:44 -07:00
Yann Collet	dc404119e5	ZSTD_adjustCParams_internal : minor optimization	2017-09-30 15:02:40 -07:00
Nick Terrell	c5d6dde502	Don't `size -= 1` in ZSTD_adjustCParams() The window size could end up too small if the source size is 2^n + 1. Credit to OSS-Fuzz	2017-09-30 14:20:06 -07:00
Yann Collet	ee1ed78fcb	fix proper naming on FSE_createCTable() arguments in fse.h	2017-09-30 11:08:50 -07:00
Yann Collet	5b10345b26	added ZSTD_COMPRESSBOUND() as a macro ZSTD_compressBound() works fine, but is only useful for dynamic allocation. For static allocation, only a macro can provide the amount during compilation time.	2017-09-29 23:17:41 -07:00
Yann Collet	8afb151c9b	cli: fixed wrong initialization in MT mode It's not good to mix old and new API ZSTD_resetCStream() doesn't just set pledgedSrcSize : it also sets the CCtx for a single thread compression. Problem is, when 2+ threads are defined in cctx->requestedParams, ZSTD_compress_generic() will want to start MT compression, since initialization is supposed to have already happened (thanks to ZSTD_resetCStream()) except that the underlying ZSTDMT_CCtx* object is not created, resulting in a segfault. This is an invalid construction (correct one is to use ZSTD_CCtx_setPledgedSrcSize()). I haven't found a nice way to mitigate this impact if someone makes the same mistake. At some point, removing the old API to keep only the new API within fileio.c will limit these risks.	2017-09-29 22:14:37 -07:00
Yann Collet	fbd5ab7027	minor fix : no longer use fake srcSize during resource creation srcSize is read and provided at each file, not at resource creation. This used to be useful with older API, because it could not re-adapt parameters between sessions. At some point, it will be better to remove the old code, and only keep the new_api. It works fine by now.	2017-09-29 19:40:27 -07:00
Yann Collet	db1668a43b	fix : srcSize written in frame header when multiple files compressed This information used to be disabled when nbFiles>1. It was badly initialized later in the code, resulting in an error.	2017-09-29 18:05:18 -07:00
Yann Collet	7c9669f272	Merge pull request #873 from facebook/shorterTests Leaner tests	2017-09-29 17:26:46 -07:00
Yann Collet	1416bc0f07	erase existence of a buffer when it's sent out of the pool In some complex scenario, the buffer would be freed because it's too large, another buffer would be allocated, but fail, trigger an error, and the general buffer pool would then be freed, where the definition of the already freed buffer would be found (beyond total index, but still), and freed again, resulting in double-free error.	2017-09-29 16:27:47 -07:00
Yann Collet	e963800e27	zstdmt : fixed : buffer dst0 wasn't properly set to null after usage now it's possible to unconditionnally invoke ZSTD_releaseAllJobRessources() wether previous compression was completed correctly or not.	2017-09-28 23:01:31 -07:00
Yann Collet	754ae5cc0b	removed ZSTDMT_waitForAllJobsCompleted() from ZSTDMT_freeCCtx() as per @terrelln comment	2017-09-28 20:45:31 -07:00
Yann Collet	86b4fe5b45	adjustCParams : restored previous behavior unknowns srcSize presumed small if there is a dictionary (dictSize>0) and presumed large otherwise.	2017-09-28 18:14:28 -07:00
Yann Collet	b93598d6a4	zstdmt : reduced maximum nb of threads to avoid memory address space issues on 32-bits systems (see https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=876416#17)	2017-09-28 13:49:12 -07:00
Yann Collet	e4ec427720	Merge branch 'dev' into shorterTests fixed conflicts	2017-09-28 12:19:28 -07:00
Yann Collet	8074261d00	zstdmt : move on when not enough memory for a new input buffer just continue operations without input forward progress, instead of an error that stops current compression session.	2017-09-28 11:46:19 -07:00
Yann Collet	2cd15dd9a4	fixed minor Visual conversion warning	2017-09-28 02:33:41 -07:00
Yann Collet	377abcc02c	zstdmt : better behavior when freeing a context right after a memory allocation error wait for all jobs to be completed, so that freeing can happen safely	2017-09-28 02:23:44 -07:00
Yann Collet	d6770f80af	minor : rewrite unit tests using CHECK_Z macro	2017-09-28 02:14:48 -07:00
Yann Collet	9b5b47ac93	ensure adjustCParams adjust hLog and cLog even without srcSize It would previously exit when srcSize is unknown. But in the case of custom parameters, hLog and cLog can still be too large in comparison with windowLog. Reduces maximum memory allocated during zstreamtest --newapi	2017-09-28 01:25:40 -07:00
Yann Collet	54a827fff0	Merge branch 'dev' into newFormats Fixed conflicts in zstdmt_compress.c	2017-09-27 16:39:40 -07:00
Yann Collet	e45a2aea9b	Merge pull request #869 from terrelln/dev [libzstd] pthread function prefixed with ZSTD_	2017-09-27 16:35:08 -07:00
Nick Terrell	b555b7ef41	[libzstd][opt] Simplify repcode logic	2017-09-27 15:30:12 -07:00
Yann Collet	ea1f50bf73	removed ZSTD_decompressBegin() from ZSTD_initDCtx_internal() It does not feel "right" from a dependency perspective. ZSTD_initDCtx_internal() is triggered once, on DCtx creation, while ZSTD_decompressBegin() is invoked at the beginning of each new frame, and is also a user-facing prototype. Downside : a DCtx must be init before first usage ! This was always the intention by the way, and is documented as such. This stage is automatically done within ZSTD_decompress() and variants, and also within ZSTD_decompressStream(). Only ZSTD_decompressContinue() is impacted, it must be preceded by a ZSTD_decompressBegin(), as detailed in doc. A test has been fixed, to no longer rely on undocumented assumption that ZSTD_decompressBegin() is invoked during init.	2017-09-27 13:51:05 -07:00
Yann Collet	c994932788	fixed ZSTD_format_e value validation	2017-09-27 12:22:22 -07:00
Nick Terrell	6c41adfb28	[libzstd] pthread function prefixed with ZSTD_ * `sed -i 's/pthread_/ZSTD_pthread_/g' lib/{,common,compress,decompress,dictBuilder}/.[hc]` Fix up `lib/common/threading.[hc]` * `sed -i s/PTHREAD_MUTEX_LOCK/ZSTD_PTHREAD_MUTEX_LOCK/g lib/compress/zstdmt_compress.c`	2017-09-27 11:48:48 -07:00
Yann Collet	ecf1778e23	updated ZSTD_format_e value validation also updated manual	2017-09-27 11:19:21 -07:00
Yann Collet	9416195221	changed error code when pos<=size condition is not respected Now pointing towards src_size or dst_size, instead of error_GENERIC.	2017-09-27 10:35:56 -07:00
Yann Collet	d56a350402	removed unsupported formats	2017-09-27 10:29:31 -07:00
Yann Collet	ca306c1c84	fixed a bug in zstreamtest decoder output buffer would receive a wrong size. In previous version, ZSTD_decompressStream() would blindly trust the caller that pos <= size. In this version, this condition is actively checked, and the function returns an error code if this condition is not respected. This check could also be done with an assert(), but since this is a user-facing interface, it seems better to keep this check at runtime.	2017-09-27 00:39:41 -07:00
Yann Collet	cd53ac831b	fixed DCtx initialization error now relying on initialization of dctx->format first	2017-09-26 18:26:09 -07:00
Yann Collet	4791561c4a	silence minor gcc warning -Wempty-body also silence fuzz test artefacts	2017-09-26 17:57:38 -07:00
Yann Collet	c0dd960363	switch assert() position	2017-09-26 15:36:57 -07:00
Yann Collet	319c699991	created ZSTD_startingInputLength() as suggested by @terrelln	2017-09-26 15:36:14 -07:00
Yann Collet	8d1e97ea9c	minor fixes following @terrelln comments	2017-09-26 15:06:30 -07:00
Yann Collet	df4e9bba25	fixed constant errors for gcc in c99 mode C standard does not consider a `static const int` as a constant. This is a problem for initializer, and ZSTD_STATIC_ASSERT(). Replaced by macro values	2017-09-26 14:31:06 -07:00
Yann Collet	9f0b8dfbe9	Merge branch 'dev' into newFormats	2017-09-26 14:22:39 -07:00
Nick Terrell	c233bdbaee	Increase maximum window size * Maximum window size in 32-bit mode is 1GB, since allocations for 2GB fail on my Mac. * Maximum window size in 64-bit mode is 2GB, since that is the largest power of 2 that works with the overflow prevention. * Allow `--long=windowLog` to set the window log, along with `--zstd=wlog=#`. These options also set the window size during decompression, but don't override `--memory=#` if it is set. * Present a helpful error message when the window size is too large during decompression. * The long range matcher defaults to a hash log 7 less than the window log, which keeps it at 20 for window log 27. * Keep the default long range matcher window size and the default maximum window size at 27 for the API and CLI. * Add tests that use the maximum window size and hash size for compression and decompression.	2017-09-26 14:00:01 -07:00
Yann Collet	586df82a78	Merge pull request #862 from terrelln/static [zstd] Backport kernel patch from @ColinIanKing	2017-09-25 17:02:40 -07:00
Yann Collet	52a1d1c6dc	added ZSTD_DCtx_reset()	2017-09-25 16:56:48 -07:00
Yann Collet	5d8fdd1641	Merge pull request #855 from terrelln/maxoff [libzstd] Increase MaxOff	2017-09-25 16:34:29 -07:00
Nick Terrell	76cb38d085	[zstd] Backport kernel patch from @ColinIanKing * Make the U32 table in `FSE_normalizeCount()` static. * Patch from https://lkml.kernel.org/r/20170922145946.14316-1-colin.king@canonical.com. * Clang makes non-static tables static anyways. gcc however, does [weird things](https://godbolt.org/g/fvTcED). * Benchmarks showed no difference in speed.	2017-09-25 16:18:23 -07:00
Yann Collet	f2a913862c	added ZSTD_decompress_generic_simpleArgs()	2017-09-25 15:46:34 -07:00
Yann Collet	6ee05a02b8	added ZSTD_decompress_generic() same as ZSTD_decompressStream(), just for a similar feeling as the compression side, which uses ZSTD_compress_generic()	2017-09-25 15:41:48 -07:00
Yann Collet	b8d4a3887f	introduced constant ZSTD_frameIdSize within zstd_internal.h This is the size of magic number. Avoids using `4` directly in source code, which is a bit less meaningful.	2017-09-25 15:26:18 -07:00
Yann Collet	044fb4c057	implemented magic-less frame decoder	2017-09-25 15:12:09 -07:00
Yann Collet	62568c9a42	added capability to generate magic-less frames decoder not implemented yet	2017-09-25 14:26:26 -07:00
Nick Terrell	bbe77212ef	[libzstd] Increase MaxOff	2017-09-25 13:36:18 -07:00
Yann Collet	96f0cde31a	minor function rename ZSTD_estimateCStreamSize_advanced_usingCParams -> ZSTD_estimateCStreamSize_usingCParams _usingX is clear. _advanced feels redundant	2017-09-24 16:47:02 -07:00
Yann Collet	7c3dea42ce	added prototypes for advanced parameters for decompression API required to decode custom formats	2017-09-24 15:57:29 -07:00
Yann Collet	e60f48c549	Merge branch 'dev' into newFormats	2017-09-24 14:33:37 -07:00
Yann Collet	8977224b9b	Merge pull request #859 from terrelln/31 Prepare for ZSTD_WINDOWLOG_MAX == 31	2017-09-22 09:01:39 -07:00
Nick Terrell	d6abb28951	Prepare for ZSTD_WINDOWLOG_MAX == 31	2017-09-21 17:18:41 -07:00
Yann Collet	cd3115b284	added control from frame content size at end of decompression adding check at end of single-pass ZSTD_decompressFrame(). Check within ZSTD_decompressContinue() was already added in a previous patch : `b3f33ccfb3`	2017-09-21 16:21:10 -07:00
Yann Collet	645563583e	Merge branch 'dev' into newFormats	2017-09-21 16:08:06 -07:00
Yann Collet	f97c2dbd39	created ZSTD_format declaration	2017-09-21 16:07:29 -07:00
Yann Collet	da74aabc00	Merge pull request #850 from terrelln/fse-optimal [fse] Fix FSE_optimalTableLog() for srcSize==1	2017-09-19 14:59:21 -07:00
Yann Collet	c399ab4804	Merge pull request #849 from terrelln/30 [bitstream] Allow adding 31 bits at a time	2017-09-19 14:25:10 -07:00
Nick Terrell	74718d7e43	[bitstream] Allow adding 31 bits at a time	2017-09-19 13:57:33 -07:00
Nick Terrell	6c9ed76676	[ldm] Fix corner case where minMatch < 8 There is a potential read buffer overflow when minMatch < 8. fix-fuzz-failure	2017-09-19 13:49:37 -07:00
Nick Terrell	18442a31ff	[libzstd] Fix bad window size assert The window size is not validated or used in the one-pass API, so there shouldn't be an assert based on it. fix-fuzz-failure	2017-09-19 13:47:59 -07:00
Yann Collet	cb8b471e8b	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-09-18 14:48:23 -07:00
Yann Collet	7d1ff3817b	fix ZSTD_sizeof_CCtx() / ZSTD_sizeof_CStream() previous result was over-estimated by counting streaming buffers twice	2017-09-18 14:47:34 -07:00
Nick Terrell	cae3e3c652	[fse] Fix FSE_optimalTableLog() for srcSize==1	2017-09-18 14:11:18 -07:00
Yann Collet	72a80515ec	Merge pull request #848 from terrelln/fparams [block] Don't use fParams in ZSTD_decompressBlock()	2017-09-18 13:48:31 -07:00
Yann Collet	539b91ee9b	minor : added assert in bt	2017-09-16 23:41:58 -07:00
Nick Terrell	5f22479517	[block] Don't use fParams in ZSTD_decompressBlock()	2017-09-15 17:37:20 -07:00
Yann Collet	77c137b3ae	minor comment refactor	2017-09-14 15:12:57 -07:00
Yann Collet	335780c427	fixed too strong alignment assert in ZSTD_initStaticCCtx() 64-bits fields are only 32-bits aligned on 32-bits CPU	2017-09-13 16:35:29 -07:00
Yann Collet	f1571dad8f	Merge pull request #838 from stellamplau/ldm-mergeDev Add long distance matcher	2017-09-13 13:24:08 -07:00
Yann Collet	4120a7fd5a	Merge pull request #837 from facebook/libzstd-nomt makes it possible to compile libzstd in single-thread mode without zs…	2017-09-12 17:13:17 -07:00
Yann Collet	3306bcb0e6	fix #820 : GCC v3.x 32-bits doesn't define 64-bits intrinsic resulting in undefined symbol error. Push the requirement to GCC 4 for now. Another solution, proposed by @NWilson, is to use __LONG_MAX__ instead. __LONG_MAX__ is a GCC-specific constant, which value is supposed to depend on underlying target hardware (32/64 bits) Might be better, but seems also more complex, hence more prone to side effects. Keeping the simple solution for now (just rely on __GNUC__)	2017-09-11 15:17:31 -07:00
Stella Lau	eb3327c10a	Merge branch 'dev' of https://github.com/facebook/zstd into ldm-mergeDev	2017-09-11 15:00:01 -07:00
Stella Lau	f902bf9676	Merge branch 'ldm-integrate' into ldm-mergeDev	2017-09-11 14:55:29 -07:00
Yann Collet	f325ee4e84	fixed pass-through warning	2017-09-11 14:37:03 -07:00
Stella Lau	0d1b54db61	Explicitly cast raw numerals when left-shifting	2017-09-11 14:28:18 -07:00
Yann Collet	0d6ecc72a3	makes it possible to compile libzstd in single-thread mode without zstdmt_compress.c (#819 )	2017-09-11 14:09:34 -07:00
Yann Collet	ce31004f20	fix following suggestions by @terrelln	2017-09-11 13:12:52 -07:00
Yann Collet	b3f33ccfb3	use ZSTD_decodingBufferSize_min() inside ZSTD_decompressStream() Use same definition as public one minor : reduce allocated buffer size in some cases (when frameContentSize is known and == windowSize)	2017-09-09 14:37:28 -07:00
Yann Collet	058ed2ad33	ZSTD_decodingBufferSize_min() supporting function for bufferless streaming API (ZSTD_decompressContinue()) makes it possible to correctly size a round buffer for decoding using this API. also : added field blockSizeMax within ZSTD_frameHeader, as it's a necessary information to know when to restart at beginning of decoding buffer.	2017-09-09 01:03:29 -07:00
Yann Collet	3128e03be6	updated license header to clarify dual-license meaning as "or"	2017-09-08 00:09:23 -07:00
Stella Lau	360428c5d9	Move ldm functions to their own file	2017-09-06 18:09:26 -07:00
Yann Collet	baa37c3362	programs/Makefile : better support for GNU conventions see https://www.gnu.org/prep/standards/html_node/Command-Variables.html	2017-09-06 16:53:59 -07:00
Yann Collet	3a12531a3d	lib/Makefile : better support for GNU conventions see https://www.gnu.org/prep/standards/html_node/Makefile-Conventions.html	2017-09-06 16:35:49 -07:00
Yann Collet	1c7b914cdf	update README on BUCK file	2017-09-06 16:23:39 -07:00
Yann Collet	36374cc3b4	update and clarify lib/README	2017-09-06 16:15:18 -07:00
Stella Lau	2b99d696de	Remove debug code	2017-09-06 15:57:26 -07:00
Stella Lau	eeff55dfa8	Merge remote-tracking branch 'upstream/dev' into ldm-mergeDev	2017-09-06 15:56:32 -07:00
Yann Collet	ad0046244f	Merge pull request #831 from terrelln/split-compress Split parsers out of zstd_compress.c	2017-09-06 10:01:27 -07:00
Stella Lau	9e4060200b	Add tests and fix pointer alignment	2017-09-06 09:14:05 -07:00
Stella Lau	c706de5395	Rename and add short ldm parameters in cli	2017-09-05 21:11:18 -07:00
Stella Lau	98b85426f1	Fix setting of nextToUpdate at end of ldm matcher	2017-09-05 20:41:37 -07:00
Nick Terrell	721726d688	Split parsers out of zstd_compress.c	2017-09-05 17:10:25 -07:00
Stella Lau	08d33fe1c9	Fix parameter handling in copyCCtx with cdict	2017-09-05 15:50:20 -07:00
Stella Lau	fd0071da29	Fix parameter handling with ZSTD_copyCCtx	2017-09-05 15:34:17 -07:00
Stella Lau	643d28c701	Add ldm options to 'man zstd'	2017-09-05 11:27:15 -07:00
Nick Terrell	423b133568	[POOL] Allow free on NULL when multithreading is disabled	2017-09-05 11:18:13 -07:00
Stella Lau	67d4a6161c	Add ldmBucketSizeLog param	2017-09-02 21:55:29 -07:00
Stella Lau	a1f04d518d	Move hashEveryLog to cctxParams and update cli	2017-09-01 15:05:47 -07:00
Stella Lau	767a0b3be1	Move ldm hashLog, bucketLog, and mml to cctxParams	2017-09-01 12:24:59 -07:00
Yann Collet	8a5c0c98ae	restored 32-bits decoder ability to decode long offsets (>32 MB, levels 21+)	2017-09-01 11:56:57 -07:00
Yann Collet	36aa8b5999	improved decoding speed	2017-09-01 11:40:59 -07:00
Stella Lau	17d8e0bdcc	Merge remote-tracking branch 'upstream/longRangeMatcher' into ldm-integrate	2017-09-01 10:19:38 -07:00
Stella Lau	8081becadc	Add long distance matching as a CCtxParam	2017-09-01 09:18:58 -07:00
Yann Collet	d963daa6a9	fixed minor warning (empty translation unit)	2017-09-01 00:12:07 -07:00
Yann Collet	3704507774	fixed decompression bug reported by @Etsukata (#828 )	2017-09-01 00:05:37 -07:00
Yann Collet	369c29dd1a	fixed impact of merge conflict for longRange	2017-08-31 18:25:56 -07:00
Yann Collet	d7ad99b2ab	Merge branch 'longRangeMatcher' into dev	2017-08-31 18:08:37 -07:00
Stella Lau	6a546efb8c	Add long distance matcher Move last literals section to ZSTD_block_internal	2017-08-31 12:53:19 -07:00
Yann Collet	b0cb081dc8	last batch of header files changed to reflect new license (#825 ) only remains to update contrib/linux-kernel (@terrelln)	2017-08-31 12:20:50 -07:00
Yann Collet	e21384fffb	fixed more file headers after license change (#825 )	2017-08-31 12:11:57 -07:00
Yann Collet	e9dc204f42	fixed a bunch of headers after license change (#825 )	2017-08-31 11:24:54 -07:00
Stella Lau	90a31bfa16	Pass dictMode to ZSTDMT_initCStream; fix nits - Return error code in estimate{CCtx,CStream}Size functions	2017-08-30 16:19:07 -07:00
Stella Lau	ee65701720	Minor fixes; remove formatting only changes	2017-08-29 20:27:35 -07:00
Stella Lau	a6e20e1bd7	Add test for raw content starting with dict header	2017-08-29 18:36:18 -07:00
Stella Lau	623e3cd40b	Use ZSTD_dm_rawContent in zstdmt_compress	2017-08-29 18:04:32 -07:00
Stella Lau	82d636b76a	Rename applyCCtxParams()	2017-08-29 18:03:06 -07:00
Stella Lau	4e835720bf	Delay creation of ZSTDMT_CCtx	2017-08-29 17:58:32 -07:00
Stella Lau	c7a18b7c21	Localize 'dictMode' from cctx to function param	2017-08-29 15:52:24 -07:00
Yann Collet	d6ddb879da	Merge pull request #817 from terrelln/pool-custom-alloc [pool] Accept custom allocators	2017-08-29 13:05:39 -07:00
Stella Lau	c88fb9267f	Replace 'byReference' with enum	2017-08-29 11:55:02 -07:00
Nick Terrell	9822f97721	[error] Don't guard undef X with ifdef X	2017-08-29 11:54:38 -07:00
Stella Lau	b5b9275e67	Rename estimateCCtxSize_advanced() and estimateCStreamSize_advanced()	2017-08-29 10:49:29 -07:00
Stella Lau	0e56a84a1e	Fix getting cParams from CCtxParams	2017-08-28 19:25:17 -07:00
Nick Terrell	02033be08c	[pool] Visual Studios disallows empty structs	2017-08-28 17:19:01 -07:00
Nick Terrell	7c365eb02c	[threading] Fix ERROR macro after including windows.h	2017-08-28 16:25:02 -07:00
Bernhard M. Wiedemann	cf689b84f9	Sort input file list in order to make builds reproducible in spite of indeterministic filesystem readdir order. See https://reproducible-builds.org/ for why this is good.	2017-08-26 17:08:00 +02:00
Stella Lau	024098a47d	Fix parameter retrieval from cdict	2017-08-25 17:58:28 -07:00
Stella Lau	2adde898c8	Fix typo with ZSTDMT_parameter	2017-08-25 16:13:40 -07:00
Stella Lau	18224608ff	Remove ZSTD_setCCtxParameter()	2017-08-25 13:58:41 -07:00
Stella Lau	0744592d38	Add function initializing cctxParams from clevel	2017-08-25 13:36:47 -07:00
Stella Lau	9911153723	Move jobSize and overlapLog in zstdmt to cctxParams	2017-08-25 13:14:51 -07:00
Stella Lau	de5193422d	Distinguish between jobParams and cctxParams in zstdmt	2017-08-25 11:36:17 -07:00
Stella Lau	eb7bbab36a	Remove ZSTD_p_refDictContent and dictContentByRef	2017-08-25 11:11:45 -07:00
Nick Terrell	db3f5372df	[zstdmt] Use POOL_create_advanced()	2017-08-24 18:12:28 -07:00
Nick Terrell	de6c6bce85	Fix zstd_internal.h for C++ mode	2017-08-24 18:09:50 -07:00
Nick Terrell	26dc040a7b	[pool] Accept custom allocators	2017-08-24 17:01:41 -07:00
Nick Terrell	89dc856cae	[pool] Fix formatting	2017-08-24 16:48:32 -07:00
Stella Lau	15fdeb9e41	Enforce nbThreads<=1 for estimateCCtxSize	2017-08-24 16:28:49 -07:00
Nick Terrell	376f435914	[dictBuilder] Set default compression level to 3	2017-08-24 16:21:05 -07:00
Stella Lau	2fbf0285b2	Fix interaction with ZSTD_setCCtxParameter() and cleanup	2017-08-24 11:25:41 -07:00
Stella Lau	fd9bf42516	Fix forceWindow and dictMode setting for zstdmt jobs	2017-08-23 19:16:57 -07:00
Stella Lau	bf3108fb50	Ensure zstdmt uses 'job version' of cctx parameters	2017-08-23 17:03:31 -07:00
Stella Lau	1c81f725ff	Remove duplicated testing code	2017-08-23 15:47:15 -07:00
Stella Lau	64ce49426b	Fix cstream compression level	2017-08-23 12:30:47 -07:00
Stella Lau	5bc2c1e982	Add prototype support for customMem with cctxParams	2017-08-23 12:03:30 -07:00
Yann Collet	e9ce1208a1	Merge pull request #812 from facebook/longRangeFix fixed extraordinary scenario where all fields use maximum nbBits	2017-08-23 11:35:28 -07:00
Yann Collet	74cde5a4d8	Merge pull request #813 from stellamplau/highbit32fix Fix undefined behavior when srcSize==1	2017-08-23 11:31:06 -07:00
Stella Lau	6f1a21c7e9	Remove formatting-only changes	2017-08-23 10:24:19 -07:00
Dmitriy Titarenko	20f715d709	Fix displayLevel overflow	2017-08-23 15:56:15 +05:00
Stella Lau	11303778d0	Add function to make cctxParams from ZSTD_parameters	2017-08-22 14:53:13 -07:00
Yann Collet	bd9c8ca146	Merge pull request #811 from terrelln/segmentSize [cover] Fix end condition for small dictionary	2017-08-22 14:36:30 -07:00
Stella Lau	23fc0e41fa	Remove 'opaque' naming from internal functions	2017-08-22 14:24:47 -07:00
Stella Lau	8fd1636776	Remove unused functions	2017-08-22 13:33:58 -07:00
Yann Collet	6b2b6a9bd5	fixed extraordinary scenario where all fields use maximum possible nb of bits simultaneously can only happen if windowLog>=27 (level 22 --ultra)	2017-08-22 12:09:21 -07:00
Stella Lau	e50ed1fa3a	Fix undefined behavior when srcSize==1	2017-08-22 11:55:42 -07:00
Stella Lau	60e1bc617c	Explicitly create a job cctxParam for multithreading	2017-08-21 15:39:37 -07:00
Stella Lau	5b956f4753	Comment out CCtx_param versions of CDict functions	2017-08-21 14:49:16 -07:00
Nick Terrell	29c2d9a4d0	[cover] Turn down notification for ZDICT subroutines	2017-08-21 14:28:31 -07:00
Nick Terrell	98de3f6847	[cover] Add dictionary size to compressed size	2017-08-21 14:23:17 -07:00
Yann Collet	78c3d16bf4	Merge pull request #809 from terrelln/dev [cover] Fix divide by zero	2017-08-21 13:33:19 -07:00
Nick Terrell	9a54a315aa	[cover] Convert score to U32 and check for zero	2017-08-21 13:30:07 -07:00
Stella Lau	fd8a25786e	Check parameters are valid in initCCtxParams	2017-08-21 13:23:35 -07:00
Stella Lau	1c0dbe81b1	Add documentation for CCtx_params	2017-08-21 13:18:00 -07:00
Nick Terrell	d49eb40c03	[cover] Stop when segmentSize is less than d	2017-08-21 13:10:03 -07:00
Stella Lau	939f954285	Pass ZSTD_CCtx_params as const ptr when possible	2017-08-21 12:57:18 -07:00
Stella Lau	73c73bf16a	Reduce code duplication in zstreamtest	2017-08-21 12:41:19 -07:00
Stella Lau	560b34f6d2	Return error code when initializing NULL cctxParams	2017-08-21 11:52:26 -07:00
Stella Lau	25be09c6b4	Set some parameters to zero before initializing cdict	2017-08-21 11:35:46 -07:00
Yann Collet	232d62b637	fixed a few headers that were too hastily copy/pasted during last license change	2017-08-21 11:24:32 -07:00
Nick Terrell	f306d400c0	[cover] Fix divide by zero	2017-08-21 11:12:11 -07:00
Stella Lau	502031ca10	Use cctxParam version of createCDict internally	2017-08-21 11:00:44 -07:00
Stella Lau	91b30dbe84	Remove test parameter	2017-08-21 10:09:06 -07:00
Stella Lau	f181f33bdf	Disable tests and refactor	2017-08-21 01:59:08 -07:00
Stella Lau	023b24e6d4	Add cctx param tests	2017-08-20 22:55:07 -07:00
Yann Collet	7db552676e	reduced pool queue to 0 to save memory fixed : pool performance when jobs are fires fast and queueSize==0	2017-08-19 15:07:54 -07:00
Stella Lau	6cee6e07e5	Add internal createCDict function	2017-08-18 22:48:31 -07:00
Stella Lau	d775519296	Add cctxParam versions of internal functions	2017-08-18 17:37:58 -07:00
Yann Collet	32fb407c9d	updated a bunch of headers for the new license	2017-08-18 16:52:05 -07:00
Stella Lau	63b8c98531	Pass cctx parameters to MTCtx	2017-08-18 16:17:24 -07:00
Stella Lau	399ae013d4	Add function to apply cctx params	2017-08-18 13:01:55 -07:00
Stella Lau	81d89d82a6	Move nbThreads to cctx params	2017-08-18 12:08:57 -07:00
Stella Lau	2300c58a6f	Move dictContentByRef to cctx params	2017-08-18 12:03:16 -07:00
Stella Lau	b6cb2ed8cb	Move dictMode to cctxParams	2017-08-18 11:43:31 -07:00
Stella Lau	97e27affcb	Move compression level to cctx params	2017-08-18 11:20:08 -07:00
Stella Lau	c0221124d5	Add function to set opaque parameters	2017-08-17 19:30:22 -07:00
Stella Lau	4169f49171	Add initialization/allocation functions for opaque params	2017-08-17 18:45:04 -07:00
Stella Lau	ade95b8bed	Add opaque interfaces for static initialization	2017-08-17 18:13:08 -07:00
Stella Lau	699f11b4f7	Create opaque parameter structure	2017-08-17 17:33:46 -07:00
Yann Collet	f9e6590715	Merge pull request #796 from terrelln/is-error [FSE][HUF] Inline error checks	2017-08-15 12:37:28 -07:00
Yann Collet	2dbcfc6994	Merge pull request #794 from terrelln/force-inline [libzstd] Fix FORCE_INLINE macro	2017-08-15 12:03:44 -07:00
Nick Terrell	07c6ff588e	[FSE][HUF] Inline error checks Caught by Clang's optimization remarks.	2017-08-15 11:23:28 -07:00
Nick Terrell	565e925eb7	[libzstd] Fix FORCE_INLINE macro	2017-08-14 21:12:05 -07:00
Roman Gershman	b9d4f4fb74	Fix ZSTD_estimateDStreamSize function after ZSTD_DStream and ZSTD_DCtx were merged	2017-08-13 13:29:42 +03:00
Nick Terrell	9ba97182d1	[CI] Add gcc7build test	2017-08-08 13:28:56 -07:00
Yann Collet	d9f2893eb9	Merge pull request #782 from terrelln/dstSizeTooSmall Fix compression failure on incompressible data	2017-08-07 14:52:02 -07:00
Yann Collet	8049556928	Merge pull request #778 from terrelln/bad-huff [libzstd] Fix bug in Huffman decompresser	2017-08-07 14:05:58 -07:00
Nick Terrell	abe12b3399	[libzstd] Fix bug in Huffman decompresser The zstd format specification doesn't enforce that Huffman compressed literals (including the table) have to be smaller than the uncompressed literals. The compressor will never Huffman compress literals if the compressed size is larger than the uncompressed size. The decompresser doesn't accept Huffman compressed literals with 4 streams whose compressed size is at least as large as the uncompressed size. * Make the decompresser accept Huffman compressed literals whose size increases. * Add a test case that exposes the bug. The compressed file has to be statically generated, since the compressor won't normally produce files that expose the bug.	2017-08-07 12:37:48 -07:00
Nick Terrell	308047eb5d	Fix compression failure on incompressible data If the destination buffer is the minimum allowed size in `ZSTD_compressSequences()` (2^17), then if the block isn't compressible compression might fail with `dstSize_tooSmall`, when it should instead emit a raw uncompressed block. Additionally, `ZSTD_compressLiterals()` implicitly called `ZSTD_noCompressLiterals()` if Huffman compression failed. Make that explicit.	2017-08-07 11:45:24 -07:00
Stella Lau	73ba58955f	Signal after finishing job when queueSize=0	2017-08-01 20:12:06 -07:00
Stella Lau	1d76da1d87	Replace marker with queueEmpty variable and update pool.h comment	2017-08-01 12:30:16 -07:00
Stella Lau	5adceeed01	Allow queueSize=0 in pool.c and update poolTests	2017-07-31 10:10:16 -07:00
Yann Collet	e1222544be	Merge pull request #753 from paulcruz74/adapt-approach-3 adaptive compression v1	2017-07-27 10:00:10 -07:00
Nick Terrell	ae20d413da	[libzstd] Fix CHECK_V_F macros	2017-07-25 12:52:01 -07:00
Yann Collet	a90b16e150	Visual blind fix 2	2017-07-20 15:57:55 -07:00
Yann Collet	b4d460f32c	pool.c : blindfix for Visual warnings	2017-07-20 01:13:14 -07:00
Yann Collet	3974d2b38a	blind fix for Windows Multithreading module adds a fake 0 return value for mutex/cond init	2017-07-19 13:33:21 -07:00
Paul Cruz	6945b3c43d	removed previous version of completion for compression	2017-07-19 11:51:50 -07:00
Yann Collet	b71363b967	check pthread_*_init() success condition	2017-07-19 01:05:40 -07:00
Nick Terrell	cc1522351f	[libzstd] Fix bug in Huffman encoding Summary: Huffman encoding with a bad dictionary can encode worse than the HUF_BLOCKBOUND(srcSize), since we don't filter out incompressible input, and even if we did, the dictionaries Huffman table could be ill suited to compressing actual data. The fast optimization doesn't seem to improve compression speed, even when I hard coded fast = 1, the speed didn't improve over hard coding it to 0. Benchmarks: $ ./zstd.dev -b1e5 Benchmarking levels from 1 to 5 1#Synthetic 50% : 10000000 -> 3139163 (3.186), 524.8 MB/s ,1890.0 MB/s 2#Synthetic 50% : 10000000 -> 3115138 (3.210), 372.6 MB/s ,1830.2 MB/s 3#Synthetic 50% : 10000000 -> 3222672 (3.103), 223.3 MB/s ,1400.2 MB/s 4#Synthetic 50% : 10000000 -> 3276678 (3.052), 198.0 MB/s ,1280.1 MB/s 5#Synthetic 50% : 10000000 -> 3271570 (3.057), 107.8 MB/s ,1200.0 MB/s $ ./zstd -b1e5 Benchmarking levels from 1 to 5 1#Synthetic 50% : 10000000 -> 3139163 (3.186), 524.8 MB/s ,1870.2 MB/s 2#Synthetic 50% : 10000000 -> 3115138 (3.210), 370.0 MB/s ,1810.3 MB/s 3#Synthetic 50% : 10000000 -> 3222672 (3.103), 223.3 MB/s ,1380.1 MB/s 4#Synthetic 50% : 10000000 -> 3276678 (3.052), 196.1 MB/s ,1270.0 MB/s 5#Synthetic 50% : 10000000 -> 3271570 (3.057), 106.8 MB/s ,1180.1 MB/s $ ./zstd.dev -b1e5 ../silesia.tar Benchmarking levels from 1 to 5 1#silesia.tar : 211988480 -> 73651685 (2.878), 429.7 MB/s ,1096.5 MB/s 2#silesia.tar : 211988480 -> 70158785 (3.022), 321.2 MB/s ,1029.1 MB/s 3#silesia.tar : 211988480 -> 66993813 (3.164), 243.7 MB/s , 981.4 MB/s 4#silesia.tar : 211988480 -> 66306481 (3.197), 226.7 MB/s , 972.4 MB/s 5#silesia.tar : 211988480 -> 64757852 (3.274), 150.3 MB/s , 963.6 MB/s $ ./zstd -b1e5 ../silesia.tar Benchmarking levels from 1 to 5 1#silesia.tar : 211988480 -> 73651685 (2.878), 429.7 MB/s ,1087.1 MB/s 2#silesia.tar : 211988480 -> 70158785 (3.022), 318.8 MB/s ,1029.1 MB/s 3#silesia.tar : 211988480 -> 66993813 (3.164), 246.5 MB/s , 981.4 MB/s 4#silesia.tar : 211988480 -> 66306481 (3.197), 229.2 MB/s , 972.4 MB/s 5#silesia.tar : 211988480 -> 64757852 (3.274), 149.3 MB/s , 963.6 MB/s Test Plan: I added a test case to the fuzzer which crashed with ASAN before the patch and succeeded after.	2017-07-18 13:20:40 -07:00
Yann Collet	77d67fb167	Merge pull request #766 from terrelln/real-block-split [libzstd] Pull optimal parser state out of seqStore_t	2017-07-18 08:26:24 -07:00
Yann Collet	14c83b05c7	Merge pull request #765 from terrelln/real-block-split [libzstd] Remove ZSTD_CCtx* argument of ZSTD_compressSequences()	2017-07-17 19:25:55 -07:00
Nick Terrell	7a28b9e4a3	[libzstd] Pull optimal parser state out of seqStore_t	2017-07-17 15:29:11 -07:00
Yann Collet	3381bf4b84	Merge pull request #764 from terrelln/real-block-split [libzstd] Refactor ZSTD_compressSequences()	2017-07-17 14:46:01 -07:00
Nick Terrell	e198230645	[libzstd] Remove ZSTD_CCtx* argument of ZSTD_compressSequences()	2017-07-17 12:27:24 -07:00
Nick Terrell	634f012420	[libzstd] Refactor ZSTD_compressSequences()	2017-07-17 11:36:11 -07:00
Paul Cruz	50ce4eaeb6	added error detection for pthread initialization, added compression completion measurement, fixed const values	2017-07-17 10:12:44 -07:00
Yann Collet	3b0cff3c33	fixed clang's -Wdocumentation	2017-07-13 18:58:30 -07:00
Yann Collet	2bd6440be0	pinned down error code enum values Note : all error codes are changed by this new version, but it's expected to be the last change for existing codes. Codes are now grouped by category, and receive a manually attributed value. The objective is to guarantee that error code values will not change in the future when introducing new codes. Intentionnal empty spaces and ranges are defined in order to keep room for potential new codes.	2017-07-13 17:12:16 -07:00
Nick Terrell	830ef4152a	[libzstd] Increase granularity of FSECTable repeat mode	2017-07-13 12:45:39 -07:00
Yann Collet	d985319337	Merge pull request #759 from terrelln/real-block-split [libzstd] Pull CTables into sub-structure	2017-07-13 10:24:19 -07:00
Yann Collet	3a60efd3a9	policy change : ZSTDMT automatically caps nbThreads to ZSTDMT_NBTHREADS_MAX (#760 ) Previously, ZSTDMT would refuse to create the compressor. Also : increased ZSTDMT_NBTHREADS_MAX to 256, updated doc, and added relevant test	2017-07-13 10:17:23 -07:00
Yann Collet	132e6efd76	switched ZSTDMT_compress_advanced() last argument to overlapLog overlapRLog (== 9 - overlapLog) was a bit "strange" as all other public entry points use overlapLog	2017-07-13 02:22:58 -07:00
Yann Collet	4e77f7761d	clarified comment on ZSTD_p_contentSizeFlag	2017-07-13 02:09:07 -07:00
Nick Terrell	de0414b736	[libzstd] Pull CTables into sub-structure	2017-07-12 19:49:19 -07:00
Yann Collet	8ef666c325	slightly increased buffer pool, to cover normal "full load" scenarios 2 buffers per active worker + 1 buffer for input loading + 1 buffer for "next input" when submitting current one + 1 buffer stuck in queue	2017-07-12 14:23:34 -07:00
Yann Collet	052a95f77c	fix : ZSTDMT_compress_advanced() correctly generates checksum when params.fParams.checksumFlag==1. This use case used to be impossible when only ZSTD_compress() was available	2017-07-11 17:18:26 -07:00
Yann Collet	2a62f48bf4	release input buffers from inside worker thread buffers are released sooner, which makes them available faster for next job. => decreases total nb of buffers necessary	2017-07-11 15:56:40 -07:00
Yann Collet	57236184af	buffer pool : all buffers have same size to reduce memory fragmentation. They can be used for in or out, interchangeably.	2017-07-11 15:17:25 -07:00
Yann Collet	34b2b95631	zstdmt : intermediate outBuffer allocated from within worker reduces total amount of memory needed, since jobs in queue do not have an outBuffer pre-reserved now	2017-07-11 14:59:10 -07:00
Yann Collet	16261e6951	buffer pool can be invoked from multiple threads	2017-07-11 14:14:07 -07:00
Yann Collet	ef0ff7fe7f	zstdmt: removed margin for improved memory usage	2017-07-11 08:54:29 -07:00
Yann Collet	4616fad18b	improved ZSTDMT_compress() memory usage does not need the input buffer for streaming operations also : reduced a few tests time length	2017-07-10 17:16:41 -07:00
Yann Collet	670b1fc547	optimized memory usage for ZSTDMT_compress() Previously, each job would reserve a CCtx right before being posted. The CCtx would be "part of the job description", and only released when the job is completed (aka flushed). For ZSTDMT_compress(), which creates all jobs first and only join at the end, that meant one CCtx per job. The nb of jobs used to be == nb of threads, but since latest modification, which reduces the size of jobs in order to spread the load of difficult areas, it also increases the nb of jobs for large sources / small compression level. This resulted in many more CCtx being created. In this new version, CCtx are reserved within the worker thread. It guaranteea there cannot be more CCtx reserved than workers (<= nb threads). To do that, it required to make the CCtx Pool multi-threading-safe : it can now be called from multiple threads in parallel.	2017-07-10 16:30:55 -07:00
Yann Collet	3510efb02d	fix : custom allocator correctly propagated to child contexts	2017-07-10 14:21:40 -07:00
Yann Collet	88da8f1816	fix : propagate custom allocator to ZSTDMT though ZSTD_CCtx_setParameter() also : compile fuzzer with MT enabled	2017-07-10 14:02:33 -07:00
Yann Collet	e32fb0c1fe	added ZSTD_sizeof_CCtx() test	2017-07-10 12:29:57 -07:00
Yann Collet	40156a4967	bumped version nb to v1.3.1	2017-07-08 04:55:09 -07:00
Yann Collet	0f4fc6c20a	fixed several conversion warnings	2017-07-07 17:13:12 -07:00
Yann Collet	9bde061a0b	fixed minor Visual compilation limitation	2017-07-07 16:14:17 -07:00
Yann Collet	593d517ebf	fixed minor cast warning	2017-07-07 16:09:47 -07:00
Yann Collet	ead4dd48f6	new field frameHeader.headerSize	2017-07-07 15:51:24 -07:00
Yann Collet	46396523c0	ZSTD_getFrameHeader : control of windowSize limits is delegated to caller Extracting frame header is a separate operation. It's now possible to get frame header, whatever the window size set in it.	2017-07-07 15:32:12 -07:00
Yann Collet	990449b89d	new field : ZSTD_frameHeader.frameType Makes frame type (zstd,skippable) detection more straighforward. ZSTD_getFrameHeader set frameContentSize=ZSTD_CONTENTSIZE_UNKNOWN to mean "field not present"	2017-07-07 15:21:35 -07:00
Yann Collet	e622330a3b	extended frameHeader.windowSize to unsigned long long	2017-07-07 14:19:01 -07:00
Yann Collet	f04deff4fc	fixed #718 , reported by @GregSlazinski, solution suggested by @mcmilk	2017-07-06 01:42:46 -07:00
Yann Collet	d75c0e71c4	minor code refactoring	2017-07-05 18:10:07 -07:00
Yann Collet	49af41820d	clarified status of zstdmt_compress.h API	2017-07-05 17:21:13 -07:00
Yann Collet	27e883371d	fixed wrong assert() condition A single job created by ZSTDMT_compress() can be < 256KB if data to compress is < 256 KB (in which case it is delegated to single thread mode)	2017-07-04 19:33:16 -07:00
Yann Collet	2cb9774f5e	more precise estimation of amount to flush at end of stream (single thread mode) also : can use DEBUGLEVEL variable in /tests	2017-07-04 12:39:26 -07:00
Yann Collet	6383372dec	fixed : 0-copy in NULL is UB	2017-07-04 10:36:41 -07:00
Yann Collet	5051dd39ca	Merge pull request #743 from facebook/fullbench compress_generic() automatic optimization opportunities	2017-07-03 21:26:38 -07:00
Yann Collet	2de2396a36	refactor ZSTDMT_compress()	2017-07-03 16:23:36 -07:00
Yann Collet	2084b041f4	fixed comments	2017-07-03 15:52:19 -07:00
Yann Collet	5a77361595	fixed wrong function name in comment	2017-07-03 15:21:24 -07:00
Nick Terrell	c80fc50a8d	[libzstd] Fix memcpy() on potential NULL source * `ZSTD_decompressStream_generic()` `ip` may be `NULL` for one of the calls to `memcpy()` * Assert the source is not `NULL` for calls to `memcpy()` where I believe the source should not be `NULL`.	2017-07-03 12:31:55 -07:00
Yann Collet	2485f88bf8	fixed legacy version init bug	2017-07-01 09:09:34 -07:00
cyan4973	21fdf97e00	Merge branch 'dev' into fullbench	2017-07-01 07:01:08 -07:00
cyan4973	1bafe393e4	fix : ZSTDMT_compressStream_generic() can accept NULL input also : converge implementations towards new version of ZSTDMT_compressStream_generic()	2017-07-01 06:59:24 -07:00
Yann Collet	58bd0e70fc	fixed : dictionary compression with new advanced API in Multi-threading mode	2017-06-30 16:01:02 -07:00
Yann Collet	d8b33a598d	Optimized ZSTDMT single-pass mode speed on large sources by ensuring job sizes remain "not too large"	2017-06-30 15:44:57 -07:00
Yann Collet	d5c046c609	implemented shortcut for zstd_compress_generic() in MT mode added ZSTDMT_compress_advanced() API	2017-06-30 14:51:01 -07:00
Yann Collet	7f40bb1c39	Merge pull request #742 from stellamplau/stack-space Reduce stack usage of HUF_readDTableX4 and HUF_readDTableX2	2017-06-30 14:50:23 -07:00
Stella Lau	32df49e9f8	Fix typo	2017-06-30 12:56:24 -07:00
Stella Lau	b0513b519c	Add comment to HUF_DECOMPRESS_WORKSPACE_SIZE	2017-06-30 12:53:56 -07:00
Stella Lau	4c71f59c77	Clarify typedef of rankVal_t and rankValCol_t	2017-06-30 09:52:20 -07:00
Stella Lau	28f711ef95	Rename ALIGN and ALIGN_MASK to HUF_ALIGN and HUF_ALIGN_MASK	2017-06-30 09:38:11 -07:00
Stella Lau	70ad6829e7	Delegate HUF_decompress4X_hufOnly to workspace version	2017-06-29 16:22:32 -07:00
Stella Lau	104c4d57c1	Fix bitshift error	2017-06-29 15:40:49 -07:00
Yann Collet	a3d9926c40	compression optimization opportunity switch to single-pass mode directly into output buffer when outputSize >= ZSTD_compressBound(inputSize). Speed gains observed with fullbench (~+15% on level 1)	2017-06-29 14:44:49 -07:00
Stella Lau	fedc94de8c	Fix pointer casting warning	2017-06-29 13:04:15 -07:00
Stella Lau	c6a5275a28	Fix alignment warnings with pointer casting	2017-06-29 12:39:34 -07:00
Stella Lau	99e315999c	Reduce stack usage of HUF_readDTableX4 and HUF_readDTableX2	2017-06-29 11:49:59 -07:00
Yann Collet	97f2bf66da	minor : fix typo	2017-06-29 11:31:40 -07:00
Yann Collet	acbef3decd	ZSTD_getFrameContentSize() is promoted to "stable" status	2017-06-29 05:19:51 -07:00
Yann Collet	590937df20	Merge pull request #739 from facebook/refPrefix ZSTD_refPrefix	2017-06-29 04:36:03 -07:00
Yann Collet	811deaea6f	Merge pull request #736 from terrelln/cover-default-api [zdict] Make COVER the default algorithm	2017-06-28 20:25:36 -07:00
Yann Collet	037466245f	refactor ZSTD_check_compressionLevel_monotonicIncrease_memoryBudget() use less macro statements the initial version was meant to work with STATIC_ASSERT but since it doesn't work and needs assert() it's possible to rewrite it using normally compiled code which is better for compiler. Downside : the error message is less precise. There is a DEBUGLOG(3,) to compensate.	2017-06-28 20:24:08 -07:00
Yann Collet	2bf428df45	Merge branch 'advancedAPI2' into refPrefix	2017-06-28 16:35:49 -07:00
Yann Collet	1ca76039af	fixed -Wdeclaration-after-statement	2017-06-28 15:40:21 -07:00
Yann Collet	813535105b	added function to control monotonic memory budget increase of ZSTD_defaultCParameters[0] It's a runtime test, based on assert(), played once, on first ZSTD_getCParams() usage, when ZSTD_DEBUG is enabled.	2017-06-28 15:34:56 -07:00
Yann Collet	adbe74a8ac	adjusted compression levels to guarantee a monotonically increasing memory budget	2017-06-28 13:22:37 -07:00
Yann Collet	33a6639039	fixed ZSTD_refPrefix with Multithread-enabled CCtx	2017-06-28 11:09:43 -07:00
Yann Collet	2e4274262d	controlled dictMode	2017-06-27 17:09:12 -07:00
Yann Collet	b7372933b8	implemented ZSTD_refPrefix()	2017-06-27 15:49:12 -07:00
Yann Collet	7d3816183f	exposed ZSTD_MAGIC_DICTIONARY in zstd.h makes it easier to explain ZSTD_dictMode	2017-06-27 13:50:34 -07:00
Yann Collet	fecc721fd9	added parameter ZSTD_p_refDictContent	2017-06-27 11:46:39 -07:00
Nick Terrell	5b7fd7c422	[zdict] Make COVER the default algorithm	2017-06-26 21:09:22 -07:00
Yann Collet	c7fb884eea	fixed minor conversion warning	2017-06-26 18:02:23 -07:00
Yann Collet	dde10b23fe	refactored ZSTD_estimateDStreamSize() now uses windowSize as argument. Also : created ZSTD_estimateDStreamSize_fromFrame()	2017-06-26 17:44:26 -07:00
Yann Collet	09ae03a570	ZSTD_estimateCDictSize_advanced() ZSTD_estimateCDictSize() now uses same arguments as ZSTD_createCDict() ZSTD_estimateCDictSize_advanced() uses same arguments as ZSTD_createCDict_advanced()	2017-06-26 16:47:32 -07:00
Yann Collet	0c9a915a28	ZSTD_estimateCStreamSize_advanced()	2017-06-26 16:02:25 -07:00
Yann Collet	31af8290d1	ZSTD_estimateCCtx_advanced() ZSTD_estimateCCtx() is now a "simple" function, taking int compressionLevel as single argument. ZSTD_estimateCCtx_advanced() takes a CParams argument, which is both more complete and more complex to generate.	2017-06-26 15:52:39 -07:00
Yann Collet	ef269c1b68	Merge pull request #725 from facebook/advancedAPI2 New Advanced API	2017-06-23 09:50:47 -07:00
Yann Collet	ecb0f46866	add controls over streaming buffers	2017-06-21 17:25:01 -07:00
Yann Collet	dce789281b	fixed : decompression of skippable frames in streaming mode	2017-06-21 15:53:42 -07:00
Yann Collet	204b6b7ef6	fixed streaming buffered allocation with CDict compression	2017-06-21 15:13:00 -07:00
Yann Collet	1e4129b27b	fixed dangling pointer risk, detected by @terrelln	2017-06-21 13:26:10 -07:00
Yann Collet	83095970e6	free cdictLocal faster, suggested by @terrelln	2017-06-21 12:26:40 -07:00
Yann Collet	7bd1a2900e	added ZSTD_dictMode_e to control dictionary loading mode	2017-06-21 11:50:33 -07:00
Yann Collet	9c56b12938	Merge pull request #723 from paulcruz74/dev Adding zstd -l	2017-06-21 09:41:55 -07:00
Yann Collet	e51d51bdf7	fixed memcpy() overlap	2017-06-20 17:44:55 -07:00
Yann Collet	466f92eaa6	removed one useless streaming compression stage, detected by @terrelln	2017-06-20 16:25:29 -07:00
Yann Collet	c3bce24ef4	fixed potential dangling pointer, detected by @terrelln	2017-06-20 16:09:11 -07:00
Yann Collet	78b8234554	fixed comments, following suggestion by @terrelln	2017-06-20 14:26:48 -07:00
Yann Collet	b44ab82f7a	ensure new ZSTD_strategy starts at value 1	2017-06-20 14:11:49 -07:00
Yann Collet	c08e649e95	first implementation of bench.c with new API ZSTD_compress_generic() Doesn't speed optimize this buffer-to-buffer scenario yet. Still internally defers to streaming implementation. Also : fixed a long standing bug in ZSTDMT streaming API.	2017-06-19 18:25:35 -07:00
Yann Collet	695a0a3449	fixed IA64 compilation error, by @mcmilk	2017-06-19 15:27:30 -07:00
Yann Collet	fe234bf48b	fix attempts : fullbench for VS2008	2017-06-19 15:23:19 -07:00
Nick Terrell	55f9cd4942	[libzstd] Fix UBSAN failure	2017-06-19 15:12:28 -07:00
Yann Collet	bf99150be3	update new api presentation in zstd.h and manual	2017-06-19 12:56:25 -07:00
Yann Collet	d7a3bffba9	new api : setting compression parameters is refused if a dictionary is already loaded	2017-06-19 11:53:01 -07:00
Yuri	92bafda406	INSTALL_DATA instead of INSTALL_LIB for libzstd.a INSTALL_LIB can be passed -s flag to strip symbols. Static libraries should not be stripped, only dynamic ones should be stripped.	2017-06-17 00:23:41 -07:00
Yann Collet	381e66cfbd	added ZSTD_clampCParams() now ZSTD_adjustCParams() is always successful, it always produces a valid CParams	2017-06-16 17:34:54 -07:00
Yann Collet	aee916e37c	fixed +/-1 error for pledgedSrcSizePlusOne	2017-06-16 17:02:35 -07:00
Yann Collet	d3de3d51a3	fix attempt 2 : Visual sign conversion warning	2017-06-16 16:51:33 -07:00
Yann Collet	944be54774	fixed attempt : minor Visual sign conversion warning	2017-06-16 14:05:01 -07:00
Yann Collet	b26728c9c8	added ZSTD_startNewCompression()	2017-06-16 14:00:46 -07:00
Yann Collet	a0ba849fe6	changed frameContentSize field to pledgedSrcSizePlusOne pledgedSrcSize is proper : it's a promise, not yet fulfilled. It will be controlled at the end. PlusOne is meant to have 0 (default) == unknown	2017-06-16 13:29:17 -07:00
Yann Collet	2cf7755da7	fix : pledgedSrcSize correctly reset to unknown in "continue" mode	2017-06-16 12:34:41 -07:00
Yann Collet	9e73f2f320	fix : correctly reset pledgedSrcSize to unknown status when starting a new compression with an existing context	2017-06-16 12:24:01 -07:00
Yann Collet	33873f0e74	fixed : new advanced AIP : setting nbThreads to the same value > 1	2017-06-16 12:04:21 -07:00
Yann Collet	559ee82e90	fixed : calling ZSTD_compress_generic() to end-flush a stream in multiple steps	2017-06-16 11:58:21 -07:00
Yann Collet	bd18c885a3	added ZSTD_CCtx_reset	2017-06-16 10:17:50 -07:00
Yann Collet	cc9f9b7f4c	protection : ZSTD_CONTENTSIZE_UNKNOWN automatically disables contentSizeFlag	2017-06-15 18:17:34 -07:00
Yann Collet	05ae4b2190	added protection : MT incompatible with Static allocation	2017-06-15 18:03:34 -07:00
Paul Cruz	a9b77c83e5	cleaning up code for analyzing frames	2017-06-15 14:13:28 -07:00
Yann Collet	f129fd3970	disabled MT code path when ZSTD_MULTITHREAD is not defined	2017-06-11 18:46:09 -07:00
Yann Collet	23aace9778	added control stage to MT mode	2017-06-11 18:32:36 -07:00
Yann Collet	f35e2de61c	linked newAPI to ZSTDMT	2017-06-05 18:32:48 -07:00
cyan4973	c59162e053	minor fix for -Wdocumentation	2017-06-05 00:12:13 -07:00
cyan4973	8bcbf42617	fixed g++ prototype mismatch	2017-06-04 23:52:00 -07:00
Yann Collet	8c910d2097	updated ZSTDMT streaming API ZSTDMT streaming API is now similar and has same capabilites as single-thread streaming API. It makes it easier to blend them together.	2017-06-03 01:15:02 -07:00
Yann Collet	58e8d793e1	made debug definitions common within zstd_internal.h	2017-06-02 18:20:48 -07:00
Yann Collet	8ddf4c22d5	fixed missing initialization	2017-06-02 17:16:49 -07:00
Yann Collet	33a7e679e5	significant zlib wrapper code refactoring code indentation variable scope and names constify Only coding style changes. The logic should remain the same.	2017-06-02 17:10:49 -07:00
Yann Collet	4effccbf56	zlib_wrapper's uncompress() uses ZSTD_isFrame() for routing more generic and safer than using own routing for magic number comparison	2017-06-02 14:27:11 -07:00
Yann Collet	dcb7535352	ensure zlibwrapper uses ZSTD_malloc() and ZSTD_free() which is compatible with { NULL, NULL, NULL }	2017-06-02 14:01:21 -07:00
Yann Collet	b877e834b1	minor indent	2017-06-02 13:47:11 -07:00
Yann Collet	6056e4c3eb	added POOL_sizeof() for single-thread	2017-06-02 11:36:47 -07:00
Yann Collet	c35e535002	added support for multithreading parameters	2017-06-01 18:44:06 -07:00
Yann Collet	c4a5a21c5c	created ZSTDMT_sizeof_CCtx() and POOL_sizeof() required by ZSTD_sizeofCCtx() while adding a ZSTDMT_CCtx*	2017-06-01 17:56:14 -07:00
Yann Collet	cd2892fd1e	protected impossible switch(){default:} with assert(0) can be converted into assume(0) in some future	2017-06-01 09:44:54 -07:00
Yann Collet	06589fe516	Merge branch 'advancedAPI2' of github.com:facebook/zstd into advancedAPI2	2017-05-31 10:03:20 -07:00
Yann Collet	18ab5affa5	fixed visual warning	2017-05-31 09:59:22 -07:00
Yann Collet	9a691e0f55	fixed visual warnings	2017-05-31 01:17:44 -07:00
Yann Collet	01b1549f83	finally converted ZSTD_compressStream_generic() to use {in,ou}Buffer replacing the older read/write variables from ZBUFF_* era. Mostly to help code readability. Fixed relevant callers.	2017-05-30 18:10:26 -07:00
Yann Collet	c4f46b94ce	ZSTD_createCCtx_advanced() now uses ZSTD_calloc() initially uses calloc() instead of memset(). Performance improvement is unlikely measurable, since ZSTD_CCtx is now very small, with all tables transferred into workSpace.	2017-05-30 17:45:37 -07:00
Yann Collet	deee6e523f	expose ZSTD_compress_generic_simpleArgs() which is a binding towards ZSTD_compress_generic() using only integral types for arguments.	2017-05-30 17:42:00 -07:00
Yann Collet	ae728a43b8	removed defaultCustomMem now ZSTD_customCMem is promoted as new default. Advantages : ZSTD_customCMem = { NULL, NULL, NULL}, so it's natural default after a memset. ZSTD_customCMem is public constant (defaultCustomMem was private only). Also : makes it possible to introduce ZSTD_calloc(), which can now default to stdlib's calloc() when it detects system default. Fixed zlibwrapper which depended on defaultCustomMem.	2017-05-30 17:11:39 -07:00
Yann Collet	5bcef1ada2	removed mtctx->cstream use the first cctx in pool when ZSTDMT is used in single-thread mode now that cctx and cstream are the same object.	2017-05-30 16:37:19 -07:00
Yann Collet	beb62b15a8	Merge branch 'dev' into advancedAPI2 Fixed conflic in zstd_decompress.c	2017-05-30 16:18:57 -07:00
Yann Collet	44e45e8423	added ZSTDMT_createCCtx_advanced() make it possible to use custom allocators	2017-05-30 16:12:06 -07:00
Yann Collet	f45ca527a1	Merge branch 'advancedAPI2' of github.com:facebook/zstd into advancedAPI2	2017-05-30 10:02:03 -07:00
Yann Collet	b6dec4c3ae	fixed minor cast warning	2017-05-27 17:09:06 -07:00
Yann Collet	e071159101	mtctx->jobs allocate its own memory space to make ZSTDMT_CCtx_s size predictable so that it can be included in CCtx	2017-05-27 00:21:33 -07:00
Yann Collet	b8136f019a	static dctx is incompatible with legacy support documented, and runtime tested	2017-05-27 00:03:08 -07:00
Yann Collet	7028cbd7fd	fixed a few code comments : ZSTD_getFrameParams => ZSTD_getFrameHeader	2017-05-25 18:29:08 -07:00
Yann Collet	cdf7e82222	Added ZSTD_initStaticCDict()	2017-05-25 18:05:49 -07:00
Dmitry V. Levin	1ea655c765	Fix typo in libzstd.a-mt make rules The macro name is ZSTD_MULTITHREAD, not ZSTD_MULTHREAD. Fixes: `ca6fae7808` ("Add MT enabled targets for libzstd")	2017-05-25 23:43:05 +00:00
Yann Collet	57827f906f	added ZSTD_initStaticDDict()	2017-05-25 15:44:06 -07:00
Yann Collet	25989e361c	updated ZSTD_estimate?DictSize() to pass parameter byReference resulting ?Dict object is smaller when created byReference. Seems better than a documentation note.	2017-05-25 15:07:37 -07:00
Yann Collet	0fdc71c3dc	added ZSTD_initStaticDCtx()	2017-05-24 17:41:41 -07:00
Yann Collet	ba183005d3	merged DStream's inBuff and outBuff into a single buffer Saves one malloc(). Also : makes it easier to implement static allocation	2017-05-24 15:42:24 -07:00
Nick Terrell	55fc1f91fd	[zstd] Fix up formatting edge cases for clang-format	2017-05-24 13:50:10 -07:00
Yann Collet	2e4db3e531	fixed performance regression with ZSTD_decompress() on small files memset() was a quick fix to initialization problems, but initialize too much space (tables, buffers) which show up in decompression speed of ZSTD_decompress() since it needs to recreate DCtx at each invocation. Fixed by only initialization relevant pointers and size fields.	2017-05-24 13:15:19 -07:00
Yann Collet	11ea2f7fda	Merged ZSTD_DCtx and ZSTD_DStream objects They are now the same object. It's recommended to keep both types in source code as previous versions of library (<v1.3.0) still need this differentiation.	2017-05-23 16:19:43 -07:00
Yann Collet	b81f19ffce	move MEM_readMINMATCH() into zstd_opt.h which is its only user. Use case too narrow to belong to mem.h. renamed to ZSTD_readMINMATCH()	2017-05-23 15:41:55 -07:00
Yann Collet	c7fe262dc9	added ZSTD_initStaticCCtx() makes it possible to statically or externally allocate CCtx. static CCtx will only use provided memory area, it will never resize nor malloc.	2017-05-23 13:20:41 -07:00
Yann Collet	5ac72b417c	Buffered are now allocated inside workSpace	2017-05-23 11:18:24 -07:00
Yann Collet	1880337c30	Simplifier compression call graph Everything converge towards ZSTD_compressBegin_internal which delegated to ZSTD_copyCCtx_internal if cdict!=NULL. This simplifies routing which was previously depending on cdict.	2017-05-22 18:21:51 -07:00
Yann Collet	b0739bcf8f	simplified reset by removing full-reset policy this was meant to be applied prior to dictionary loading. But effectively, it seems redundant with later loading stage, so it can be skipped safely.	2017-05-22 17:45:15 -07:00
Yann Collet	1ad7c82eb5	Implemented separation between requested and applied parameters first version to pass cli tests with -DZSTD_NEWAPI	2017-05-22 17:06:04 -07:00
Yann Collet	24de7b0346	Implemented ZSTD_CCtx_refCDict()	2017-05-22 13:05:45 -07:00
Yann Collet	ee970398b2	Merge branch 'dev' into advancedAPI2	2017-05-22 12:33:56 -07:00
Yann Collet	8b21ec42a9	ZSTD_compress_generic() can handle dictionary compression	2017-05-19 19:46:15 -07:00
Nick Terrell	a1280406b0	[libzstd] Allow users to define custom visibility	2017-05-19 18:01:59 -07:00
Yann Collet	334a288d0d	ZSTD_CCtx_setParameter() only works during initialization stage and generate a stage_wrong error otherwise.	2017-05-19 11:04:41 -07:00
Yann Collet	48855fa0d2	fixed declaration-after-statement warning	2017-05-19 10:56:11 -07:00
Yann Collet	fa3671eac7	changed ZSTD_BLOCKSIZE_ABSOLUTEMAX into ZSTD_BLOCKSIZE_MAX Also : change ZSTD_getBlockSizeMax() into ZSTD_getBlockSize() created ZSTD_BLOCKSIZELOG_MAX	2017-05-19 10:51:30 -07:00
Yann Collet	009d604e00	ZSTD_compress_generic() supports multiple successive frames also : clarified streaming API implementation	2017-05-19 10:17:59 -07:00
Yann Collet	6d4fef36de	Added ZSTD_compress_generic() Used in fileio.c (zstd cli). Need to set macro ZSTD_NEWAPI to trigger it.	2017-05-17 18:36:15 -07:00
Yann Collet	23c256e44b	removed useless variable from CCtx CStream's pledgedSrcSize is no longer necessary srcSize control is realized within bufferless interface.	2017-05-16 18:10:11 -07:00
Yann Collet	9f95e445ab	minor comment clarifications	2017-05-16 17:26:43 -07:00
Yann Collet	0bdb575c31	Merge branch 'dev' into advancedAPI2	2017-05-16 16:32:29 -07:00
Yann Collet	7101434ec9	pedantic : added one error check on a function which (today) never fails. But who knows, maybe tomorrow ...	2017-05-16 16:28:24 -07:00
Yann Collet	bfff8999c5	added prototype ZSTD_versionString()	2017-05-16 16:12:23 -07:00
Yann Collet	4eff8136aa	added prototype ZSTD_decompressBegin_usingDDict (#700 )	2017-05-16 16:05:27 -07:00
Yann Collet	2d4d31c18a	removed gcc compilation flag -Wbad-function-cast It makes it more difficult to directly cast the result of a function, requiring to store the result in an intermediate variable. It does not necessarily help readability, and this restriction can be difficult to overcome in some constructions, like some macros. also : fixed minor Visual conversion warnings in datagencli.c	2017-05-16 11:34:38 -07:00
Yann Collet	133f0aee54	fixed redundant declarations in legacy v0.5 and v0.7 decoders triggered by new flag -Wredundant-decls	2017-05-15 17:44:04 -07:00
Yann Collet	83d0c764dc	added several compilation flags	2017-05-15 17:15:46 -07:00
Yann Collet	a5ffe3d370	pushed enum values for strategy by one (ZSTD_fast==1) this makes it possible to use `0` to mean: "do not change strategy"	2017-05-12 16:29:19 -07:00
Yann Collet	add66f816d	changed macro LOADCPARAMS by static function ZSTD_cLevelToCParams() for improved compiler checks. Also : ensure most parameters can receive value "0" to mean "do not change".	2017-05-12 16:01:15 -07:00
Yann Collet	b0edb7fb0e	added ZSTD_CCtx_setParameter()	2017-05-12 15:31:53 -07:00
Yann Collet	ef738c1b23	better error code when compressing using NULL CDict which is not allowed (but detected, and generates an error).	2017-05-12 13:55:25 -07:00
Yann Collet	db8e21d5a0	made ZSTD_compress_generic() definition accessible note that the implementation is not done yet.	2017-05-12 13:46:49 -07:00
Yann Collet	33eb7ac6b6	updated Advanced API proposal only declarations in zstd.h	2017-05-12 12:36:11 -07:00
Yann Collet	bd1964a988	Merge pull request #696 from joscollin/wip-lib-legacy-fallthrough-warn lib/legacy: warning: this statement may fall through	2017-05-11 10:45:01 -07:00
Yann Collet	4c1cfc0bb6	Merge pull request #695 from joscollin/wip-lib-compress-fallthrough-warn lib/compress: warning: this statement may fall through	2017-05-11 10:44:27 -07:00
Jos Collin	280510f2d5	lib/legacy: warning: this statement may fall through The following warning appears during build at sevaral places. ../lib/legacy/zstd_v04.c:819:40: warning: this statement may fall through [-Wimplicit-fallthrough=] case 7: bitD->bitContainer += (size_t)(((const BYTE)(bitD->start))[6]) << (sizeof(size_t)8 - 16); ../lib/legacy/zstd_v05.c:821:40: warning: this statement may fall through [-Wimplicit-fallthrough=] case 7: bitD->bitContainer += (size_t)(((const BYTE)(bitD->start))[6]) << (sizeof(size_t)8 - 16); ../lib/legacy/zstd_v06.c:913:40: warning: this statement may fall through [-Wimplicit-fallthrough=] case 7: bitD->bitContainer += (size_t)(((const BYTE)(srcBuffer))[6]) << (sizeof(bitD->bitContainer)8 - 16); ../lib/legacy/zstd_v07.c:583:40: warning: this statement may fall through [-Wimplicit-fallthrough=] case 7: bitD->bitContainer += (size_t)(((const BYTE)(srcBuffer))[6]) << (sizeof(bitD->bitContainer)8 - 16); Signed-off-by: Jos Collin <jcollin@redhat.com>	2017-05-11 14:27:40 +05:30
Jos Collin	7cd7a7564b	lib/compress: warning: this statement may fall through The following warning appears during build. ../lib/compress/huf_compress.c: In function ‘HUF_compress1X_usingCTable’: ../lib/compress/huf_compress.c:444:8: warning: this statement may fall through [-Wimplicit-fallthrough=] if (sizeof((stream)->bitContainer)8 < HUF_TABLELOG_MAX4+7) HUF_FLUSHBITS(stream) ^ ../lib/compress/huf_compress.c:465:18: note: in expansion of macro ‘HUF_FLUSHBITS_2’ HUF_FLUSHBITS_2(&bitC); ^~~~~~~~~~~~~~~ ../lib/compress/huf_compress.c:466:9: note: here case 2 : HUF_encodeSymbol(&bitC, ip[n+ 1], CTable); ../lib/compress/zstd_compress.c: In function ‘ZSTD_compressStream_generic’: ../lib/compress/zstd_compress.c:3366:34: warning: this statement may fall through [-Wimplicit-fallthrough=] zcs->streamStage = zcss_flush; /* pass-through to flush stage */ ~~~~~~~~~~~~~~~~~^~~~~~~~~~~~ ../lib/compress/zstd_compress.c:3369:9: note: here case zcss_flush: Signed-off-by: Jos Collin <jcollin@redhat.com>	2017-05-11 13:17:26 +05:30
Jos Collin	05286fdd5a	lib/common: warning: this statement may fall through The following warning appears during the build. Fixed the review comments too. zstd/lib/common/bitstream.h: In function ‘BIT_initDStream’: zstd/lib/common/bitstream.h:277:33: warning: this statement may fall through [-Wimplicit-fallthrough=] case 7: bitD->bitContainer += (size_t)(((const BYTE)(srcBuffer))[6]) << (sizeof(bitD->bitContainer)8 - 16); Signed-off-by: Jos Collin <jcollin@redhat.com>	2017-05-11 09:10:02 +05:30
Nick Terrell	374f868354	Update whitespace	2017-05-10 17:48:42 -07:00
Nick Terrell	5f2c7213c7	Merge remote-tracking branch 'upstream/dev' into btopt * upstream/dev: (305 commits) added test for ZSTD_estimateCStreamSize() changed variable name, for clarity fixed ZSTD_estimateCStreamSize() shortened ZSTD_createCStream_Advanced() fixed symbols test added ZSTD_estimateDStreamSize() changed name frameParams into frameHeader regroup memory usage function declarations separated ZSTD_estimateCStreamSize() from ZSTD_estimateCCtxSize() bumped version number added ZSTD_estimateCDictSize() and ZSTD_estimateDDictSize() Updated ZSTD_freeCCtx() updated ZSTD_estimateCCtxSize() Updated ZSTD_sizeof_CCtx() merged CCtx and CStream as a single same object cli : -d and -t do not stop after a failed decompression added dev branch CircleCI badge added dev branch Appveyor badge keep dev branch status only creates a binary archive without the `programs` directory ...	2017-05-10 16:49:58 -07:00
Yann Collet	ba41b26405	Merge pull request #689 from facebook/cctxMerge Cctx merge	2017-05-10 14:53:54 -07:00
Yann Collet	cef02d9317	changed variable name, for clarity fhiPtr -> zfhPtr https://github.com/facebook/zstd/pull/689#discussion_r115638676	2017-05-10 11:14:08 -07:00
Yann Collet	669346fe8b	fixed ZSTD_estimateCStreamSize() https://github.com/facebook/zstd/pull/689#discussion_r115637721	2017-05-10 11:08:00 -07:00
Yann Collet	6fb2f24132	shortened ZSTD_createCStream_Advanced() https://github.com/facebook/zstd/pull/689#discussion_r115637613	2017-05-10 11:06:06 -07:00
Yann Collet	f16f4497ca	added ZSTD_estimateDStreamSize()	2017-05-09 16:18:17 -07:00
Yann Collet	542c9dfcf8	changed name frameParams into frameHeader ZSTD_frameParams => ZSTD_frameHeader ZSTD_getFrameParams() -> ZSTD_getFrameHeader() The new naming is more distinctive from ZSTD_frameParameters, which is used during compression. ZSTD_frameHeader is clearer in its intention to described frame header content. It also implies we are decoding a ZSTD frame, hence we are at decoding stage.	2017-05-09 15:46:07 -07:00
Yann Collet	5a36c069e7	regroup memory usage function declarations in a single paragraph in zstd.h, for clarity	2017-05-09 15:11:30 -07:00
Yann Collet	fa8dadb294	separated ZSTD_estimateCStreamSize() from ZSTD_estimateCCtxSize() for clarity	2017-05-08 18:24:16 -07:00
Yann Collet	51652522a2	bumped version number	2017-05-08 17:52:46 -07:00
Yann Collet	a1d6704d7f	added ZSTD_estimateCDictSize() and ZSTD_estimateDDictSize() it complements ZSTD_estimateCCtxSize() for the special case of ZSTD_initCStream_usingDict()	2017-05-08 17:51:49 -07:00
Yann Collet	7855366598	Updated ZSTD_freeCCtx() which can also contain streaming buffers now. Redirected ZSTD_freeCStream() towards it.	2017-05-08 17:15:00 -07:00
Yann Collet	fc5145955a	updated ZSTD_estimateCCtxSize() added a parameter streaming, to estimate memory allocation size when the CCtx is used for streaming (CStream). Note : this function is not able to estimate memory cost of a potential internal CDict which can only happen when starting with ZSTD_initCStream_usingDict()	2017-05-08 17:07:59 -07:00
Yann Collet	791d744279	Updated ZSTD_sizeof_CCtx() can now contain buffers if object used as CStream. ZSTD_sizeof_CStream() is now just a thin wrapper of ZSTD_sizeof_CCtx().	2017-05-08 16:17:30 -07:00
Yann Collet	0be6fd3429	merged CCtx and CStream as a single same object To be changed : ZSTD_sizeof_CCtx(), ZSTD_estimateCCtxSize()	2017-05-08 16:08:01 -07:00
Yann Collet	d47709b6ea	Merge pull request #654 from iburinoc/splittable [RFC] Splittable Format and API	2017-05-08 13:41:56 -07:00
Yann Collet	a00e9599f1	removed -g from DEBUGFLAGS It inflates binary sizes, which is negative for the Windows build. It also makes it impossible to check if 2 different source codes get nonetheless compiled to the same binary, since checksum will be different, due to integrated source code.	2017-05-04 17:24:29 -07:00
Yann Collet	606c04c228	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-05-02 12:13:52 -07:00
Yann Collet	072484a3bf	Merge pull request #683 from terrelln/odev [CLI] Make cover the default dictionary builder	2017-05-02 12:13:23 -07:00
Nick Terrell	f376d47c11	[CLI] Switch dictionary builder on CLI to cover	2017-05-02 11:18:27 -07:00
Nick Terrell	020b960e13	[cover] Make optimization faster	2017-05-02 11:02:48 -07:00
Nick Terrell	f2d9ef1dc0	[cover] Optimize case where d <= 8	2017-05-02 11:02:43 -07:00
Nick Terrell	865918dd04	Fix typo in zdict.h	2017-05-02 11:02:37 -07:00
Yann Collet	b184589c4c	minor code refactoring for clarity	2017-05-01 11:35:47 -07:00
Yann Collet	33c38b0925	fixed const in prototype, that Visual doesn't accept	2017-05-01 11:12:30 -07:00
Yann Collet	f39a6731ec	sync bitstream.h from fse library	2017-05-01 09:56:03 -07:00
Yann Collet	202082f285	sync bitstream from FSE project add assert into unsafe *_fast() variants	2017-04-28 17:00:31 -07:00
Yann Collet	89f50deec7	minor code refactoring clearer tables	2017-04-28 16:52:36 -07:00
Yann Collet	68a7d3d49a	added HUF_PUBLIC_API macro to huf.h to make it possible to control symbol visibility. Also : better separation and comments between "public" and "static" sections	2017-04-28 12:46:48 -07:00
Yann Collet	a51cab6e68	Merge pull request #678 from facebook/apiChange Breaking API Change around CDict	2017-04-28 10:02:45 -07:00
Yann Collet	29297c6751	Changed default level 18 (large input) Previous -18 : 4.7 MB/s, R:3.833 New -18 : 5.1 MB/s. R:3.825 It's a better fit within -17 (6.8 MB/s) and -19 (4.0 MB/s) The new level 18 also uses significantly less memory. And, it makes a good transition between level 17 (mml5) and level 19 (mml3). Up to now, there was no level with mml4. (note : minmatch setting can have a large impact on some (specific) datasets)	2017-04-27 17:44:01 -07:00
Yann Collet	a92cbb7004	Added a secondary test, checking dictID presence after setting noDictIdFLag=1	2017-04-27 15:08:56 -07:00
Yann Collet	d3694e6c70	removed C4204	2017-04-27 14:29:35 -07:00
Yann Collet	1c3ab0c77f	fixed init error on Visual 2008	2017-04-27 12:57:11 -07:00
Yann Collet	8b669535f8	bumped version number to v1.2.0	2017-04-27 12:50:20 -07:00
Yann Collet	77bf59ef50	added ZSTD_initCStream_usingCDict_advanced()	2017-04-27 11:43:04 -07:00
Yann Collet	f4bd857d81	created ZSTD_compress_usingCDict_advanced()	2017-04-27 11:31:55 -07:00
Yann Collet	69a54d138a	fixed compilation warning : declaration-after-statement	2017-04-27 01:11:26 -07:00
Yann Collet	31533bacce	Changed ZSTD_createCDict_advanced() It now only uses compressionParameters as argument. It produces many changes throughout user code, though hopefully they tend to be simple : just provide the cParams part from existing ZSTD_parameters. Some programs might depend on ZSTD_createCDict_advanced() to pass frame parameters. This change will force them to revisit this strategy and fix it, since frame parameters are effectively silently ignored in current version.	2017-04-27 00:29:04 -07:00
Yann Collet	768df129d2	changed ZSTD_compressBegin_usingCDict() No longer takes `pledgedSrcSize` as argument this is in line with similar functions ZSTD_compress_usingCDict() and ZSTD_initCStream_usingCDict().	2017-04-26 15:42:10 -07:00
Yann Collet	e42afbc6fa	Comply with suggested comments by @terrelln created FSE_CTABLE_SIZE() and FSE_DTABLE_SIZE()	2017-04-26 11:39:35 -07:00
Sean Purcell	7d37ca1d5b	Merge remote-tracking branch 'origin/dev' into splittable	2017-04-21 14:18:39 -07:00
Yann Collet	7271203bdb	transferred entropy scratch space from CCtx into workSpace Saved 6 KB	2017-04-20 23:21:19 -07:00
Yann Collet	a408645f50	made some room for entropy scratch space	2017-04-20 23:09:39 -07:00
Yann Collet	71aaa32c3c	transferred FSE tables from CCtx into workspace Saved 5 KB from CCtx	2017-04-20 23:03:38 -07:00
Yann Collet	71ddeb67b1	made room in workspace for FSE tables still need to be transferred from CCtx into workspace	2017-04-20 22:54:54 -07:00
Yann Collet	a34a39c183	changed size evaluation of entropy tables so that memcpy() does no longer depends on fse pointer being a static table	2017-04-20 18:26:25 -07:00
Yann Collet	7bb60b17d8	init entropy table pointers only once per workSpace resize	2017-04-20 17:38:56 -07:00
Yann Collet	e6fa70a0a1	reorganized ZSTD_resetCCtx_internal() clearer separation between variables and buffers clearer buffers category kept static buffers at the beginning, favoring cache locality (it will be easier to add FSE tables there later) This break a few assumptions that hashTable was always at the beginning. This is fixed. And remaining assumptions (namely that tables stand next to each other in memory) are now tested with assert.	2017-04-20 17:28:31 -07:00
Yann Collet	c17e020c9a	disable assert when compiling paramgrill paramgrill is a benchmark calibration function. Speed accuracy is critical, it cannot be altered by assert.	2017-04-20 12:50:02 -07:00
Yann Collet	16f9c572fc	Merge branch 'dev' into compressionFlow	2017-04-20 11:16:40 -07:00
Yann Collet	e348dad305	minor long line reformatting	2017-04-20 11:14:13 -07:00
Yann Collet	e847730452	slightly refined README comments on lib-mt	2017-04-18 23:15:28 -07:00
Yann Collet	2c5514c759	fixed ZSTDMT_initCStream_advanced() Must use the new ZSTD_compressBegin_usingCDict_advanced() to enforce correct frame parameters	2017-04-18 22:52:41 -07:00
Sean Purcell	98cf7fcb2a	Update README	2017-04-18 17:03:37 -07:00
Sean Purcell	0f7bd772e6	Update seekable API to simplify IO	2017-04-18 16:48:30 -07:00
Yann Collet	a4cab80183	added ZSTD_copyCCtx_internal() which respects provided fParams.	2017-04-18 14:54:54 -07:00
Sean Purcell	ca6fae7808	Add MT enabled targets for libzstd	2017-04-18 14:13:01 -07:00
Yann Collet	30fb499208	Changed ZSTD_resetCCtx_advanced() into ZSTD_resetCCtx_internal() for naming consistency : _advanced() can be invoked while _internal() are strictly static	2017-04-18 14:08:50 -07:00
Yann Collet	715b9aa113	created ZSTD_compressBegin_usingCDict_advanced()	2017-04-18 13:55:53 -07:00
Yann Collet	af4f45b682	Improved code comments for block functions	2017-04-18 03:17:44 -07:00
Yann Collet	4f818182b8	clarified frame parameters for ZSTD_compress*_usingCDict() created ZSTD_compressBegin_usingCDict_internal(), which gives direct control to frame Parameters. ZSTD_resetCStream_internal() now points into it.	2017-04-17 18:29:06 -07:00
Yann Collet	c47c68f6ca	proper evaluation of Huffman CTable size	2017-04-17 16:14:21 -07:00
Sean Purcell	5ee1135f30	s/chunk/frame/	2017-04-12 11:15:50 -07:00
Yann Collet	88009a8ba2	removed srcSize control from CStream since it's already done from lower bufferless API level	2017-04-12 00:51:24 -07:00
Yann Collet	20d5e03893	content size is controlled at bufferless level so it's active for all entry points Also : added relevant test (wrong content size) in fuzzer	2017-04-11 18:34:02 -07:00
Sean Purcell	d048fefef7	Move seekable format content to /contrib	2017-04-11 14:38:56 -07:00
Sean Purcell	45f3bc4801	Add format specification	2017-04-11 13:53:09 -07:00
Sean Purcell	a3b7c22604	Make seekable streams work w/ small buffers, misc fixes	2017-04-11 13:53:09 -07:00
Sean Purcell	c3ba15e48f	Seekable compression demo	2017-04-11 13:53:09 -07:00
Yann Collet	4ee6b15dac	force contentSizeFlag=0 when using ZSTD_initCStream_usingCDict() because by definition srcSize is not known when using this prototype. added relevant test Note : this use was already working, because at a later stage (both ZSTD_compressBegin_usingCDict() and ZSTD_copyCCtx()) pledgedSrcSize=0 is translated into "unknown", no matter the frame parameter. This is not correct, but of little importance, as the medium term plan is to no longer set fParams within CDict	2017-04-11 11:59:44 -07:00
Yann Collet	ab9162ebb4	simplified call graph by calling ZSTD_compressBegin_internal() instead of ZSTD_compressBegin_advanced()	2017-04-11 10:46:20 -07:00
Yann Collet	e88034fe26	simplified ZSTD_initCStream*() flow all variants converge towards ZSTD_initCStream_stage2()	2017-04-10 22:24:02 -07:00
Yann Collet	4b987ad8ce	Introduce ZSTD_initCStream_internal() This is now the regroup point for ZSTD_initCStream*() functions ZSTD_initCStream_advanced() now properly checks for parameters validity. Also : added <assert.h> usage inside zstd_compress.c Needs ZSTD_DEBUG=1 macro to be triggered. Will be triggered by default from `tests` directory	2017-04-10 17:50:44 -07:00
Yann Collet	0181fef545	ensure cctx internal buffer is correctly sized in case of memory error	2017-04-06 01:25:26 -07:00
Yann Collet	36c2a03757	updated comments for ZSTD_resetCStream()	2017-04-05 22:06:21 -07:00
Yann Collet	003a244324	DStream : ensure correct size of internal buffers in case of error	2017-04-05 15:28:56 -07:00
Yann Collet	02d37aa1c1	ensure correct size of internal buffers in case of error	2017-04-05 14:53:51 -07:00
Nick Terrell	405d2a1027	Explicitly convert scratchBuffer to unsigned*	2017-04-04 16:35:31 -07:00
Nick Terrell	16a739cab0	Switch call of FSE_count() to FSE_count_wksp()	2017-04-04 16:17:21 -07:00
Yann Collet	7cf78f1be7	Protects ZSTD_compressBegin_usingCDict() vs NULL cdict dereference Will issue an error (GENERIC) is cdict==NULL	2017-04-04 12:38:14 -07:00
Nick Terrell	26b046a7c4	Remove unnecessary dictID store	2017-04-03 21:46:28 -07:00
Nick Terrell	39a6cc5172	Make ZSTD_compress_usingCDict() respect contentSizeFlag	2017-04-03 21:09:55 -07:00
Nick Terrell	62ecad3819	Fix ZSTD_initCStream_usingCDict() to use dictionary	2017-04-03 21:05:59 -07:00
Yann Collet	30c7698970	optimize ZSTDMT_compress() memory usage does no longer allocate temporary buffers when there is enough room in dstBuffer to decompress directly there. (previous method would skip that for 1st chunk only). Also : fix ZSTD_compressBound() for small srcSize	2017-03-31 18:27:03 -07:00
Yann Collet	3f75d52527	Changed ZSTD_compressBound() required so that if Total = A+B compressBound(Total) <= compressBound(A) + compressBound(B) under condition of a minimum size for A and B Will help for ZSTDMT_compress() memory allocation	2017-03-31 17:11:38 -07:00
Yann Collet	7b70a1969e	Merge branch 'dev' into zstdmt	2017-03-31 16:22:33 -07:00
Yann Collet	53203e7c38	Merge pull request #640 from facebook/memAccess Changed memory strategy to __packed for gcc	2017-03-31 15:49:12 -07:00
Yann Collet	eea7858e2b	fixed minor warnings in debug code	2017-03-30 16:47:19 -07:00
Yann Collet	34cc487d05	overlap at full windowSize for max compression level as it provides max compression ratio	2017-03-30 16:23:22 -07:00
Yann Collet	458e955c23	improved ZSTDMT_compress() Use a bit more threads by default. Uses overlap segments to boost compression ratio (like the streaming variant)	2017-03-30 15:51:58 -07:00
Yann Collet	6476c51b86	Merge pull request #637 from facebook/zstdmt Zstdmt	2017-03-30 14:18:37 -07:00
Yann Collet	274f59919d	Changed memory strategy to __packed for gcc Method 1 __packed is always as good or better than memcpy(). But it's not portable, as it depends on compiler extension. For gcc, __pakced directive works fine. Furthermore, gcc has serious performance issues with memcpy() on ARM 32 bits. See #620	2017-03-30 12:52:14 -07:00
Nick Terrell	5152fb2cb2	Convert all tabs to spaces	2017-03-29 18:51:58 -07:00
Yann Collet	ca5a8bbe36	re-added patch ...	2017-03-29 17:15:27 -07:00
Yann Collet	2e2e78de47	removed unnecessary restriction on minmatchLength it's now transparently translated to nearest value when unsupported (7->6) (3->4)	2017-03-29 16:02:47 -07:00
Yann Collet	26769d88bc	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-03-29 15:21:30 -07:00
Yann Collet	933ce4a1dd	fix : minmatch 7 conversion minmatch 7 now converted to minmatch 6 for strategies which do not support 7 Used to folded into "default", which applied minmatch 4	2017-03-29 14:35:38 -07:00
Sean Purcell	4708394bdd	Remove extra 'F' from skippable magic mask	2017-03-29 11:46:57 -07:00
Yann Collet	4cf0093571	restored bonus rule	2017-03-26 14:51:00 -07:00
Yann Collet	69017bf253	Merge branch 'dev' into LegacyDictBuilder	2017-03-26 14:39:13 -07:00
Yann Collet	582760818f	minor refactor add const changed if for easier to add new conditions	2017-03-26 03:04:56 -07:00
Yann Collet	858f72eeb8	fixed dictBuilder issue dictionary loading would fail during entropy analysis	2017-03-26 02:50:00 -07:00
Yann Collet	ecee9f2ef8	fixed conversion warnings	2017-03-26 00:59:14 -07:00
Yann Collet	0246d5c531	Merge pull request #630 from facebook/advancedCliCommands changed advanced commands --maxdict= and --dictID=	2017-03-26 00:13:35 -07:00
Yann Collet	4c41d37fcc	changed test for new syntax --dictID= and --maxdict=	2017-03-24 18:36:56 -07:00
Yann Collet	d41f707e88	minor improvement : remove duplicates with 1 char prefix difference	2017-03-24 17:56:45 -07:00
Yann Collet	b364caf455	Merge pull request #628 from facebook/dictBuilder_limits Ensure all limits derived from same constants	2017-03-24 17:54:42 -07:00
Yann Collet	2238870eb6	Merge pull request #625 from facebook/loadCDict limited CDict acceptation criteria to be the same as DDict	2017-03-24 16:06:20 -07:00
Yann Collet	96aa3019b2	changed advanced commands --maxdict= and --dictID= now works with the `=` variant, which is the recommended one. Old variant `--dictID #` still works, for compatibility with existing scripts. Long term objective is to remove the old variant..	2017-03-24 16:04:29 -07:00
Yann Collet	9da3b215ec	Ensure all limits derived from same constants Now uses ZDICT_DICTSIZE_MIN and ZDICT_CONTENTSIZE_MIN from zdict.h. Also : reduced values to 256 and 128 respectively	2017-03-24 15:02:09 -07:00
Yann Collet	ebe9963cf6	Merge pull request #626 from facebook/stricterDictBuilder dictBuilder fails to create dictionary on certain input	2017-03-24 14:27:28 -07:00
Yann Collet	16a0b10781	fixed ZSTD_loadZstdDictionary() forgot to add the dictionary content (tests were not failing, just compressing less). Also : added size protections when adding dict content since hc/bt table filling would fail if size < 8	2017-03-24 12:46:46 -07:00
Yann Collet	23776ce290	fixed ERROR_GENERIC on dstSize_tooSmall required by users which depends on this error code to size dest buffer	2017-03-23 17:59:50 -07:00
Yann Collet	f332ece468	dictBuilder fails to create dictionary on certain input Properly expressed with an error code (see zstd_errors.h) and a cli return code != 0	2017-03-23 16:24:02 -07:00
Yann Collet	bea78e8fc2	limited CDict acceptation criteria to be the same as DDict	2017-03-23 15:46:06 -07:00
Sean Purcell	042ba122ae	Change g_displayLevel to int and fix DISPLAYUPDATE flush	2017-03-23 11:21:59 -07:00
Nick Terrell	eaf69b07f0	Zero pointers after freeing	2017-03-21 13:20:59 -07:00
Yann Collet	f3dfcdccd1	bump version number	2017-03-21 12:18:28 -07:00
Przemyslaw Skibinski	8086d623ca	updated build of Windows packages	2017-03-18 11:19:09 +01:00
Yann Collet	7e35b352c6	Merge pull request #602 from iburinoc/doc Add functions missing from manual, and fix parameter alignment	2017-03-14 14:08:41 -07:00
Sean Purcell	dec2b96536	Add functions missing from manual, and fix parameter alignment	2017-03-14 11:24:09 -07:00
Sean Purcell	9830aeeea6	Fix legacy support=0 case and accidental double include of version headers	2017-03-13 17:19:37 -07:00
Sean Purcell	120df494e9	Update builds to not support legacy v01-v03	2017-03-13 14:44:08 -07:00
Sean Purcell	334cb34edb	ZSTD_LEGACY_SUPPORT defines lowest supported version	2017-03-13 14:32:30 -07:00
Sean Purcell	784082f49c	Change gotoDict type to uPtrDiff	2017-03-10 10:34:45 -08:00
Sean Purcell	8fe5c6862c	Fix undefined behaviour in decompressor	2017-03-10 10:17:42 -08:00
Nick Terrell	f35ef5c8e9	Whitespace only: tabs to spaces	2017-03-09 12:51:33 -08:00
Nick Terrell	eeb31eed39	s/ZSTD_btopt2/ZSTD_btultra/g	2017-03-09 11:44:25 -08:00
Nick Terrell	e65aab8e0f	Remove 'mem.h' dependency from ZSTD_WINDOWLOG_MAX	2017-03-08 15:40:13 -08:00
Yann Collet	a41a4ed39a	Merge pull request #594 from terrelln/bugs Small fixes	2017-03-08 14:56:07 -08:00
Nick Terrell	81512e9ebe	Avoid '#define inline /* ... */' Take definition of `FORCE_INLINE` from `zstd_internal.h`.	2017-03-08 14:00:21 -08:00
Nick Terrell	e06c303475	Fix ZSTD_sizeof_CStream()	2017-03-08 13:45:10 -08:00
Sean Purcell	881abe44f1	Reduce point at which we reduce offsets to protect against UB	2017-03-07 16:58:08 -08:00
Sean Purcell	3437bf2feb	Add build targets to the Makefile, and update CircleCI tests	2017-03-06 15:05:02 -08:00
Yann Collet	8b1d004031	added -Wformat-security flag, as recommended by @pixelb	2017-03-05 21:17:32 -08:00
Yann Collet	1f2c95c5f3	minor code refactor in HUF module	2017-03-05 21:07:20 -08:00
Yann Collet	5d801278dc	Merge pull request #586 from terrelln/repeat-heuristic Always check Huffman tables for ZSTD_lazy+	2017-03-03 19:38:56 -08:00
Nick Terrell	54c4babd8f	Always check Huffman tables for ZSTD_lazy+ The compressor always reuses the existing Huffman table if the literals size is at most 1 KiB. If the compression strategy is `ZSTD_lazy` or stronger always check to see if reusing the previous table or creating a new table is better. This doesn't yet weigh in decompression speed. I don't want to add any heuristics there until I have real data to work with to ensure that the heuristic works for at least one use case, preferably more.	2017-03-03 16:49:38 -08:00
Yann Collet	1af570bd05	Merge pull request #585 from terrelln/cover-leak Fix COVER_optimizeTrainFromBuffer() resource leaks	2017-03-02 20:46:35 -08:00
Yann Collet	f44b55c18d	Merge pull request #584 from terrelln/huff-repeat Allow compressor to repeat Huffman tables	2017-03-02 17:20:11 -08:00
Yann Collet	fe5d27062e	disable prefetch-decode for 32-bits target This decoder variant is detrimental to x86 architecture likely due to register pressure. Note that the variant is disabled for all 32-bits targets. It's unclear if it would help for different architectures, such as ARM, MIPS or PowerPC.	2017-03-02 17:09:21 -08:00
Nick Terrell	d051cd5b43	Use workspace for count and CTable	2017-03-02 16:38:07 -08:00
Nick Terrell	976e325b2e	Fix COVER_optimizeTrainFromBuffer() resource leaks Thanks to @nemequ for reporting the resource leaks.	2017-03-02 15:54:39 -08:00
Sean Purcell	553f67e0c1	Remove 'generic' inline strategy Seems to avoid performance loss for compression. Same strategy tested on decompression side, did not appear to improve speed.	2017-03-02 15:18:13 -08:00
Sean Purcell	3d95925a59	Merge remote-tracking branch 'origin/dev' into m32	2017-03-02 15:17:56 -08:00
Nick Terrell	a419777eb1	Allow compressor to repeat Huffman tables * Compressor saves most recently used Huffman table and reuses it if it produces better results. * I attempted to preserve CPU usage profile. I intentionally left all of the existing heuristics in place. There is only a speed difference on the second block and later. When compressing large enough blocks (say >= 4 KiB) there is no significant difference in compression speed. Dictionary compression of one block is the same speed for blocks with literals <= 1 KiB, and after that the difference is not very significant. * In the synthetic data, with blocks 10 KB or smaller, most blocks can't use repeated tables because the previous block did not contain a symbol that the current block contains. Once blocks are about 12 KB or more, most previous blocks have valid Huffman tables for the current block, and the compression ratio and decompression speed jumped. * In silesia blocks as small as 4KB can frequently reuse the previous Huffman table (85%), but it isn't as profitable, and the previous Huffman table only gets used about 3% of the time. * Microbenchmarks show that `HUF_validateCTable()` takes ~55 ns and `HUF_estimateCompressedSize()` takes ~35 ns. They are decently well optimized, the first versions took 90 ns and 120 ns respectively. `HUF_validateCTable()` could be twice as fast, if we cast the `HUF_CElt` to a `U32` and compare to 0. However, `U32` has an alignment of 4 instead of 2, so I think that might be undefined behavior. * I've ran `zstreamtest` compiled normally, with UASAN and with MSAN for 4 hours each. The worst case for the speed difference is a bunch of small blocks in the same frame. I modified `bench.c` to compress the input in a single frame but with blocks of the given block size, set by `-B`. Benchmarks on level 1: \| Program \| Block size \| Corpus \| Ratio \| Compression MB/s \| Decompression MB/s \| \|-----------\|------------\|-----------\|-------\|------------------\|--------------------\| \| zstd.base \| 256 \| synthetic \| 2.364 \| 110.0 \| 297.0 \| \| zstd \| 256 \| synthetic \| 2.367 \| 108.9 \| 297.0 \| \| zstd.base \| 256 \| silesia \| 2.204 \| 93.8 \| 415.7 \| \| zstd \| 256 \| silesia \| 2.204 \| 93.4 \| 415.7 \| \| zstd.base \| 512 \| synthetic \| 2.594 \| 144.2 \| 420.0 \| \| zstd \| 512 \| synthetic \| 2.599 \| 141.5 \| 425.7 \| \| zstd.base \| 512 \| silesia \| 2.358 \| 118.4 \| 432.6 \| \| zstd \| 512 \| silesia \| 2.358 \| 119.8 \| 432.6 \| \| zstd.base \| 1024 \| synthetic \| 2.790 \| 192.3 \| 594.1 \| \| zstd \| 1024 \| synthetic \| 2.794 \| 192.3 \| 600.0 \| \| zstd.base \| 1024 \| silesia \| 2.524 \| 148.2 \| 464.2 \| \| zstd \| 1024 \| silesia \| 2.525 \| 148.2 \| 467.6 \| \| zstd.base \| 4096 \| synthetic \| 3.023 \| 300.0 \| 1000.0 \| \| zstd \| 4096 \| synthetic \| 3.024 \| 300.0 \| 1010.1 \| \| zstd.base \| 4096 \| silesia \| 2.779 \| 223.1 \| 623.5 \| \| zstd \| 4096 \| silesia \| 2.779 \| 223.1 \| 636.0 \| \| zstd.base \| 16384 \| synthetic \| 3.131 \| 350.0 \| 1150.1 \| \| zstd \| 16384 \| synthetic \| 3.152 \| 350.0 \| 1630.3 \| \| zstd.base \| 16384 \| silesia \| 2.871 \| 296.5 \| 883.3 \| \| zstd \| 16384 \| silesia \| 2.872 \| 294.4 \| 898.3 \|	2017-03-02 13:27:52 -08:00
Yann Collet	fdb0fd34b3	Merge pull request #583 from terrelln/set-dictid Set dictID to 0 for content only dictionaries	2017-03-02 13:15:31 -08:00
Nick Terrell	3475b9b431	Set dictID to 0 for content only dictionaries	2017-03-02 12:33:02 -08:00
Sean Purcell	d44703d145	Offsets >= 32MB in 32-bits mode	2017-03-01 16:27:56 -08:00
Yann Collet	76f0494089	xxhash can be included twice in any order Previously, followed by : would fail to include the static definitions, because the second include was simply skipped by guard macro. Now it works as intended : the missing static part is included during the second include.	2017-03-01 13:29:29 -08:00
Yann Collet	4bcc69b761	solves warnings when compiling with global XXH_STATIC_LINKING_ONLY XXH_STATIC_LINKING_ONLY protection macro is intended to be triggered just before the include. The main idea is to keep this setting local : user module shall explicitly understand and accept the static linking restriction which becomes transparent when triggering the macro at project level. Global definition also triggers redefinition warnings for user modules which do locally define the macro. This new version compiles lib and cli without warning when the macro is set globally. That's not a scenario to be recommended, since it trades a local effect for a global one, but it was easy enough to provide from zstd side.	2017-03-01 11:33:25 -08:00
Yann Collet	31432cc57d	Merge pull request #579 from iburinoc/multiframe Check to ensure ddict isn't null before dereference	2017-03-01 11:02:04 -08:00
Sean Purcell	a81d4fee58	Check to ensure ddict isn't null before dereference	2017-02-28 15:28:29 -08:00
Yann Collet	22d79762ef	fixed multi frames	2017-02-28 02:12:42 -08:00
Yann Collet	a33ae64204	fixed decoding skippable frames	2017-02-28 01:15:28 -08:00
Yann Collet	d1760113ec	Improved speed of ZSTD_decompressStream() When ZSTD_decompressStream() detects that there is enough space in dst to complete decompression in a single pass, delegates to ZSTD_decompress(), for an extra ~5% speed boost	2017-02-28 00:14:28 -08:00
Yann Collet	a81c2e7e44	Merge pull request #573 from facebook/ddict Improved DDict memory usage	2017-02-27 20:54:42 -08:00
Yann Collet	dccd6b6f65	cli : fix : --rm is silent when input is stdin previously, app would produce an error message, and stop.	2017-02-27 15:57:50 -08:00
Yann Collet	0b9b894b2d	reduced ZSTD_DDict memory usage saved 128 KB	2017-02-27 00:27:30 -08:00
Yann Collet	bd7fa21deb	added ZSTD_refDDict() Now DDict does no longer depends on DCtx duplication	2017-02-26 14:43:07 -08:00
Yann Collet	d73eebc00f	loadEntropy works on new ZSTD_entropy_t type	2017-02-26 10:16:42 -08:00
Yann Collet	8629f0e41f	created entropy structure type	2017-02-25 18:33:31 -08:00
Yann Collet	8dff956dbf	Added DDict unit test in fuzzer also : slightly modified loadEntropy : know src must points at start of dictionary	2017-02-25 10:11:15 -08:00
Yann Collet	14312d833e	zstdmt : fix : loading prefix from previous segments There used to be a (very small) chance that loading prefix from previous segment would be confused with a real zstd dictionary. For that to happen, the prefix needs to start with the same value as dictionary magic. That's 1 chance in 4 billions if all values have equal probability. But in fact, since some values are more common (0x00000000 for example) others are less common, and dictionary magic was selected to be one of them, so probabilities are likely even lower. Anyway, this risk is no down to zero by adding a new CCtx parameter : ZSTD_p_forceRawDict Current parameter policy : the parameter "stick" to its CCtx, so any dictionary loading after ZSTD_p_forceRawDict is set will be loaded in "raw" ("content only") mode, even if CCtx is re-used multiple times with multiple different dictionary. It's up to the user to reset this value differently if it needs so.	2017-02-23 23:42:12 -08:00
Yann Collet	831b4890ce	minor tests/Makefile refactoring and update of zstd_manual,html	2017-02-23 23:09:10 -08:00
Yann Collet	cce8d8ba2b	Merge pull request #560 from iburinoc/findcompressedsize Change name to to findFrameCompressedSize and add skippable support	2017-02-23 13:39:23 -08:00
Sean Purcell	83038d236a	Fix bug in FSE distribution normalization	2017-02-22 13:52:48 -08:00
Sean Purcell	64417cd2ff	Describe ambiguity around skippable frames	2017-02-22 13:29:01 -08:00
Sean Purcell	9757cc811b	Update comment	2017-02-22 12:28:21 -08:00
Sean Purcell	9050e1925e	Change name to to findFrameCompressedSize and add skippable support	2017-02-22 12:12:34 -08:00
Przemyslaw Skibinski	d8114e5802	zstd_compress.c: fix memory leaks	2017-02-21 18:59:56 +01:00
Anders Oleson	517577bf53	spelling fixes in comments i.e. occurred labeled Huffman	2017-02-20 12:08:59 -08:00
Sean Purcell	6b010dec80	execSequence copies up to 2*WILDCOPY_OVERLENGTH extra	2017-02-16 12:05:40 -08:00
Sean Purcell	887eaa9e21	Fix wildcopy overwriting data still in window	2017-02-15 16:43:45 -08:00
Yann Collet	2252d29a5a	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-02-15 12:00:50 -08:00
Yann Collet	4596037042	updated fse version feature minor refactoring (removing FSE_abs()) also : fix a few minor issues recently introduced in examples	2017-02-15 12:00:03 -08:00
Yann Collet	44f82d781f	Merge pull request #545 from terrelln/force-window [zstdmt] Fix MSAN failure with ZSTD_p_forceWindow	2017-02-15 10:20:15 -08:00
Yann Collet	f0b9a8dddb	Merge pull request #547 from inikep/dev11 Avoid fseek()'s 2GiB barrier with MacOS and *BSD	2017-02-14 12:29:00 -08:00
Yann Collet	9696bfc2ad	Merge pull request #544 from ds77/avoid-empty Portable way to avoid empty unit warning in threading.c	2017-02-14 00:54:55 -08:00
Przemyslaw Skibinski	b876b96ce1	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11	2017-02-14 09:26:03 +01:00
Nick Terrell	ecf90ca24b	[zstdmt] Fix MSAN failure with ZSTD_p_forceWindow Reproduction steps: ``` make zstreamtest CC=clang CFLAGS="-O3 -g -fsanitize=memory -fsanitize-memory-track-origins" ./zstreamtest -vv -t4178 -i4178 -s4531 ``` How to get to the error in gdb (may be a more efficient way): * 2 breaks at zstd_compress.c:2418 -- in ZSTD_compressContinue_internal() * 2 breaks at zstd_compress.c:2276 -- in ZSTD_compressBlock_internal() * 1 break at zstd_compress.c:1547 Why the error occurred: When `zc->forceWindow == 1`, after calling `ZSTD_loadDictionaryContent()` we have `zc->loadedDictEnd == zc->nextToUpdate == 0`. But, we've really loaded up to `iend` into the dictionary. Then in `ZSTD_compressBlock_internal()` we see that `current > zc->nextToUpdate + 384`, so we load the last 192 bytes a second time. In this case the bytes we are loading are a block of all 0s, starting in the previous block. So when we are loading the last 192 bytes, we find a `match` in the future, 183 bytes beyond `ip`. Since the block is all 0s, the match extends to the end of the block. But in `ZSTD_count()` we only check that `pIn < pInLoopLimit`, but since `pMatch > pIn`, `pMatch` eventually points past the end of the buffer, causing the MSAN failure. The fix: The line changed sets sets `zc->nextToUpdate` to the end of the dictionary. This is the behavior that existed before `ZSTD_p_forceWindow` was introduced. This fixes the exposing test case. Since the code doesn't fail without `zc->forceWindow`, it makes sense that this works. I've run the command `./zstreamtest -T2mn` 64 times without failures. CI should also verify nothing obvious broke.	2017-02-13 19:11:22 -08:00
Yann Collet	58af614ef2	push version and NEWS to v1.1.4	2017-02-13 18:32:44 -08:00
ds77	08e6a88a97	avoid empty translation unit warning without #pragma	2017-02-14 00:46:47 +01:00
Przemyslaw Skibinski	09c8e5390d	__builtin_bswap requires gcc 4.3+	2017-02-13 12:45:53 +01:00
Sean Purcell	d7bfcac18a	Expose frameSrcSize to experimental API	2017-02-10 11:55:44 -08:00
Sean Purcell	5069b6c2c3	Merge branch 'dev' into multiframe	2017-02-10 10:08:55 -08:00
Yann Collet	bbba42acd1	Merge pull request #537 from terrelln/small-bugs Fix small bugs	2017-02-10 04:35:43 -08:00
Yann Collet	a28c34cb7a	Merge pull request #538 from iburinoc/errorstring Fix ZSTD_getErrorString and add tests	2017-02-10 03:59:56 -08:00
Sean Purcell	269b2cd3d8	Documentation updates	2017-02-09 13:25:30 -08:00
Sean Purcell	2db7249265	Make pledgedSrcSize meaning clear for other functions - Added tests - Moved new size functions to static link only	2017-02-09 11:49:58 -08:00
Nick Terrell	545987996a	Fix deprecation warnings for clang with C++14	2017-02-08 17:38:17 -08:00
Sean Purcell	e0b3265e87	Fix ZSTD_getErrorString and add tests	2017-02-08 17:28:49 -08:00
Sean Purcell	0f5c95af44	Disambiguate pledgedSrcSize == 0 - Modify ZSTD CLI to only set contentSizeFlag if it _knows_ the size - Change pzstd to stop setting contentSizeFlag without accurate pledgedSrcSize	2017-02-08 15:12:46 -08:00
Sean Purcell	ba2ad9f25c	ZSTD_decompress now handles multiple frames	2017-02-08 14:50:10 -08:00
Sean Purcell	4e709712e1	Decompressed size functions now handle multiframes and distinguish cases - Add ZSTD_findDecompressedSize - Traverses multiple frames to find total output size - Add ZSTD_getFrameContentSize - Gets the decompressed size of a single frame by reading header - Deprecate ZSTD_getDecompressedSize	2017-02-08 14:50:10 -08:00
Przemyslaw Skibinski	cdf5a7bd9f	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11	2017-02-08 13:49:35 +01:00
Nick Terrell	71c5263c00	Attribute cover dictionary code	2017-02-07 11:35:07 -08:00
Przemyslaw Skibinski	7060aee8c2	platform.h added to build_package.bat	2017-02-06 19:43:13 +01:00
Yann Collet	b54e235bf3	fixed Mac OS-X specific directory in $(RM) list these directories are now removed with -r command	2017-02-05 10:22:58 -08:00
Yann Collet	c2a4632789	release builds use less debug symbols and warnings release build are triggered through either `make`, or their specific target `make zstd-release` and `make lib-release`.	2017-02-02 20:54:41 -08:00
Yann Collet	48bed91606	Merge pull request #527 from facebook/zstdmt zstdmt refinements	2017-01-31 16:36:46 -08:00
Yann Collet	b2e1b3d670	fixed overlapLog==0 => no overlap	2017-01-30 14:54:46 -08:00
Yann Collet	3672d06d06	zstdmt : section size is set to be a minimum of overlapSize the minimum size condition size is applied transparently (no warning, no error) like previous minimum section size condition (1 KB) which still applies.	2017-01-30 13:35:45 -08:00
Yann Collet	88df1aed61	changed advanced parameter overlapLog Follows a positive logic (increasing value => increasing overlap) which is easier to use	2017-01-30 11:00:00 -08:00
Yann Collet	b5fd15ccb2	fixed : legacy decoders v04 and v05	2017-01-30 10:45:58 -08:00
Yann Collet	cc3d1bc262	Merge pull request #525 from terrelln/covermt Multithreaded COVER dictionary training	2017-01-30 10:15:33 -08:00
Nick Terrell	43474313f8	Fix documentation about memory usage	2017-01-27 18:43:05 -08:00
Nick Terrell	b42dd27ef5	Add include guards and extern C	2017-01-27 16:00:19 -08:00
Yann Collet	f6d4a786fc	reduced zstdmt latency when using small custom section sizes with high compression levels Previous version was requiring a fairly large initial amount of input data before starting to create compression jobs. This new version starts the process much sooner.	2017-01-27 15:55:30 -08:00
Nick Terrell	c43c27127f	Merge branch 'dev' into buck * dev: updated NEWS fixed MSAN warnings in legacy decoders Fix cmake build updated NEWS Edits as per comments, and change wildcard 'X' to '?' Fix Visual Studios project Fix pool.c threading.h import Fix zstdmt_compress.h include Fixed commented issues Updated format specification to be easier to understand improved #232 fix Fixed https://github.com/facebook/zstd/issues/232 .travis.yml: different tests for "master" branch .travis.yml: optimized order of short tests .travis.yml: test jobs 12-15 JOB_NUMBER -eq 9 improved ZSTD_compressBlock_opt_extDict_generic	2017-01-27 12:05:48 -08:00
Nick Terrell	2fe9126591	Add multithread support to COVER	2017-01-27 11:56:02 -08:00
Yann Collet	609c123a01	Merge pull request #522 from terrelln/benchmt Fix some includes	2017-01-27 11:40:25 -08:00
Yann Collet	cafdd31a38	fixed MSAN warnings in legacy decoders In some extraordinary circumstances, *Length field can be generated from reading a partially uninitialized memory segment. Data is correctly identified as corrupted later on, but the read taints some later pointer arithmetic operation.	2017-01-27 10:44:03 -08:00
Nick Terrell	9c018cc140	Add BUCK files for Nuclide support	2017-01-27 10:43:12 -08:00
Przemyslaw Skibinski	29157320fb	improved ZSTD_compressBlock_opt_extDict_generic	2017-01-27 10:43:02 -08:00
Nick Terrell	e628eaf87a	Fix pool.c threading.h import	2017-01-26 15:29:10 -08:00
Yann Collet	717c65d690	Merge pull request #519 from inikep/dev11 Dev11	2017-01-26 14:23:44 -08:00
Yann Collet	ef33d00532	fixed : ZSTD_setCCtxParameter() properly exposed in DLL	2017-01-26 12:24:21 -08:00
Yann Collet	4a62f79ec9	fixed clang documentation warning	2017-01-26 09:16:56 -08:00
Yann Collet	8dafb1acf5	CLI : automatically set overlap size to max (windowSize) for max compression level	2017-01-25 17:01:13 -08:00
Yann Collet	06e7697f96	added test of new parameter ZSTD_p_forceWindow	2017-01-25 16:39:03 -08:00
Yann Collet	bb0027405a	fixed zstdmt corruption issue when enabling overlapped sections see Asana board for detailed explanation on why and how to fix it	2017-01-25 16:25:38 -08:00
Yann Collet	943cff9c37	fixed zstdmt cli freeze issue with large nb of threads fileio.c was continually pushing more content without giving a chance to flush compressed one. It would block the job queue when input data was accumulated too fast (requiring to define many threads). Fixed : fileio flushes whatever it can after each input attempt.	2017-01-25 12:35:19 -08:00
Yann Collet	dc8dae596a	overlapped section, for improved compression Sections 2+ read a bit of data from previous section in order to improve compression ratio. This also costs some CPU, to reference read data. Read data is currently fixed to window>>3 size	2017-01-24 22:32:12 -08:00
Yann Collet	f14a669054	refactor job creation code shared accross ZSTDMT_{compress,flush,end}Stream(), for easier maintenance	2017-01-24 17:41:49 -08:00
Yann Collet	512cbe8c10	zstdmt cli and API allow selection of section sizes By default, section sizes are 4x window size. This new setting allow manual selection of section sizes. The larger they are, the (slightly) better the compression ratio, but also the higher the memory allocation cost, and eventually the lesser the nb of possible threads, since each section is compressed by a single thread. It also introduces a prototype to set generic parameters, ZSTDMT_setMTCtxParameter() The idea is that it's possible to add enums to extend the list of parameters that can be set this way. This is more long-term oriented than a fixed-size struct. Consider it as a test.	2017-01-24 17:08:53 -08:00
Yann Collet	3488a4a473	ZSTDMT now supports frame checksum	2017-01-24 11:48:40 -08:00
Przemyslaw Skibinski	96f152f708	improved ZSTD_compressBlock_opt_extDict_generic	2017-01-24 13:18:50 +01:00
Yann Collet	94364bf87a	refactor ZSTDMT streaming flush code now shared by both ZSTDMT_compressStream() and ZSTDMT_flushStream()	2017-01-23 11:50:44 -08:00
Yann Collet	1cbf251e43	ZSTDMT streaming : fall back to (regular) single thread mode when nbThreads==1	2017-01-23 01:43:58 -08:00
Yann Collet	84581ff8d7	ZSTDMT_compressCCtx : fallback to single-thread mode when nbChunks==1	2017-01-23 01:20:27 -08:00
Yann Collet	1a2547f654	ZSTDMT_compressStream() becomes blocking when required to ensure forward progresses In some (rare) cases, job list could be blocked by a first job still being processed, while all following ones are completed, waiting to be flushed. In such case, the current job-table implementation is unable to accept new job. As a consequence, a call to ZSTDMT_compressStream() can be useless (nothing read, nothing flushed), with the risk to trigger a busy-wait on the caller side (needlessly loop over ZSTDMT_compressStream() ). In such a case, ZSTDMT_compressStream() will block until the first job is completed and ready to flush. It ensures some forward progress by guaranteeing it will flush at least a part of the completed job. Energy-wasting busy-wait is avoided.	2017-01-22 23:49:52 -08:00
Yann Collet	c593348722	ZSTDMT_initCStream_usingDict() can outlive dict Like ZSTD_initCStream_usingDict(), ZSTDMT_initCStream_usingDict() now keep a copy of dict internally. This way, dict can be released : it does not longer have to outlive all future compression sessions.	2017-01-22 16:44:15 -08:00
Yann Collet	9d6f7637ec	protected (mutex) read to jobCompleted, as suggested by @terrelln	2017-01-21 22:14:08 -08:00
Yann Collet	0cf74fa957	optimized pool allocation by 1 slot	2017-01-21 22:06:49 -08:00
Yann Collet	6ed29a8f44	minor : tab to spaces	2017-01-21 21:56:36 -08:00
Yann Collet	317604e0ad	fixed : compilation of zstreamtest in dll mode	2017-01-20 17:18:41 -08:00
Yann Collet	d7e3cb58c5	Resolved merge conflict dev+zstdmt	2017-01-20 16:44:50 -08:00
cyan4973	2e3b659ae1	fixed minor warnings (Visual, conversion, doxygen)	2017-01-20 14:43:09 -08:00
cyan4973	5fba09fa41	updated util's time for Windows compatibility Correctly measures time on Posix systems when running with Multi-threading Todo : check Windows measurement under multi-threading	2017-01-20 12:57:31 -08:00
Yann Collet	b459aad5b4	renamed savedRep into repToConfirm	2017-01-19 17:33:37 -08:00
Yann Collet	500014af49	zstd cli can now compress using multi-threading added : command -T# added : ZSTD_resetCStream() (zstdmt_compress) added : FIO_setNbThreads() (fileio)	2017-01-19 17:04:28 -08:00
Yann Collet	19d670ba9d	Added ZSTDMT_initCStream_advanced() variant Correctly compress with custom params and dictionary Added relevant fuzzer test in zstreamtest Also : new macro ZSTDMT_SECTION_LOGSIZE_MIN, which sets a minimum size for a full job (note : a flush() command can still generate a partial job anytime)	2017-01-19 15:32:07 -08:00
Yann Collet	0f984d94c4	changed MT enabling macro to ZSTD_MULTITHREAD	2017-01-19 14:05:07 -08:00
Yann Collet	736788f8e8	added streaming fuzzer tests for MT API Also : fixed corner case, where nb of jobs completed becomes > jobQueueSize which is possible when many flushes are issued while there is not enough dst buffer to flush completed ones.	2017-01-19 12:15:29 -08:00
Yann Collet	32dfae6f98	fixed Multi-threaded compression MT compression generates a single frame. Multi-threading operates by breaking the frames into independent sections. But from a decoder perspective, there is no difference : it's just a suite of blocks. Problem is, decoder preserves repCodes from previous block to start decoding next block. This is also valid between sections, since they are no different than changing block. Previous version would incorrectly initialize repcodes to their default value at the beginning of each section. When using them, there was a mismatch between encoder (default values) and decoder (values from previous block). This change ensures that repcodes won't be used at the beginning of a new section. It works by setting them to 0. This only works with regular (single segment) variants : extDict variants will fail ! Fortunately, sections beyond the 1st one belong to this category. To be checked : btopt strategy. This change was only validated from fast to btlazy2 strategies.	2017-01-19 10:32:55 -08:00
Yann Collet	37226c1e9f	Simplified compressChunk job minor refactoring : compression done in a single call on first chunk Avoid a mutable hSize variable and eventual recombination to cSize at the end	2017-01-19 10:18:17 -08:00
Yann Collet	dab5ea93f2	Merge pull request #515 from iburinoc/emptydict Don't create dict in streaming apis if dictSize == 0	2017-01-19 09:02:42 -08:00
Yann Collet	6073b3e6b8	ZSTDMT_endStream : nullify input buffer after flush There will be no more input after ZSTDMT_endStream invocation : only flush/end is allowed (to fully collect compressed result).	2017-01-18 15:32:38 -08:00
Yann Collet	3a01c46b26	ZSTDMT_initCStream() supports restart from invalid state ZSTDMT_initCStream() will correcly scrub for resources when it detects that previous compression was not properly finished.	2017-01-18 15:18:17 -08:00
Yann Collet	4885f591b3	trap compression errors, collect back resources from workers	2017-01-18 14:11:37 -08:00
Sean Purcell	0b5370ae38	Prefix notes with /**<	2017-01-18 13:45:02 -08:00
Yann Collet	563ef8acf4	CCtxPool starts empty, as suggested by @terrelln Also : make zstdmt now a target from root	2017-01-18 12:12:10 -08:00
Yann Collet	a6db7a7b9b	fixed cmaketest (buffer_t){NULL,0} is not considered a constant. {NULL,0} is.	2017-01-18 11:57:34 -08:00
Yann Collet	0d6b8f65a9	ZSTDMT_free() scrubs potentially unfinished jobs to release their resources In some complex scenarios (free() without finishing compression), it is possible that some resources are still into jobs and not collected back into pools. In which case, previous version of free() would miss them. This would be equivalent to a leak. New version ensures that it even foes after such resource. It requires job consumers to properly mark resources as released, by replacing entries by NULL after releasing back to the pool. Obviously, it's not recommended to free() zstdmt context mid-term, still that's now a supported scenario. The same methodology is also used to ensure proper resource collection after an error is detected. Still to do : - detect compression errors (not just allocation ones) - properly manage resource when init() is called without finishing previous compression.	2017-01-17 17:46:33 -08:00
Yann Collet	d0a1d45582	ZSTDMT_{flush,end}Stream() now block on next job completion when nothing to flush The main issue was to avoid a caller to continually loop on {flush,end}Stream() when there was nothing ready to be flushed but still some compression work ongoing in a worker thread. The continuous loop would have resulted in wasted energy. The new version makes call to {flush,end}Stream blocking when there is nothing ready to be flushed. Of course, if all worker threads have exhausted job, it will return zero (all flush completed). Note : There are still some remaining issues to report error codes and properly collect back resources into pools when an error is triggered.	2017-01-17 16:15:18 -08:00
Yann Collet	a73c412932	completed ZSTDMT streaming compression Provides the baseline compression API : size_t ZSTDMT_initCStream(ZSTDMT_CCtx* zcs, int compressionLevel); size_t ZSTDMT_compressStream(ZSTDMT_CCtx* zcs, ZSTD_outBuffer* output, ZSTD_inBuffer* input); size_t ZSTDMT_flushStream(ZSTDMT_CCtx* zcs, ZSTD_outBuffer* output); size_t ZSTDMT_endStream(ZSTDMT_CCtx* zcs, ZSTD_outBuffer* output); Not tested yet	2017-01-17 15:31:16 -08:00
Sean Purcell	57d423c5df	Don't create dict in streaming apis if dictSize == 0	2017-01-17 14:31:35 -08:00
Przemyslaw Skibinski	8a0bc30a2d	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11	2017-01-17 13:02:29 +01:00
Przemyslaw Skibinski	d72f4b6b7a	added "Makefile is validated"	2017-01-17 12:40:06 +01:00
Gregory Szorc	7d6f478d15	Set dictionary ID in ZSTD_initCStream_usingCDict() When porting python-zstandard to use ZSTD_initCStream_usingCDict() so compression dictionaries could be reused, an automated test failed due to compressed content changing. I tracked this down to ZSTD_initCStream_usingCDict() not setting the dictID field of the ZSTD_CCtx attached to the ZSTD_CStream instance. I'm not 100% convinced this is the correct or full solution, as I'm still seeing one automated test failing with this change.	2017-01-14 17:44:54 -08:00
Yann Collet	5b726dbe4d	fix gcc-arm warning "suggest braces around empty body"	2017-01-12 17:46:46 +01:00
Yann Collet	ad9f6bd123	zstdmt : fix : resources properly collected even when early fail In previous version, main function would return early when detecting a job error. Late threads resources were therefore not collected back into pools. New version just register the error, but continue the collecting process. All buffers and context should be released back to pool before leaving main function.	2017-01-12 03:06:35 +01:00
Sean Purcell	834ab50fa3	Fixed decompress_usingDict not propagating corrupted dictionary error	2017-01-11 17:31:34 -08:00
Yann Collet	b05c4828ea	zstdmt : correctly check for cctx and buffer allocation Result from getBuffer and getCCtx could be NULL when allocation fails. Now correctly checks : job creation stop and last job reports an allocation error. releaseBuffer and releaseCCtx are now also compatible with NULL input. Identified a new potential issue : when early job fails, later jobs are not collected for resource retrieval.	2017-01-12 02:01:28 +01:00
Yann Collet	107bcbbbc2	zstdmt : changed internal naming from frame to chunk Since the result of mt compression is a single frame, changed naming, which implied the concatenation of multiple frames. minor : ensures that content size is written in header	2017-01-12 01:25:46 +01:00
Yann Collet	5eb749e734	ZSTDMT_compress() creates a single frame The new strategy involves cutting frame at block level. The result is a single frame, preserving ZSTD_getDecompressedSize() As a consequence, bench can now make a full round-trip, since the result is compatible with ZSTD_decompress(). This strategy will not make it possible to decode the frame with multiple threads since the exact cut between independent blocks is not known. MT decoding needs further discussions.	2017-01-11 18:21:25 +01:00
Yann Collet	04cbc36499	minor refactor (release CCtx 1st) and comment clarification	2017-01-11 16:08:08 +01:00
Yann Collet	085179bb78	fixed ZSTDMT_createCCtx() : checked inner objects are properly created	2017-01-11 15:58:05 +01:00
Yann Collet	8ce1cc2bec	improved ZSTD_createCCtxPool() cancellation use ZSTD_freeCCtxPool() to release the partially created pool. avoids to duplicate logic. Also : identified a new difficult corner case : when freeing the Pool, all CCtx should be previously released back to the pool. Otherwise, it means some CCtx are still in use. There is currently no clear policy on what to do in such a case. Note : it's supposed to never happen. Since pool creation/usage is static, it has no external user, which limits risks.	2017-01-11 15:44:26 +01:00
Yann Collet	47557ba2b2	fixed ZSTDMT_createCCtxPool() when inner CCtx creation fails	2017-01-11 15:35:56 +01:00
Nick Terrell	8d984699db	Document memory requirements for COVER algorithm	2017-01-09 18:20:10 -08:00
Nick Terrell	555e281637	Handle large input size in 32-bit mode correctly	2017-01-09 18:20:06 -08:00
Nick Terrell	3a1fefcf00	Simplify COVER parameters	2017-01-02 17:51:38 -08:00
Nick Terrell	96b39f65fa	Add COVER dictionary builder	2017-01-02 13:22:51 -08:00
Yann Collet	6334b04d61	compile object files, for faster recompilation	2017-01-02 03:22:18 +01:00
Yann Collet	f1cb55192c	fixed linux warnings	2017-01-02 01:11:55 +01:00
Yann Collet	0ec6a95ba1	minor fixes	2017-01-02 00:49:42 +01:00
Yann Collet	2ec635a162	use pthread_cond to send signals between threads	2017-01-01 17:31:33 +01:00
Nick Terrell	bb13387d7d	Fix pool for threading.h	2016-12-31 19:10:47 -05:00
Nick Terrell	4204e03e77	Add threading.h condition variables	2016-12-31 19:10:29 -05:00
Yann Collet	3b9d434356	extended ZSTDMT code support for non-MT systems and WIN32 (preliminary)	2016-12-31 16:32:19 +01:00
Yann Collet	c8efc1c874	simplified Buffer Pool	2016-12-31 14:45:33 +01:00
Yann Collet	3b29dbd9e8	new zstdmt version using generic treadpool	2016-12-31 06:04:25 +01:00
Yann Collet	c6a6417458	bench correctly measures time for multi-threaded compression (posix only)	2016-12-31 03:31:26 +01:00
Yann Collet	f765a375a5	Merge pull request #504 from terrelln/thread-pool [zstdmt] Add thread pool	2016-12-30 15:31:49 +01:00
Nick Terrell	e777a5be6b	Add a thread pool for ZSTDMT and COVER	2016-12-29 23:39:44 -08:00
Yann Collet	e70912c72b	Changed : input divided into roughly equal parts. Debug : can measure time waiting for mutexes to unlock.	2016-12-29 01:24:01 +01:00
Yann Collet	6c0ed9483a	compression threads use ZSTD_compressCCtx()	2016-12-28 17:08:28 +01:00
Yann Collet	8d7432914f	Merge pull request #503 from inikep/dev11 Dev11	2016-12-28 16:50:39 +01:00
Yann Collet	ce9e1452fd	protect buffer pool with a mutex	2016-12-28 15:31:19 +01:00
Przemyslaw Skibinski	75f3a3a335	changed default PREFIX and MANDIR	2016-12-28 12:32:41 +01:00
Yann Collet	3d93f2fce7	first zstdmt sketch	2016-12-27 07:19:36 +01:00
Yann Collet	39c105c605	Merge branch 'dev' of github.com:facebook/zstd into dev	2016-12-23 22:25:31 +01:00
Yann Collet	aca113f4f5	fixed ZSTD_sizeof_?Dict()	2016-12-23 22:25:03 +01:00
Yann Collet	c07d2e3a31	Merge pull request #499 from inikep/dev11 improved *BSD and Solaris compatibility	2016-12-23 21:32:03 +01:00
Przemyslaw Skibinski	63b0014b96	BSD: improved "make install"	2016-12-23 10:05:49 +01:00
Nick Terrell	1b5d4a7d53	ZDICT_finalizeDictionary() flipped comparison	2016-12-22 18:14:57 -08:00
Nick Terrell	bcbe77e994	ZDICT_finalizeDictionary() flipped comparison `ZDICT_finalizeDictionary()` had a flipped comparison. I also allowed `dictBufferCapacity == dictContentSize`. It might be the case that the user wants to fill the dictionary completely up, and then let zstd take exactly the space it needs for the entropy tables.	2016-12-22 18:01:14 -08:00
Nick Terrell	78a0072d5a	Fix failing test due to deprecation warning	2016-12-22 17:36:16 -08:00
Yann Collet	d76d1a9ef0	added ZDICT_finalizeDictionary()	2016-12-22 20:18:43 +01:00
Przemyslaw Skibinski	b999170311	Solaris: working "make -C lib install"	2016-12-22 20:14:37 +01:00
Yann Collet	ba75e9d8c3	fix : zlib wrapper compile in gnu90 mode	2016-12-21 19:57:18 +01:00
Yann Collet	0819abe3c1	added ZSTD_createDDict_byReference() body	2016-12-21 19:25:15 +01:00
Yann Collet	4e5eea61a8	added ZSTD_createDDict_byReference()	2016-12-21 16:44:35 +01:00
Yann Collet	8333106b8a	Merge branch 'dev' of github.com:facebook/zstd into dev	2016-12-21 16:44:24 +01:00
Yann Collet	0d7e84899f	Merge pull request #489 from inikep/v112 improved detection of POSIX	2016-12-21 16:42:46 +01:00
Yann Collet	1f57c2ed32	added : ZSTD_createCDict_byReference()	2016-12-21 16:20:11 +01:00
Przemyslaw Skibinski	2f6ccee6af	platform.h: removed Compiler Options	2016-12-21 13:23:34 +01:00
Przemyslaw Skibinski	5736db219e	fix basic types redefinition	2016-12-21 09:26:00 +01:00
Nick Terrell	8157a4c3cc	Fix dictionary loading bug causing an MSAN failure Offset rep codes must be in the range `[1, dictSize)`. Fix dictionary loading to reject `0` as a offset rep code.	2016-12-20 10:47:52 -08:00
Przemyslaw Skibinski	f8046b8e72	Merge remote-tracking branch 'refs/remotes/facebook/dev' into v112 # Conflicts: # appveyor.yml	2016-12-19 08:20:26 +01:00
Yann Collet	d564faa3c6	fix : ZSTD_initCStream_srcSize() correctly set srcSize in frame header	2016-12-18 21:39:15 +01:00
Yann Collet	1496c3dc47	Fix : size estimation when some samples are very large	2016-12-18 11:58:23 +01:00
Yann Collet	d46ecb58a5	added dll compilation tests	2016-12-17 16:28:12 +01:00
Nick Terrell	8de46ab51a	Export all API functions	2016-12-16 13:27:30 -08:00
Przemyslaw Skibinski	0a1caeef6d	VS: fixed 32-bit DLL compilation	2016-12-15 12:12:46 +01:00
Przemyslaw Skibinski	4e10bd339d	appveyor.yml: added tests of fullbench-dll fullbench-lib	2016-12-15 12:09:23 +01:00
Przemyslaw Skibinski	60f10aab6c	introduced ZSTDLIB_VISIBILITY	2016-12-15 11:32:31 +01:00
Yann Collet	2b36b238d3	changed variable name to estimatedSrcSize, to emphasize it does not need to be exact	2016-12-13 17:59:55 +01:00
Yann Collet	e795c8a5f6	Added ZSTD_initCStream_srcSize(). Added relevant test cases in zstreamtest	2016-12-13 17:00:14 +01:00
Yann Collet	5397a66b19	minor BMI version check	2016-12-13 15:21:06 +01:00
Yann Collet	35168679bd	Merge pull request #478 from terrelln/wildcopy-ub Fix execSequence wildcopy undefined behavior	2016-12-13 11:33:00 +01:00
Nick Terrell	064a143520	Fix execSequence wildcopy undefined behavior execSequence relied on pointer overflow to handle cases where `sequence.matchLength < 8`. Instead of passing an `size_t` to wildcopy, pass a `ptrdiff_t`.	2016-12-12 19:01:23 -08:00
Nick Terrell	e474aa55b4	Fix decompression buffer overrun Allows an adversary to write up to 3 bytes beyond the end of the buffer. Occurs if the match overlaps the `extDict` and `currentPrefix`, and the match length in the `currentPrefix` is less than `MINMATCH`, and `op-(16-MINMATCH) >= oMatchEnd > op-16`.	2016-12-12 18:05:30 -08:00
Yann Collet	c3a5c4bef8	introduced cycleLog	2016-12-12 00:47:30 +01:00
Yann Collet	c261f71f6a	minor variation of rescale fix	2016-12-12 00:25:07 +01:00
Nick Terrell	3826207a70	Simplify segfault fix Take advantage of the fact that `chainLog <= windowLog`.	2016-12-10 18:46:55 -08:00
Nick Terrell	0012332ce0	Fix compression segfault When the overflow protection kicks in, it makes sure that ip - ctx->base isn't too large. However, it didn't ensure that saved offsets are still valid. This change ensures that any valid offsets (<= windowLog) are still representable after the update. The bug would shop up on line 1056, when `offset_1 > current + 1`, which causes an underflow. This in turn, would cause a segfault on line 1063. The input must necessarily be longer than 1 GB for this issue to occur. Even then, it only occurs if one of the last 3 matches is larger than the chain size and block size.	2016-12-09 17:15:33 -08:00
Yann Collet	383b8088a3	minor lib build refactoring	2016-12-08 18:42:27 -08:00
Yann Collet	6e754fe76a	fixed lib soname. example : simple_compression : size overflow check	2016-12-08 18:26:56 -08:00
Przemyslaw Skibinski	7687913178	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11	2016-12-08 10:42:42 +01:00
Yann Collet	426a9d4b71	changed : dll : only approved ZSTD symbols are now exposed. All other symbols remain internal.	2016-12-07 16:39:34 -08:00
Przemyslaw Skibinski	4da53219a0	zstd Manual updated to 1.1.2	2016-12-07 11:18:40 +01:00
Yann Collet	379908be3d	fixed zstd.h for manual	2016-12-06 10:36:15 -08:00
Yann Collet	9dfd0af164	ZBUFF_ as a wrapper to ZSTD streaming API.	2016-12-06 17:16:41 +01:00
Yann Collet	825dffbc43	moved zbuff source files into lib/deprecated	2016-12-05 19:28:19 -08:00
Yann Collet	8f8e2b0b4a	fixed initialization warning	2016-12-05 18:00:50 -08:00
Yann Collet	e7a41a5955	added : dictID retrieval functions. added : unit tests for dictID retrieval functions	2016-12-05 16:21:06 -08:00
Yann Collet	9ffbeea875	API : changed : streaming decompression : implicit reset on starting new frames	2016-12-02 18:37:38 -08:00
Yann Collet	2238312c2f	fix dict loading	2016-12-02 11:36:11 -08:00
Przemyslaw Skibinski	821bf1febc	fixed Doxygen trailing comment	2016-12-02 16:13:41 +01:00
Yann Collet	b89af20353	reduced table sizes for HUF_readDTableX4	2016-12-01 18:24:59 -08:00
Yann Collet	a0d742b1e4	introduced HUF_buildCTable_wksp(), to reduce stack memory usage	2016-12-01 17:47:30 -08:00
Yann Collet	643d9a234b	replaced usage of FSE_buildCTable by FSE_buildCTable_wksp, using less stack space in the process	2016-12-01 16:24:04 -08:00
Yann Collet	e928f7e16d	introduced ext_wksp variants of count to reduce stack memory usage	2016-12-01 16:13:35 -08:00
Yann Collet	979cab412b	fixed some minor visual silent cast warnings. introduced FSE_count_parallel_wksp().	2016-11-30 18:10:38 -08:00
Yann Collet	5e00b848a8	FSE_compress_wksp() uses less stack space	2016-11-30 16:46:13 -08:00
Yann Collet	d79a9a00d9	Introduced FSE_compress_wksp() and FSE_buildCTable_wksp() to reduce stack memory usage	2016-11-30 15:52:20 -08:00
Yann Collet	766431909f	introduced FSE_decompress_wksp(), to use less stack space	2016-11-30 12:36:45 -08:00
Yann Collet	95eb43be09	updated pkg config file	2016-11-30 11:06:58 -08:00
Yann Collet	42247705a3	Merge pull request #461 from obache/neatsrc/pkgconfig libzstd.pc.in: Change to use variables for libdir and includedir	2016-11-30 20:03:42 +01:00
Yann Collet	ff504de391	minor decompression speed improvement	2016-11-29 17:42:46 -08:00
Yann Collet	25f46dcc0f	minor const	2016-11-29 16:59:27 -08:00
Yann Collet	a56ac2815c	restored normal decoder speed	2016-11-29 15:30:23 -08:00
Yann Collet	37870d7a66	fixed minor visual warning	2016-11-29 14:31:57 -08:00
Yann Collet	013a2b58ae	Merge pull request #464 from terrelln/guards Fix ZSTD_STATIC_LINKING_ONLY with double include	2016-11-29 23:06:10 +01:00
Yann Collet	167c494748	Merge branch 'dev' of github.com:facebook/zstd into dev	2016-11-29 14:05:15 -08:00
Yann Collet	4f5350f610	long matches support overflow	2016-11-29 13:12:24 -08:00
Nick Terrell	05c00f2ff7	Fix ZSTD_STATIC_LINKING_ONLY with double include	2016-11-29 11:54:39 -08:00
OBATA Akio	c53eea7c1a	libzstd.pc.in: Change to use variables for libdir and includedir	2016-11-29 16:47:53 +09:00
Yann Collet	52e136ed3d	long decoder compatible with round and separate buffers	2016-11-28 19:59:11 -08:00
Yann Collet	ce3527ca0c	combined normal and long decoder	2016-11-28 18:38:52 -08:00
Yann Collet	8993bee997	restored normal mode	2016-11-28 16:11:30 -08:00
Yann Collet	764e70a4f3	added decodeSequencesLong	2016-11-28 15:50:16 -08:00
Yann Collet	73f88a66f1	added prefetch	2016-11-23 15:43:30 -08:00
Yann Collet	50524bf0da	delayed decompression	2016-11-23 15:11:07 -08:00
Przemyslaw Skibinski	fc4193bda5	fixed g++ warnings	2016-11-23 18:17:18 +01:00
Przemyslaw Skibinski	9ca65af810	zstd_opt.h: improved price function	2016-11-23 17:22:54 +01:00
Przemyslaw Skibinski	ad3e94512c	fixed warnings from static analyzer in zstd_opt.h	2016-11-21 20:22:12 +01:00
Przemyslaw Skibinski	eec700a3b7	exclude zbuff_compress.c and zbuff_decompress.c	2016-11-21 14:34:03 +01:00
Przemyslaw Skibinski	62d19a6f39	lib\README.md: added Using MinGW+MSYS to create DLL	2016-11-21 14:22:08 +01:00
Przemyslaw Skibinski	b85f767743	files to generate ZSTD Windows binary package	2016-11-21 14:10:55 +01:00
Przemyslaw Skibinski	8bb86e330b	create DLL with Windows	2016-11-21 12:51:01 +01:00
Przemyslaw Skibinski	93a09eedf1	added libzstd.def	2016-11-21 12:33:27 +01:00
Przemyslaw Skibinski	5a17223691	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11	2016-11-18 11:47:01 +01:00
Przemyslaw Skibinski	3d18088b38	updated windres	2016-11-17 18:04:41 +01:00
Yann Collet	0d761dbe95	Merge pull request #453 from inikep/dev11 fullbench-dll	2016-11-16 15:45:30 -08:00
Yann Collet	52afb3993e	zbuff API now generates deprecation warnings	2016-11-16 08:50:54 -08:00
Przemyslaw Skibinski	179555c1d1	working fullbench-dll	2016-11-15 18:05:46 +01:00
Nick Terrell	4359d21ad7	Merge two memset() calls into one	2016-11-14 17:52:51 -08:00
Nick Terrell	24701de877	Fix uninitialized memory read	2016-11-14 13:57:05 -08:00
Yann Collet	8e4901eccd	removed zbuff.h from include installation	2016-11-08 15:45:39 -08:00
Yann Collet	fd3be6bc97	bump version number to 1.1.2	2016-11-07 14:35:41 -08:00
Nick Terrell	dc904ad17b	Fix bug in zstd v0.{5, 6} dictionary decompression Introduced by `bb68062c59`.	2016-11-04 16:18:59 -07:00
Przemyslaw Skibinski	38b590ad69	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11 # Conflicts: # lib/Makefile	2016-11-04 10:10:54 +01:00
Yann Collet	407a11f63e	fixed Visual compatibility	2016-11-03 15:52:01 -07:00
Yann Collet	11812260d1	Merge pull request #439 from terrelln/dev ZSTD_compress_usingDict() specify dict == NULL	2016-11-03 14:15:36 -07:00
Nick Terrell	c8a9ac312b	Fix dynamic libzstd symlinks	2016-11-03 12:32:48 -07:00
Przemyslaw Skibinski	3a415594b1	fixed MinGW compilation	2016-11-03 12:59:20 +01:00
Yann Collet	7347869fb6	fixed make install	2016-11-02 22:28:37 -07:00
Nick Terrell	d82efd8a70	ZSTD_compress_usingDict() when dict gets loaded Specify that when `dict == NULL \|\| dictSize < 8` no dictionary gets loaded. Also add some periods.	2016-11-02 18:07:16 -07:00
Yann Collet	179b19776f	fileio.c does no longer need ZSTD_LEGACY_SUPPORT, and does no longer depend on zstd_legacy.h Added : ZSTD_isFrame() in experimental section	2016-11-02 17:30:49 -07:00
Yann Collet	f3f13211ae	Fix #419 : no warning when setting custom LDFLAGS	2016-11-02 17:02:45 -07:00
Yann Collet	0a5a5fb7fd	Fix #418 : printing selected segments in zdict debug mode can segfault with certain pathological patterns	2016-11-02 13:57:55 -07:00
Yann Collet	31e660e7aa	more accurate default maximum window size	2016-10-29 03:56:45 -07:00
Yann Collet	2115724c22	Merge pull request #430 from terrelln/exec-sequences ZSTD_execSequence() accepts match in last 7 bytes	2016-10-28 10:45:05 -07:00
Nick Terrell	10bfd0c0d5	Fix ZSTD_execSequence() performance regression Commit `ae1cb3b3d0` caused the regression. It is an instruction alignment issue, because if it is `U64 i` instead of `U32 i`, the regression returns. This patch fixes the regression in gcc, but only gets some of the clang performance back. Benchmarks: Run on `silesia.tar`. I only show levels 1-5 because the performance regression was uniform across all levels. I did one run on levels 1-19 and it looked good. \| Build \| Level \| Before \| While \| After \| \|-------\|-------\|-------:\|------:\|------:\| \| gcc \| 1 \| 931.4 \| 904.4 \| 932.8 \| \| gcc \| 2 \| 849.1 \| 822.6 \| 851.2 \| \| gcc \| 3 \| 815.6 \| 790.6 \| 818.9 \| \| gcc \| 4 \| 794.1 \| 770.7 \| 798.0 \| \| gcc \| 5 \| 785.7 \| 760.7 \| 788.8 \| \| clang \| 1 \| 705.5 \| 683.2 \| 693.8 \| \| clang \| 2 \| 670.0 \| 649.2 \| 660.7 \| \| clang \| 3 \| 659.6 \| 639.8 \| 651.4 \| \| clang \| 4 \| 652.5 \| 634.7 \| 645.9 \| \| clang \| 5 \| 646.9 \| 625.5 \| 637.7 \|	2016-10-27 16:19:57 -07:00
Yann Collet	ee5b725823	ZSTD_initCStream() optimization : do not allocate a CDict when no dictionary used	2016-10-27 14:20:55 -07:00
Nick Terrell	eb7873a048	ZSTD_execSequence() accepts match in last 7 bytes The zstd reference compressor will not emit a match in the last 7 bytes of a block. The decompressor will also not accept a match in the last 7 bytes. This patch makes the decompressor accept a match in the last 7 bytes.	2016-10-25 21:24:15 -07:00
Yann Collet	335ad5d4d4	added ZSTD_initDStream_usingDDict() . slightly optimized ZSTD_initDStream() when no dictionary . fixed ZSTD_sizeof_CStream() .	2016-10-25 17:47:02 -07:00
Yann Collet	9516234e67	first sketch for ZSTD_initCStream_usingCDict()	2016-10-25 16:19:52 -07:00
Yann Collet	62d9a7ddfd	Merge pull request #429 from inikep/btopt2 Btopt2	2016-10-25 14:48:43 -07:00
Przemyslaw Skibinski	5c5f01f3da	added ZSTD_btopt2 strategy	2016-10-25 12:25:07 +02:00
Yann Collet	7b5948cca7	Merge pull request #426 from terrelln/fixes Fix various {A, M}SAN bugs	2016-10-24 23:42:26 -07:00
Yann Collet	37d130031d	updated comments on context re-use	2016-10-24 17:22:12 -07:00
Nick Terrell	b2c39a22b0	Fix compiler narrowing warning	2016-10-24 14:50:13 -07:00
Nick Terrell	f698ad6deb	Merge remote-tracking branch 'upstream/dev' into fixes * upstream/dev: added doc\zstd_manual.html added contrib\gen_html zstd_compression_format.md moved to doc/ Fix small bug in ZSTD_execSequence() improved ZSTD_compressBlock_opt_extDict_generic protect ZSTD_decodeFrameHeader() from invalid usage, as suggested by @spaskob zstd_opt.h: small improvement in compression ratio improved dicitonary segment merge use implicit rules to compile zstd_decompress.c detect early impossible decompression scenario in legacy decoder v0.5 no repeat mode in legacy v0.5 fixed invalid invocation of dictionary in legacy decoder v0.5 fix edge case fix command line interpretation fixed minor corner case zstd.h: added the Introduction section fixed clang 3.5 warnings zstd.h: updated comments	2016-10-24 13:10:13 -07:00
Yann Collet	4239a207dd	Merge pull request #425 from inikep/dev11 Doc	2016-10-24 11:11:40 -07:00
Nick Terrell	f9c9af3c2e	Reject dictionaries with incomplete entropy tables If a dictionary specifies that a symbol has probability zero in its `matchLength`, `literalLength`, or `offset` FSE table, but the symbol appears when compressing input, the compressor fails. Ensure that dictionaries support all `matchLength`, and `literalLength` codes. They must also support all of the `offset` codes required to represent every possible offset that can appear in the first block.	2016-10-24 10:42:44 -07:00
Przemyslaw Skibinski	984b66cd72	added contrib\gen_html	2016-10-24 15:59:51 +02:00
Przemyslaw Skibinski	3ee94a7600	zstd_compression_format.md moved to doc/	2016-10-24 15:58:07 +02:00
Yann Collet	97611611a3	Merge pull request #423 from terrelln/exec-seq-patch Fix small bug in ZSTD_execSequence()	2016-10-21 17:02:06 -07:00
Nick Terrell	ae1cb3b3d0	Fix small bug in ZSTD_execSequence() `memmove(op, match, sequence.matchLength)` is not the desired behavior. Overlap is allowed, and handled as if we did `op++ = match++`, which is not how `memmove()` handles overlap. Only triggered if both of the following conditions are met: * The match spans extDict & currentPrefixSegment * `oLitEnd <= oend_w < oLitEnd + length1 < oMatchEnd <= oend`. These two conditions imply that the block is less than 15 bytes long. This bug isn't triggered by the streaming API, because it allocates enough space for the window size + the block size, so there cannot be a match that is within 8 bytes of the end and overlaps with itself. It cannot be triggered by the block decompression API because all of the decompressed data is in the currentPrefixSegment. Introduced by commit `7158584399`	2016-10-21 12:13:44 -07:00
Przemyslaw Skibinski	4732074a71	improved ZSTD_compressBlock_opt_extDict_generic	2016-10-21 11:19:00 +02:00
Yann Collet	da3bd8b6de	protect ZSTD_decodeFrameHeader() from invalid usage, as suggested by @spaskob	2016-10-20 20:11:00 -07:00
Przemyslaw Skibinski	d365ae3497	zstd_opt.h: small improvement in compression ratio	2016-10-20 11:49:02 +02:00
Przemyslaw Skibinski	575ab00db7	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11	2016-10-20 11:01:52 +02:00
Nick Terrell	d760529a05	Fix stack buffer overrun when weightTotal == 0 If `weightTotal == 0`, then `BIT_highbit32(weightTotal)` is undefined behavior in the case that it calls `__builtin_clz()`. If `tableLog == HUF_TABLELOG_ABSOLUTEMAX` then we will access one byte beyond the end of the buffer.	2016-10-19 11:39:11 -07:00
Nick Terrell	bb68062c59	Unitialized memory read in ZSTD_decodeSeqHeaders() Caused by two things: 1. Not checking that `ip` is in range except for the first byte. 2. `ZSTDv0{5,6}_decodeLiteralsBlock()` could return a value larger than `srcSize`.	2016-10-18 16:41:33 -07:00
Yann Collet	52c1bf93fe	improved dicitonary segment merge	2016-10-18 16:34:58 -07:00
Nick Terrell	7b06ad7a05	Backport fix from commit `125d817` This fixes a read of unitialized memory. Full commit hash: `125d81774f`.	2016-10-18 14:52:34 -07:00
Nick Terrell	f45b157d95	Backport fix from commit `9e8b09a` Fixes uninitialized memory reads. Full commit hash: `9e8b09a7bd`	2016-10-18 14:22:49 -07:00
Yann Collet	f7906d5955	detect early impossible decompression scenario in legacy decoder v0.5	2016-10-18 13:48:32 -07:00
Yann Collet	9313c8d953	no repeat mode in legacy v0.5	2016-10-18 13:36:15 -07:00
Yann Collet	83d7bdee4b	fixed invalid invocation of dictionary in legacy decoder v0.5	2016-10-18 12:25:43 -07:00
Yann Collet	197a55ee7b	fix edge case	2016-10-18 11:27:52 -07:00
Nick Terrell	fd98087047	Fix stack buffer overflow in HUF_readCTable() If `w ==0` on line 153, then `CTable[n].nbBits == tableLog + 1`. Then `nbPerRank[CTable[n].nbBits]` and `valPerRank[CTable[n].nbBits]` are stack buffer overflows.	2016-10-17 18:16:59 -07:00
Yann Collet	06573e17be	fixed minor corner case	2016-10-17 17:28:28 -07:00
Nick Terrell	bfd943ace5	Fix buffer overrun in ZSTD_loadDictEntropyStats() The table log set by `FSE_readNCount()` was not checked in `ZSTD_loadDictEntropyStats()`. This caused `FSE_buildCTable()` to stack/heap overflow in a few places. The benchmarks look good, there is no obvious compression performance regression: > ./zstds/zstd.opt.0 -i10 -b1 -e10 ~/bench/silesia.tar 1#silesia.tar : 211988480 -> 73656930 (2.878), 271.6 MB/s , 716.8 MB/s 2#silesia.tar : 211988480 -> 70162842 (3.021), 204.8 MB/s , 671.1 MB/s 3#silesia.tar : 211988480 -> 66997986 (3.164), 156.8 MB/s , 658.6 MB/s 4#silesia.tar : 211988480 -> 66002591 (3.212), 136.4 MB/s , 665.3 MB/s 5#silesia.tar : 211988480 -> 65008480 (3.261), 98.9 MB/s , 647.0 MB/s 6#silesia.tar : 211988480 -> 62979643 (3.366), 65.2 MB/s , 670.4 MB/s 7#silesia.tar : 211988480 -> 61974560 (3.421), 44.9 MB/s , 688.2 MB/s 8#silesia.tar : 211988480 -> 61028308 (3.474), 32.4 MB/s , 711.9 MB/s 9#silesia.tar : 211988480 -> 60416751 (3.509), 21.1 MB/s , 718.1 MB/s 10#silesia.tar : 211988480 -> 60174239 (3.523), 22.2 MB/s , 721.8 MB/s > ./compress_zstds/zstd.opt.1 -i10 -b1 -e10 ~/bench/silesia.tar 1#silesia.tar : 211988480 -> 73656930 (2.878), 273.8 MB/s , 722.0 MB/s 2#silesia.tar : 211988480 -> 70162842 (3.021), 203.2 MB/s , 666.6 MB/s 3#silesia.tar : 211988480 -> 66997986 (3.164), 157.4 MB/s , 666.5 MB/s 4#silesia.tar : 211988480 -> 66002591 (3.212), 132.1 MB/s , 661.9 MB/s 5#silesia.tar : 211988480 -> 65008480 (3.261), 96.8 MB/s , 641.6 MB/s 6#silesia.tar : 211988480 -> 62979643 (3.366), 63.1 MB/s , 677.0 MB/s 7#silesia.tar : 211988480 -> 61974560 (3.421), 44.3 MB/s , 678.2 MB/s 8#silesia.tar : 211988480 -> 61028308 (3.474), 33.1 MB/s , 708.9 MB/s 9#silesia.tar : 211988480 -> 60416751 (3.509), 21.5 MB/s , 710.1 MB/s 10#silesia.tar : 211988480 -> 60174239 (3.523), 21.9 MB/s , 723.9 MB/s	2016-10-17 16:55:52 -07:00
Nick Terrell	4db751668f	Fix buffer overrun in ZSTD_loadEntropy() The table log set by `FSE_readNCount()` was not checked in `ZSTD_loadEntropy()`. This caused `FSE_buildDTable(dctx->MLTable, ...)` to overwrite the beginning of `dctx->hufTable`. The benchmarks look good, there is no obvious performance regression: > ./zstds/zstd.opt.0 -i10 -b1 -e5 ~/bench/silesia.tar 1#silesia.tar : 211988480 -> 73656930 (2.878), 268.2 MB/s , 701.0 MB/s 2#silesia.tar : 211988480 -> 70162842 (3.021), 199.5 MB/s , 666.9 MB/s 3#silesia.tar : 211988480 -> 66997986 (3.164), 154.9 MB/s , 655.6 MB/s 4#silesia.tar : 211988480 -> 66002591 (3.212), 128.9 MB/s , 648.4 MB/s 5#silesia.tar : 211988480 -> 65008480 (3.261), 98.4 MB/s , 633.4 MB/s > ./zstds/zstd.opt.2 -i10 -b1 -e5 ~/bench/silesia.tar 1#silesia.tar : 211988480 -> 73656930 (2.878), 266.1 MB/s , 703.7 MB/s 2#silesia.tar : 211988480 -> 70162842 (3.021), 199.0 MB/s , 666.6 MB/s 3#silesia.tar : 211988480 -> 66997986 (3.164), 156.2 MB/s , 656.2 MB/s 4#silesia.tar : 211988480 -> 66002591 (3.212), 133.2 MB/s , 647.4 MB/s 5#silesia.tar : 211988480 -> 65008480 (3.261), 96.3 MB/s , 633.3 MB/s	2016-10-17 15:51:15 -07:00
Nick Terrell	ccfcc643da	Check if dict is empty before reading first byte	2016-10-17 11:46:03 -07:00
Yann Collet	2b361cf2f1	minor opt	2016-10-14 16:09:07 -07:00
Yann Collet	7933434fdf	Merge branch 'dev' of github.com:facebook/zstd into dev	2016-10-14 13:32:35 -07:00
Yann Collet	d4cda27b63	new command -M#, to limit memory usage during decompression (#403 )	2016-10-14 13:32:20 -07:00
Nick Terrell	3b9cdf9220	Fix ubsan failures (pass NULL to memcpy)	2016-10-12 20:54:42 -07:00
Yann Collet	5d919e7ac3	added ZSTD_error_frameParameter_windowTooLarge (#403 )	2016-10-12 17:29:24 -07:00
Yann Collet	e19111c42f	make creates libzstd binaries (#415 )	2016-10-12 11:09:36 -07:00
Yann Collet	8b70d012f0	fix cmake	2016-10-12 10:23:53 -07:00
Yann Collet	38fb0dc4cf	Merge pull request #416 from terrelln/exec-sequence Fix ZSTD_execSequence() edge case	2016-10-12 10:17:53 -07:00
Nick Terrell	7158584399	Fix ZSTD_execSequence() edge case	2016-10-12 10:05:26 -07:00
Yann Collet	f52cd03e73	bumped version number	2016-10-11 17:29:27 -07:00
Yann Collet	ef2357d0d3	created error_private.c, so that a single list of error strings get included	2016-10-11 17:24:50 -07:00
Yann Collet	14efab827b	added zstd_errors.h to include installation	2016-10-11 16:51:29 -07:00
Yann Collet	a17fd7312a	changed error_public.h into zstd_errors.h	2016-10-11 16:41:09 -07:00
Yann Collet	18b51b99c0	sync fse	2016-10-11 08:21:09 -07:00
inikep	2d2613399a	zstd.h: added the Introduction section	2016-10-06 16:28:21 +02:00
inikep	ba1db376ac	fixed clang 3.5 warnings	2016-10-06 14:22:48 +02:00
inikep	82057aa7ec	zstd.h: updated comments	2016-10-06 13:23:52 +02:00
Yann Collet	df6797447f	update dictionary builder warning comments	2016-09-27 15:14:32 +02:00
Yann Collet	47094ea66b	added comment on filePos	2016-09-26 18:03:33 +02:00
Yann Collet	cf409a7e2a	fixed : init*_advanced() followed by reset() with different pledgedSrcSiz	2016-09-26 16:41:05 +02:00
Yann Collet	2f2639438a	zstreamtest can fuzztest pledgedSrcSize	2016-09-26 14:06:08 +02:00
Christophe Chevalier	dc245e91cb	Changed to use ZSTDLIBv06_API and ZSTDLIBv07_API for DLL exports to fix warning - changed name to prevent collision with ZSTDLIB_API used by non-legacy dll exports	2016-09-23 17:09:36 +02:00
Yann Collet	21412bb3f6	Merge branch 'dev' of github.com:Cyan4973/zstd into dev	2016-09-22 15:57:56 +02:00
Yann Collet	51f4d566c2	small decompression speed boost for very small data	2016-09-22 15:57:28 +02:00
Yann Collet	97b378a6f8	Streaming : dictionary compression on multiple files / segments can correctly provide srcSize into header (when provided) using pledgedSrcSize.	2016-09-21 17:20:19 +02:00
Yann Collet	993060e0f2	cli : better adaptation to small files	2016-09-21 16:46:08 +02:00
Yann Collet	1eb2fdc74f	bumped version number	2016-09-18 12:21:47 +02:00
Yann Collet	a6bdf55759	fixed memory leak	2016-09-15 17:02:06 +02:00
Yann Collet	644a8da88a	fixed minor conversion warning	2016-09-15 16:16:21 +02:00
Yann Collet	4cb212938c	introduced ZSTD_resetCStream()	2016-09-15 14:54:07 +02:00
Yann Collet	fa0c09760c	variable renaming	2016-09-15 14:11:01 +02:00
Yann Collet	d7c6589df8	support ZSTD_sizeof_*() on NULL added ZSTD_sizeof_CDict()	2016-09-15 02:57:27 +02:00
Yann Collet	e91c4b4cef	introduced ZSTD_resetDStream() . added : ZSTD_sizeof_DDict()	2016-09-14 16:55:44 +02:00
Yann Collet	d092d77cfc	minor variable renaming	2016-09-14 16:14:57 +02:00
Yann Collet	64deef3bee	Fixed srcSize=1	2016-09-14 00:16:07 +02:00
Yann Collet	26ec254066	new strategy for faster DDict decompression	2016-09-13 16:52:16 +02:00
Yann Collet	ac175d46d4	updated comments	2016-09-13 00:51:47 +02:00
Yann Collet	a3481d6de0	make uninstall	2016-09-12 05:04:26 +02:00
Yann Collet	b3060f7a9e	changed streaming decoder behavior : now, when all compressed frame is consumed, it means decompression is completed, with regenerated data fully flushed.	2016-09-09 16:44:16 +02:00
Yann Collet	01c199226a	updated decompression streaming example	2016-09-08 19:29:04 +02:00
Yann Collet	5c6d244973	Merge branch 'dev' of github.com:facebook/zstd into dev	2016-09-07 14:54:54 +02:00
Yann Collet	ac8bace6b1	support large skippable frames	2016-09-07 14:54:23 +02:00
Yann Collet	0e07bf3f60	added comments on searchLength min / max (#337 )	2016-09-07 06:33:02 +02:00
Yann Collet	95d07d7447	introduced CHECK_E	2016-09-06 16:38:51 +02:00
Yann Collet	3e21ec5b01	introduced CHECK_F	2016-09-06 15:36:19 +02:00
Yann Collet	5c956d593c	FORCE_INLINE common definition	2016-09-06 15:05:19 +02:00
Yann Collet	edbcd9f5b2	fixed zbufftest	2016-09-06 14:30:57 +02:00
Yann Collet	b624922b14	fixed checksum	2016-09-06 11:16:57 +02:00
Yann Collet	a7737f6a60	improved compression on small files when using same parameters	2016-09-06 09:44:59 +02:00
Yann Collet	7ae67bb18a	small compression speed gains with using_CDict	2016-09-06 06:28:05 +02:00
Yann Collet	1d4208c029	clarified streaming decompression inlined doc	2016-09-06 05:16:40 +02:00
Yann Collet	7c83dfd5c2	ZSTD_frameHeaderSize_prefix (#340 ), as result of ZSTD_initStream	2016-09-05 19:47:43 +02:00
Yann Collet	fa72f6bdce	clarified inline doc for streaming	2016-09-05 17:39:56 +02:00
Yann Collet	c73a8109bb	Merge pull request #344 from inikep/dev10 unified error codes for legacy decoders	2016-09-05 07:46:33 -07:00
inikep	45db83f98d	ZSTD_decodeLiteralsBlock renamed to ZSTDv01_decodeLiteralsBlock	2016-09-05 14:46:24 +02:00
inikep	476964f6a1	ZSTD_decodeSeqHeaders renamed to ZSTDv01_decodeSeqHeaders	2016-09-05 13:34:57 +02:00
inikep	c13faa1b0f	legacy decoders: restored #include <intrin.h> for VC++	2016-09-05 13:25:07 +02:00
inikep	8161e7321a	unified error codes for legacy decoders	2016-09-05 12:29:51 +02:00
Thomas Klausner	b85cdabd50	Enable install targets for NetBSD.	2016-09-04 14:37:57 +02:00
Yann Collet	33a0465a51	fixed a few links	2016-09-02 22:11:49 -07:00
Yann Collet	d56dbc02d3	removed g_displayLevel	2016-09-02 17:28:41 -07:00
Yann Collet	855766d73d	clarified dictionary in format description	2016-09-02 17:04:49 -07:00
Yann Collet	d725427a3c	g_time => local displayTime	2016-09-02 15:32:39 -07:00
Yann Collet	1563bfeabc	fixing FORCE_INLINE for older compilers (#330 )	2016-09-02 11:44:21 -07:00
Yann Collet	7304eb7c09	bumped version number	2016-09-01 15:49:26 -07:00
Yann Collet	901e85fe26	version bump	2016-08-31 07:51:25 -07:00
Yann Collet	1c59c20903	removed redundant files	2016-08-31 07:15:44 -07:00
Yann Collet	599c69d917	minor Makefile updates	2016-08-30 13:33:20 -07:00
David Lam	e10f7f3dcb	merge	2016-08-30 12:03:36 -07:00
Yann Collet	4ded9e591c	added boilerplate	2016-08-30 11:06:28 -07:00
Yann Collet	3b15f1f10f	minor refactor	2016-08-30 09:58:50 -07:00
Yann Collet	240795bef7	Merge branch 'dev' of github.com:Cyan4973/zstd into dev	2016-08-30 06:51:55 -07:00
Yann Collet	14200a20f0	Fixed issue #304 , reported by @borzunov	2016-08-30 06:51:00 -07:00
David Lam	da9d3b7057	Cleanup some errors in typedef comments and remove duplicated HOWTO from zbuff_decompress.c	2016-08-29 17:31:51 -07:00
Yann Collet	09c3c8e885	Merge pull request #307 from inikep/dev08 updated README.md	2016-08-29 16:32:33 -07:00
inikep	6416b0d705	updated README.md	2016-08-29 13:04:26 +02:00
Yann Collet	23b6e05d8e	ZSTD_malloc() and ZSTD_free(), to simplify customMem	2016-08-28 21:05:43 -07:00
Yann Collet	5f53b0335e	fixed continuation context	2016-08-28 10:00:49 -07:00
Yann Collet	767d8f66fa	legacy contexts can be re-used	2016-08-28 08:19:47 -07:00
Yann Collet	4bf317dd00	first version supporting legacy streams (transparent decoding)	2016-08-28 07:43:34 -07:00
Yann Collet	e19a9ef05d	update compression level table	2016-08-26 20:02:49 +02:00
Yann Collet	9a021c1aae	fixed some minor clang warnings	2016-08-26 09:05:06 +02:00
Yann Collet	cb5a320705	made -Wdocumentation a clang only flag	2016-08-26 08:06:36 +02:00
Yann Collet	87c18b2ebd	fixed multiple minor warnings for XCode	2016-08-26 01:43:47 +02:00
Yann Collet	0d59a6f73a	removed debug strings	2016-08-25 22:42:46 +02:00
Yann Collet	5a02b69215	reinforced fix for huge files	2016-08-25 22:24:59 +02:00
Yann Collet	96bdd87de4	fixed : compression bug on very large files	2016-08-25 14:34:42 +02:00
inikep	a3a47ec4d0	Merge remote-tracking branch 'refs/remotes/Cyan4973/dev' into Other	2016-08-24 21:25:49 +02:00
Yann Collet	a2cdffe556	fixed wrong parameter issue	2016-08-24 19:42:15 +02:00
inikep	e416e30019	remove unnecessary comments	2016-08-24 17:32:09 +02:00
inikep	4e90f6c1e0	removed ZSTD_LOG_ENCODE and ZSTD_LOG_BLOCK	2016-08-24 17:24:11 +02:00
inikep	83388e109f	removed ZSTD_LOG_PARSER	2016-08-24 17:22:20 +02:00
inikep	8a36f8527c	removed stats in debug mode	2016-08-24 17:19:12 +02:00
inikep	57ef4b1a0d	zstd_v07.c: removed unused macros	2016-08-24 17:16:56 +02:00
Yann Collet	24b68a5b23	update cLevel table for 256KB	2016-08-24 14:22:26 +02:00
Yann Collet	c54692faeb	improved level 3	2016-08-24 01:45:46 +02:00
Yann Collet	17e482efdd	added ZSTD_setDStreamParameter()	2016-08-23 16:58:10 +02:00
Yann Collet	3071c3e303	STREAM_WINDOW_MAX : protect streaming from unreasonable memory requirements	2016-08-23 01:34:34 +02:00
Yann Collet	70e3b31306	fixed playtests on os-x	2016-08-23 01:18:06 +02:00
Yann Collet	cb3276329a	added sizeof CStream and DStream	2016-08-23 00:31:59 +02:00
Yann Collet	d1733f7417	fixed crc bug in rare timing conditions within bench.c	2016-08-21 01:04:46 +02:00
Yann Collet	8baf78a291	minor coding style	2016-08-20 13:04:20 +02:00
Yann Collet	1bee2d5e08	slight decompression speed improvement	2016-08-20 02:59:04 +02:00
Yann Collet	0cfe2ec2fd	sync fse version	2016-08-20 00:26:26 +02:00
Yann Collet	af1960396b	sync huff0	2016-08-19 19:38:19 +02:00
Yann Collet	7be46bf8f2	promoted streaming API to stable (except _advanced() variants)	2016-08-19 18:39:36 +02:00
Yann Collet	fdba57d513	update version number	2016-08-19 18:32:30 +02:00
Yann Collet	da3fbcb302	Added ZDICT_getDictID()	2016-08-19 14:23:58 +02:00
Yann Collet	a5dbf9f629	Merge pull request #297 from borzunov/dev Export functions related to dictionary compression from DLL	2016-08-18 15:05:01 +02:00
Yann Collet	49d105cfcf	better warning and error messages in case of dictionary training failure (#292 )	2016-08-18 15:02:11 +02:00
Alexander Borzunov	0f6f17a14f	Rename ZSTDLIB_API to ZDICTLIB_API in zdict.h	2016-08-18 16:47:06 +05:00
Alexander Borzunov	1f48382b1a	Export functions related to dictionary compression from DLL	2016-08-18 16:12:49 +05:00
Yann Collet	e80d15304a	Merge pull request #296 from inikep/Other Other	2016-08-18 11:48:48 +02:00
inikep	a7bb322a93	removed never referenced functions	2016-08-18 10:30:21 +02:00
Yann Collet	18442c1482	minor refactoring	2016-08-18 01:40:32 +02:00
Yann Collet	c411902230	fixed g++ conversion warning	2016-08-17 01:50:54 +02:00
Yann Collet	53e17fbd5e	updated streaming API	2016-08-17 01:39:22 +02:00
Yann Collet	655393cc72	updated doc for streaming API	2016-08-16 15:11:28 +02:00
Yann Collet	104e5b072d	added : streaming decompression API	2016-08-16 15:11:28 +02:00
Yann Collet	5a0c8e2439	new streaming API (compression)	2016-08-16 15:11:27 +02:00
Yann Collet	ba92046031	Merge branch 'dev' of github.com:Cyan4973/zstd into dev	2016-08-11 22:10:04 +02:00
Yann Collet	e9b414d825	fixed msan warning (#281 )	2016-08-11 22:09:09 +02:00
inikep	5f49eba512	added usage of rep[0]-1 for the optimal parser	2016-08-10 15:01:53 +02:00
inikep	98e08cbe34	fixed: tree not updated after finding very long rep matches	2016-08-10 15:00:30 +02:00
inikep	038d1497c9	fixed compilation with Visual Studio 2005	2016-08-10 14:30:10 +02:00
inikep	48849f86f0	fixed compilation with Intel Compiler with Windows	2016-08-10 14:26:35 +02:00
Yann Collet	1ea5622a32	updated xxhash	2016-08-10 09:40:08 +02:00
Yann Collet	666398e7ed	added : xxhash namespace enforced from xxhash.h. added : xxhash namespace test. removed : -DXXH_NAMESPACE	2016-08-10 08:16:51 +02:00
Yann Collet	8ded0b84aa	update xxhash to v0.6.2	2016-08-10 07:40:40 +02:00
Yann Collet	280f9a8754	minor comment	2016-08-08 00:44:00 +02:00
Yann Collet	e0b4a2d40f	fixed dictionary generation, reported by Bartosz Taudul	2016-08-03 03:36:03 +02:00
Yann Collet	ae40b18d55	bumped library number	2016-08-03 01:59:21 +02:00
Yann Collet	0763905f44	ZSTD_compress_usingCDict() correctly provides original size by default in frame header Fixed dictionary examples	2016-08-03 01:57:57 +02:00
Yann Collet	bf2bc112bb	bench : controlled display update when loading lot of files	2016-08-02 23:48:13 +02:00
Yann Collet	346efccc35	fixed doc typo	2016-08-02 14:26:00 +02:00
Yann Collet	f116e87f59	fixed analyzer warning	2016-08-01 19:15:18 +02:00
Yann Collet	9ba929f1d4	Merge branch 'dev' of github.com:Cyan4973/zstd into dev	2016-08-01 02:26:59 +02:00
Yann Collet	3ca750372d	updated doc (#269 )	2016-08-01 02:26:20 +02:00
Yann Collet	c55eb18c11	Merge pull request #267 from inikep/dev08 fixed ZSTD_compressBlock_opt_extDict_generic	2016-07-31 22:00:16 +02:00
inikep	056df510aa	fixed ZSTD_compressBlock_opt_extDict_generic	2016-07-31 20:08:53 +02:00
Yann Collet	917fe188f1	Implemented repOffset "minus 1" on ll==0	2016-07-31 04:01:57 +02:00
Yann Collet	2a2ba3691c	Merge pull request #266 from jrmarino/master Enable build on FreeBSD ports (includes DragonFly BSD) [dev branch]	2016-07-31 02:21:26 +02:00
jrmarino	0d07ec0c0c	Enable build on FreeBSD ports (includes DragonFly BSD) Zstd has been introduced to FreeBSD ports (http://www.freshports.org/archivers/zstd/) which DragonFly BSD also uses. FreeBSD and DragonFly use the install targets (albeit modified in some cases) so they must be added to the associated Makefile filters.	2016-07-30 19:11:15 -05:00
Yann Collet	8cebfd1d26	fix attempt on test-zstd-speed	2016-07-31 01:59:23 +02:00
Yann Collet	66f69e58d2	restore decompression speed on fizzle	2016-07-30 15:32:47 +02:00
Yann Collet	3b2bd1d11c	zstd_opt uses same tables as zstd_compress	2016-07-30 13:21:41 +02:00
Yann Collet	f714f59c16	fixed visual warning	2016-07-30 12:05:28 +02:00
Yann Collet	761f8dbbd2	back to normal table cell copy	2016-07-30 11:43:53 +02:00
Yann Collet	3c6b808870	minor decompression speed gains	2016-07-30 03:20:47 +02:00
Yann Collet	70a9ff4af3	fixed too large selectivity level, reported by Ilona Papava	2016-07-30 01:09:14 +02:00
Yann Collet	c0ce4f1211	slightly improved compression speed	2016-07-30 00:55:13 +02:00
Yann Collet	ed57d8530a	new seqStore	2016-07-29 21:22:17 +02:00
Yann Collet	6b615d32cd	Updated API comments, following suggestions by Bryan O'Sullivan	2016-07-29 19:40:37 +02:00
Yann Collet	c00d30fbe4	Merge pull request #264 from inikep/dev08 Dev08	2016-07-29 17:42:30 +02:00
inikep	6b68ba2079	zstd_opt.h: fixed checking of rep codes (2)	2016-07-29 16:45:39 +02:00
inikep	59b86fc141	zstd_opt.h: fixed checking of rep codes	2016-07-29 11:00:33 +02:00
Yann Collet	6a82f0f8bf	minor comments	2016-07-29 00:55:45 +02:00
Yann Collet	ffa7d0ac1e	clarified comment	2016-07-28 21:01:17 +02:00
Yann Collet	4c5bbf64f9	fixed : frame concatenation without checksum	2016-07-28 20:30:25 +02:00
Yann Collet	60ba31c570	zbuff uses ZSTD_compressEnd()	2016-07-28 19:55:09 +02:00
Yann Collet	16e73033ad	introduced stage zbf_end	2016-07-28 16:32:34 +02:00
Yann Collet	62470b4bab	Changed ZSTD_compressEnd()	2016-07-28 15:29:08 +02:00
Yann Collet	e7bf9156d1	Clarified API comments, from suggestions by ‎Bryan O'Sullivan‎	2016-07-28 05:00:57 +02:00
Yann Collet	d469a98c01	Clarified API comments, from suggestions by ‎Bryan O'Sullivan‎	2016-07-28 04:55:03 +02:00
Yann Collet	19c1002e46	applied ZSTD_compressContinueThenEnd()	2016-07-28 01:25:46 +02:00
Yann Collet	5b56739b63	created ZSTD_compressContinueThenEnd()	2016-07-28 01:17:22 +02:00
Yann Collet	c991cc1828	new frame end, 32-bits checksums	2016-07-28 00:55:43 +02:00
Yann Collet	d4180cad9c	minor code refactoring	2016-07-27 21:21:36 +02:00
Yann Collet	731ef16fc1	minor code style refactoring	2016-07-27 21:05:12 +02:00
Yann Collet	4b9ca0a6b5	minor example variation	2016-07-27 19:53:19 +02:00
Yann Collet	4110534886	ZSTD_maxCLevel() is promoted to "stable" API (#254 , by @FrancescAlted)	2016-07-27 15:09:11 +02:00
Yann Collet	55a8bea0b5	fixed dictionary generation	2016-07-27 14:48:47 +02:00
Yann Collet	c154d9d6a2	better support for large dictionaries (> 128 KB)	2016-07-27 14:37:00 +02:00
Yann Collet	07626dfa51	improved dictbuilder notifications on selectivity	2016-07-27 13:28:46 +02:00
Yann Collet	f796f7ab45	removed fastscan mode	2016-07-27 12:53:54 +02:00
Yann Collet	dd25a27702	added tutorial warning messages for dictBuilder	2016-07-27 12:43:09 +02:00
inikep	003c7a8568	optimal parser: removed ZSTD_REP_INIT	2016-07-27 11:07:13 +02:00
Yann Collet	04cdd8660d	Merge pull request #262 from ebiggers/misc_updates Miscellaneous updates	2016-07-27 01:25:45 +02:00
Eric Biggers	0a55e7a0bb	ZSTD_decompressFrame(): use remainingSize instead of iend - ip Same behavior, but no need to have redundant variables.	2016-07-26 13:22:27 -07:00
Eric Biggers	aa6c70bf60	ZSTD_decompressFrame(): pass up error code from ZSTD_decodeFrameHeader()	2016-07-26 13:22:27 -07:00
Eric Biggers	e4d0265ea9	Replace remaining references to "direct mode" with "single segment mode"	2016-07-26 13:22:27 -07:00
Yann Collet	d50f9db3ea	Improved speed on clang and gcc -O2, thanks to @ebiggers ! (#263 )	2016-07-26 21:30:35 +02:00
Yann Collet	7adc2328a3	fixed --test on zero-length files, reported by @amnilsson	2016-07-26 15:49:24 +02:00
inikep	4178f5c289	fixed gcc warning: always_inline function might not be inlinable	2016-07-25 21:17:45 +02:00
inikep	fca90f8f60	legacy decoder for v0.7 format	2016-07-25 17:49:08 +02:00
Yann Collet	cbc5e9dc19	fixes oob read	2016-07-24 18:02:04 +02:00
Yann Collet	38b75ddeb2	removed special case all-1 huffman distribution	2016-07-24 15:35:59 +02:00
Yann Collet	7ed5e33b89	minor comment changes	2016-07-24 14:26:11 +02:00
Yann Collet	10b9c13d07	fixed doc on cLevel default, reported by Oliver Lange	2016-07-24 01:21:53 +02:00
Yann Collet	f8e7b5363f	unified encoding types	2016-07-23 16:31:49 +02:00
Yann Collet	571a59034a	changed enccoding type order : raw, rle, compressed, repeat-stats	2016-07-23 15:52:05 +02:00
Yann Collet	c2e1a68d81	changed streamNb order to 1-4-4-4	2016-07-22 17:30:52 +02:00
Yann Collet	772d912c2f	more complete support for literals repeat mode	2016-07-22 15:04:25 +02:00
Yann Collet	9f2d82d4a4	fixed : big-endian decoding	2016-07-22 14:37:10 +02:00
Yann Collet	32faf6c8e7	fixed conversion warnings	2016-07-22 14:37:09 +02:00
Yann Collet	5e45a5fbb3	force loop-align to 32 for zstd_decompress	2016-07-22 14:37:09 +02:00
Yann Collet	5288ac0cb7	changed filed order	2016-07-22 14:37:09 +02:00
Yann Collet	198e6aac44	Literals header fields use little endian convention	2016-07-22 14:37:09 +02:00
Yann Collet	6fa05a2371	cBlockSize uses little-endian convention	2016-07-22 14:37:09 +02:00
Yann Collet	7bf72bbf5e	update header to v0.8	2016-07-22 14:37:09 +02:00
Yann Collet	5894ea8d01	updated cLevels	2016-07-22 14:36:46 +02:00
Yann Collet	d5c5a77990	minor comments clarifications	2016-07-20 13:35:14 +02:00
Yann Collet	572b817be3	Merge pull request #253 from gymdis/heapmode_off_legacy_fix Fix compile issue with ZSTD_LEGACY_SUPPORT=1 and ZSTD_HEAPMODE=0	2016-07-19 13:52:03 +02:00
Christopher Bergqvist	780a9fa857	Fix compile issue with ZSTD_LEGACY_SUPPORT=1 and ZSTD_HEAPMODE=0	2016-07-19 13:25:38 +02:00
Yann Collet	cf05b9d477	ZSTD_getBlockSizeMax()	2016-07-18 16:52:10 +02:00
Yann Collet	16aa38b0e0	minor doc clarifications	2016-07-18 03:52:47 +02:00
Yann Collet	85f3919960	moved `zstd.h` to `/lib`	2016-07-17 20:42:21 +02:00
Yann Collet	9375590462	update version to v0.7.5	2016-07-17 16:44:18 +02:00
Yann Collet	e557fd5e92	minor compression level corrections	2016-07-17 16:21:37 +02:00
Yann Collet	d54b2d23b4	minor static assert for 32/64 bits system. Suggested by @ebiggers	2016-07-17 15:53:18 +02:00
Yann Collet	972e5806ee	fixed : premature frame end on zero-sized raw block - reported by @ebiggers	2016-07-17 15:39:24 +02:00
luben karavelov	10f999f856	Add legacy support for the low-level streaming API	2016-07-17 01:03:26 +02:00
Yann Collet	6cacd34d44	minor formatting changes	2016-07-15 17:58:13 +02:00
Yann Collet	f6ff53cd4e	implemented dictID reserved ranges	2016-07-15 17:03:38 +02:00
Yann Collet	98c8884999	added target zstd in root Makefile	2016-07-15 16:12:38 +02:00
Yann Collet	961b6a0e34	ZSTD_compressBlock() limits block size depending on windowLog parameter	2016-07-15 11:58:49 +02:00
Yann Collet	227cc39e15	improved efficiency for large messages with small dictionaries	2016-07-15 11:27:09 +02:00
Yann Collet	ea2ecdc315	fixed issue with small dictionary	2016-07-14 23:27:31 +02:00
Yann Collet	e9ed5cdc94	fixed minor coverity warning	2016-07-14 21:02:57 +02:00
Yann Collet	b23e1ce319	removed debugging traces	2016-07-14 17:46:38 +02:00
Yann Collet	17508f1a16	fixed a few minor coverity warnings	2016-07-14 17:18:20 +02:00
Yann Collet	8847238cac	simplified ZSTD_estimateCCtxSize()	2016-07-14 17:05:38 +02:00
Yann Collet	69c2cdb45c	fixed conversion warning	2016-07-14 16:52:45 +02:00
Yann Collet	5e80dd3261	fixed minor coverity warnings	2016-07-13 19:21:57 +02:00
Yann Collet	3c174f4da9	fixed minor coverity warning	2016-07-13 17:25:53 +02:00
Yann Collet	2b1a3638e6	changed macro name to ZSTDCLI_CLEVEL_DEFAULT	2016-07-13 15:16:00 +02:00
Yann Collet	3c242e79d3	updated compression levels table	2016-07-13 14:56:24 +02:00
Yann Collet	fbc69f8649	changed for #245	2016-07-13 13:52:58 +02:00
Yann Collet	eed2081e55	fixed conversion warning	2016-07-12 15:11:40 +02:00
Yann Collet	a43a854cdb	updated paramgrill	2016-07-12 13:42:10 +02:00
Yann Collet	73d74a05b9	fixed dfast strategy	2016-07-12 13:03:48 +02:00
Yann Collet	45dc35628c	first version of doubleFast	2016-07-12 09:47:31 +02:00
Yann Collet	d158c35e9f	added ZSTD_estimateDCtxSize()	2016-07-11 13:46:25 +02:00
Yann Collet	8e0ee681b8	added ZSTD_sizeofDCtx()	2016-07-11 13:09:52 +02:00
Yann Collet	3ae543ce75	added ZSTD_estimateCCtxSize()	2016-07-11 03:12:17 +02:00
Yann Collet	25c506601c	promote ZSTD_getDecompressedSize() to stable API	2016-07-10 01:46:18 +02:00
Yann Collet	3b6ae77e15	comment clarification	2016-07-08 23:42:22 +02:00
Yann Collet	722e14bb65	fixed compilation error in decompression module	2016-07-08 19:22:16 +02:00
Yann Collet	bd10607063	updated spec	2016-07-08 19:16:57 +02:00
Yann Collet	c5fb5b7fcd	support offset > 128 MB	2016-07-08 13:13:37 +02:00
Yann Collet	ed3845d3fa	introduced ZSTD_WINDOWLOG_MAX_32 (#239 ), suggested by @GregSlazinski	2016-07-08 12:57:10 +02:00
Yann Collet	26f681451f	updated doc	2016-07-08 11:45:08 +02:00
Yann Collet	19c27d27f1	simplified legacy functions, no longer need magic number	2016-07-07 14:40:13 +02:00
Yann Collet	e72efeb0a1	removed "error_public.h" dependency from "zstd.h"	2016-07-07 14:17:40 +02:00
Yann Collet	974f52fc5d	Added "dictionary decompression" example	2016-07-07 14:08:00 +02:00
Yann Collet	e09d38e921	removed `mem.h` dependency from `zbuff.h` (experimental section)	2016-07-07 13:17:37 +02:00
Yann Collet	f323bf7d32	added : ZSTD_getDecompressedSize()	2016-07-07 13:14:21 +02:00
Yann Collet	52c04fe58f	removed `mem.h` dependency from `zstd.h` (experimental section)	2016-07-07 11:53:18 +02:00
Yann Collet	f246cf5423	ZSTD_decompress_usingDDict() compatible with Legacy mode	2016-07-06 20:32:27 +02:00
Yann Collet	29652e2618	sample set limitation closer to 2 GB	2016-07-06 16:25:46 +02:00
Yann Collet	99b045b70a	dictBuilder protection vs huge sample sets (>2 GB)	2016-07-06 16:12:38 +02:00
Yann Collet	445d49d898	fixed conversion warning	2016-07-06 13:27:22 +02:00
Yann Collet	a295b3170f	fixed conversion warning	2016-07-06 13:13:12 +02:00
Yann Collet	517e1ba623	fixed dictBuilder issue with HC levels. Reported by Bartosz Taudul.	2016-07-06 12:35:09 +02:00
Yann Collet	fe07eaa972	simplified ZSTD_decodeSequence()	2016-07-06 02:25:44 +02:00
Yann Collet	9ca73364e6	updated spec	2016-07-05 10:53:38 +02:00
Yann Collet	f9cac7a734	Added GNU separator `--`, to specifies that all following arguments are necessary file names (and not commands). Suggested by @chipturner (#230 )	2016-07-04 18:18:24 +02:00
Yann Collet	23f05ccc6b	updated specifications	2016-07-04 16:13:11 +02:00
Yann Collet	d916c908e0	updated doc	2016-07-04 00:42:58 +02:00
Yann Collet	698cb63305	Updated specifications	2016-07-03 18:49:35 +02:00
Yann Collet	d57dffbe76	ZSTD_storeSeq takes an U32 as offset type	2016-07-03 01:48:26 +02:00
Yann Collet	302ff036f6	simplified repcodes for lazy_extDict	2016-07-03 01:28:16 +02:00
Yann Collet	9634f67107	fix lazy parser	2016-07-03 01:23:58 +02:00
Yann Collet	92d75667e4	fix for fast mode	2016-07-03 01:10:53 +02:00
Yann Collet	5e734ad09b	revert fix	2016-07-02 23:55:34 +02:00
Yann Collet	0d5bf8f06f	fixed risk of segfault on very large files (multiple GB)	2016-07-02 21:39:47 +02:00
Yann Collet	2fa9904844	update specification and comments	2016-07-01 20:55:28 +02:00
Yann Collet	c093208ab8	fix : potential leak (#229 )	2016-06-30 14:07:30 +02:00
Yann Collet	6c6e1751f6	use ZSTD_getParams() to simplify code	2016-06-27 15:28:45 +02:00
Yann Collet	3d2cd7f816	Introduced ZSTD_getParams() bench now uses ZSTD_createCDict_advanced()	2016-06-27 15:12:26 +02:00
Yann Collet	529d9c7dee	updated version to v0.7.2	2016-06-27 10:03:10 +02:00
Yann Collet	d4f4e58ee1	fixed ZSTD_decompressBlock() using multiple blocks	2016-06-27 01:31:35 +02:00
Yann Collet	63b5e7a2ea	Improved comments	2016-06-26 17:42:15 +02:00
Yann Collet	3755eb8fea	fixed strict-aliasing warning on gcc6	2016-06-22 13:15:53 +02:00
Yann Collet	23042929da	Fixed : dictBuilder fails if first sample is too small	2016-06-22 11:05:34 +02:00
Yann Collet	391a128794	fix : segfault in command line during automatic overwrite protection mode	2016-06-21 17:06:25 +02:00
Yann Collet	bda68c253b	refactored ZBUFF_compressEnd() for better maintainability	2016-06-21 15:18:11 +02:00
Yann Collet	aa29226b7c	fix : ZBUFF_compressEnd() gives right amount remaining to flush, including future epilogue	2016-06-21 14:04:57 +02:00
Yann Collet	f15c1cb00c	Fixed : ZBUFF_compressEnd() called multiple times with too small dst buffer (#206 )	2016-06-21 13:11:48 +02:00
Yann Collet	a49e066b26	clarified comments on `ZSTD_compressContinue()`	2016-06-21 11:54:03 +02:00
Yann Collet	d4f38d0dcd	updated library to v0.7.1	2016-06-21 10:15:43 +02:00
Yann Collet	22d76322ce	minor refactor	2016-06-21 08:01:51 +02:00
Yann Collet	a436a529bc	minor : fast_extDict does no longer skip first byte	2016-06-20 23:34:04 +02:00
Yann Collet	4623d11571	new correction, less extreme replacement value	2016-06-20 19:15:37 +02:00
Yann Collet	5477cc25f7	fixed corruption error related to inter-blocks rep-offset	2016-06-20 18:31:25 +02:00
Yann Collet	e4811ba761	Modified : ZSTD_createDDict() accepts dictionary < 8 bytes in pure content mode (reported by @chipturner)	2016-06-19 23:06:54 +02:00
Yann Collet	06d9a73b48	minor refactor, using `WILDCOPY_OVERLENGTH` macro instead of hard-coded 8	2016-06-19 14:27:21 +02:00
Yann Collet	19cab46f2f	Joined `seqStore` initialization at dispatch point	2016-06-17 12:54:52 +02:00
Yann Collet	510cff3570	minor comment change	2016-06-16 16:39:55 +02:00
Yann Collet	4948f270b3	make room for reserved "information bit" in frame header	2016-06-16 15:38:51 +02:00
Yann Collet	23ba41533a	Fixed zstd_opt encoding error with repeat-offsets	2016-06-16 13:20:46 +02:00
Yann Collet	80d033fb43	fixed ptr arithmetic warning	2016-06-16 01:41:50 +02:00
Yann Collet	ad39b7a718	zdict stores standard rep-offset. It can use custom ones, but the proper formula and impact on statistics is not done yet.	2016-06-16 01:14:41 +02:00
Yann Collet	736d419289	strengthened dict loading on decompresson side	2016-06-16 01:05:04 +02:00
Yann Collet	8e36a9c169	decoder restores repOffsets from dictionary	2016-06-16 01:05:04 +02:00
Yann Collet	52a0622beb	RepsCodes are saved into Dict (uncomplete : need decompression to regenerate them)	2016-06-16 01:05:04 +02:00
Yann Collet	efd0b4993a	fixed fuzzer error (inter-block repeated offsets)	2016-06-16 00:53:56 +02:00
Yann Collet	9b998e4d08	Fixed decompression of literals in dictionary mode	2016-06-15 23:11:20 +02:00
Yann Collet	d059092897	fixed conversion warnings	2016-06-14 15:34:24 +02:00
Yann Collet	45c03c564f	fixed corruption with inter-blocks repeated offsets	2016-06-14 13:46:11 +02:00
Yann Collet	4266c0a2fd	adding inter-blocks rep-offsets	2016-06-14 01:49:25 +02:00
Yann Collet	43dfe01919	Check `repIndex` for validity	2016-06-13 21:43:06 +02:00
Yann Collet	18c8f79f3e	fixed gcc warning on uninitialized structure variable	2016-06-12 22:51:52 +02:00
Yann Collet	cd98f93cff	Fixed decompression issue with invalid data	2016-06-11 23:26:22 +02:00
Yann Collet	37fece22e8	enable repeat-entropic-stats mode	2016-06-11 02:52:42 +02:00
Yann Collet	d60a5bf900	Literal decompression builds Huffman tables within shared space (for later re-use)	2016-06-11 02:35:31 +02:00
Yann Collet	237ad4beb3	Added single-stream decompression variant using external DTable	2016-06-11 01:46:03 +02:00
Yann Collet	289bbd52e5	Updated huff0	2016-06-11 01:31:54 +02:00
Yann Collet	1869f7966e	Merge pull request #205 from inikep/dev legacy decoder for v0.6	2016-06-10 17:13:07 +02:00
Yann Collet	0974f681a4	completed `.gitignore`	2016-06-10 14:44:16 +02:00
Yann Collet	9dd12742f3	`litBlockType_t` is an `enum`	2016-06-10 00:12:26 +02:00
inikep	4923222412	fixed warnings from Travis	2016-06-09 20:03:30 +02:00
inikep	4000945a1d	project updated for legacy decoder zstd_v06.c	2016-06-09 18:12:06 +02:00
inikep	bf853d5510	added legacy decoder for v0.6 format	2016-06-09 17:59:18 +02:00
Yann Collet	662a541431	updated huff0 - now generates a common HUF_DTable type for all decoding tables	2016-06-08 11:11:02 +02:00
Yann Collet	302fb53a76	Removed `ZSTD_*_usingPrepared?Ctx()` declaration from public space	2016-06-07 12:16:49 +02:00
Yann Collet	81e13ef7cf	first implementation of the new dictionary API (untested)	2016-06-07 00:51:51 +02:00
Yann Collet	9d504ae85b	Added decoding of RLE blocks	2016-06-06 19:52:35 +02:00
Yann Collet	2cc72f1fd3	fixed initialization issue in bench	2016-06-06 17:50:07 +02:00
Yann Collet	e3d529403d	fixed initialization mismatch in `ZSTD_copyCCtx()`	2016-06-06 11:07:33 +02:00
Yann Collet	142acbdea7	fixed minor visual conversion warning	2016-06-06 00:46:56 +02:00
Yann Collet	673f0d7cdc	new frame format, allowing custom window size	2016-06-06 00:26:38 +02:00
Yann Collet	89703d20fb	reduced dependencies	2016-06-05 01:50:33 +02:00
Yann Collet	51778b7cca	updated README following merging of `*_static.h`	2016-06-05 01:38:10 +02:00
Yann Collet	a91ca620cf	removed `HUF_readStats()` from public space	2016-06-05 01:33:55 +02:00
Yann Collet	d0e2cd15cb	Merged `fse_static` into `fse.h` . Now requires `FSE_STATIC_LINKING_ONLY` macro.	2016-06-05 00:58:01 +02:00
Yann Collet	130fe11394	merged `huf_static.h` into `huf.h` . Requires `HUF_STATIC_LINKING_ONLY` macro.	2016-06-05 00:42:28 +02:00
Yann Collet	dc048d18d3	minor comment (detailing an `#include` motivation)	2016-06-05 00:32:23 +02:00
Yann Collet	49bb0041af	removed `ZSTD_highbit()` from `zstd_internal.h`, as it is only used by `zstd_compress.c`	2016-06-04 20:17:38 +02:00
Yann Collet	d3b7f8d21f	Merged `zstd_static.h` into `zstd.h` . Now requires `ZSTD_STATIC_LINKING_ONLY` macro	2016-06-04 19:47:02 +02:00
Yann Collet	ac110a1f21	Removed ZBUFF internal util function from public area	2016-06-04 19:16:49 +02:00
Yann Collet	5347aee8f7	merged `zbuff_static.h` into `zbuff.h` . Now requires `ZBUFF_STATIC_LINKING_ONLY` macro	2016-06-04 19:12:48 +02:00
Yann Collet	e69b8ccceb	merged `zdict_static.h` into `zdict.h`. Now requires `ZDICT_STATIC_LINKING_ONLY` macro.	2016-06-04 18:56:23 +02:00
Yann Collet	198d127b35	minor comment change (unfinished description of new header format)	2016-06-04 18:40:55 +02:00
Yann Collet	f4f5affdf7	restore ZBUFF full-block-size, for better performance on small input	2016-06-03 23:09:28 +02:00
Yann Collet	ab7b6f1ece	Merge pull request #198 from inikep/dev070 Dev070	2016-06-03 21:37:49 +02:00
inikep	3640396b1a	fixed: deallocation of structures in case of error in ZBUFF_createCCtx and ZBUFF_createDCtx	2016-06-03 16:36:50 +02:00
Yann Collet	fe48775868	minor decoder code refactoring	2016-06-03 15:41:51 +02:00
inikep	2a74609b90	zlibWrapper: ZWRAP_createCCtx and ZWRAP_freeCCtx use custom memory allocation functions	2016-06-03 14:53:51 +02:00
inikep	3763c77f6b	defaultCustomNULL replaced with defaultCustomMem	2016-06-03 13:28:20 +02:00
inikep	36fac00149	removed calloc calls from lib/	2016-06-03 13:23:04 +02:00
inikep	db2f540414	added defaultCustomNULL	2016-06-03 12:56:56 +02:00
inikep	b74a468fad	Merge remote-tracking branch 'refs/remotes/Cyan4973/dev070' into dev070	2016-06-02 22:09:09 +02:00
Yann Collet	923938edde	Added `-Wdeclaration-after-statement` compilation flag	2016-06-02 17:56:00 +02:00
inikep	ff9114aee3	zlibWrapper: added support for custom memory allocation functions	2016-06-02 16:52:36 +02:00
inikep	c4807f4d2f	default custom allocation functions moved to zstd_internal.h	2016-06-02 15:11:39 +02:00
inikep	2866951558	opaque parameter for custom memory allocation functions	2016-06-02 13:04:18 +02:00
inikep	9242816b56	fparamsPtr->windowLog==0 means that a frame is skippable	2016-06-01 18:47:04 +02:00
Yann Collet	70d1301d6e	Changed `ZSTD_adjustCParams()` prototype `ZSTD_adjustCParams()` is now automatically invoked at the end of `ZSTD_getCParams()`	2016-06-01 18:45:34 +02:00
Yann Collet	83c3f4427c	upgraded zbufftest to also test advanced frame parameters no/checksum no/dictID	2016-06-01 17:44:53 +02:00
inikep	13f42d9085	VS2010 project: reverted zstdlib.rc	2016-06-01 14:44:31 +02:00
inikep	5c2771710d	Merge remote-tracking branch 'refs/remotes/Cyan4973/dev070' into dev070 # Conflicts: # .gitignore # lib/decompress/zstd_decompress.c # programs/zbufftest.c	2016-06-01 09:16:11 +02:00
Yann Collet	202844ebd0	fixed zbufftest :	2016-06-01 00:44:36 +02:00
Yann Collet	8e3a36a6db	decompression validates frame content checksum	2016-06-01 00:18:28 +02:00
inikep	a6b942018d	Merge remote-tracking branch 'refs/remotes/origin/dev' into dev070 # Conflicts: # .travis.yml # Makefile # lib/common/zstd_static.h # programs/Makefile # projects/VS2008/zstd/zstd.vcproj # projects/VS2008/zstdlib/zstdlib.vcproj # projects/cmake/lib/CMakeLists.txt # projects/cmake/programs/CMakeLists.txt	2016-06-01 00:07:09 +02:00
Yann Collet	f2a3b6e7b4	added : frame content checksum	2016-05-31 22:23:45 +02:00
inikep	43aa9fe8b3	fixed skippable frame	2016-05-31 19:36:51 +02:00
inikep	f772bf54a5	support for skippable frames	2016-05-31 12:43:46 +02:00
Giuseppe Ottaviano	370b751e24	Expose function to add entropy tables to pre-built dictionary. In some cases a custom dictionary building algorithm tailored for a specific input can be more effective than the one produced by `ZDICT_trainFromBuffer`, but with the current API it's not possible encode the entropy tables into the custom-built dictionary. This commit extracts the logic to add entropy tables to a dictionary from `ZDICT_trainFromBuffer` and exposes it as a function `ZDICT_addEntropyTablesFromBuffer`.	2016-05-30 19:50:09 -07:00
Yann Collet	290aaa7521	Added : ability to manually select the dictionary ID of a newly created dictionary	2016-05-30 21:18:52 +02:00
Yann Collet	30009521d7	fuzzer tests dictBuilder. Added : ability to not store dictID during compression; decompression doesn't check dictID then	2016-05-30 16:17:33 +02:00
Yann Collet	c0a9bf3c2e	minor code refactoring	2016-05-30 04:48:32 +02:00
Yann Collet	c46fb924df	added dictionary ID (incomplete)	2016-05-29 05:01:04 +02:00
Yann Collet	f51e0660f4	Simplified list of `*.c` files	2016-05-29 01:39:19 +02:00
Yann Collet	0c5e8b17ad	moved xxhash to lib/common	2016-05-29 01:06:30 +02:00
inikep	957823f56f	zstdcli: -r (operate recursively on directories) works with dictBuilder and compression	2016-05-25 15:30:55 +02:00

... 34 35 36 37 38 ...

4226 Commits