Commit Graph

143 Commits

Author SHA1 Message Date
Martin Mitas
53f65852be test/spec.txt: Little update.
Somehow we were having little different spec.txt version that the one
from CommonMark repo tag 0.28. But we still pass all its compliance
test suite.
2019-03-11 19:03:34 +02:00
Martin Mitas
685b714453 Move codespan detection from md_analyze_backtick() into...
md_is_code_span(), called from md_collect_marks().

We have to do this at the same time as detecting raw inline HTML to
follow CommonMark priority requirements.

Also it is done very differently now:

When scanning for the closer mark, we remember (the latest) position of
potential closers for all other lengths as well.

This means that:

(1) If we find it, we reduced the task because all subsequent scan shall
begin after the closer.

(2) If we do not find it, then we have to reach the end of the block and
hence we then know (for every allowed marker length) the position of last
such backtick sequence.

(3) That makes the guaranty that any subsequent call with either succeed
in its scan (and reduce the task even further); or that we shall be able
to detect instantly there is no suitable closer.

I.e. every call either reduces the task by O(n) scan (1); or collects
all the data in O(n) because (2) happens at most once; or fails in O(1)
(3).

This makes O(n) guaranty of the function complexity.

Fixes #59.
2019-03-11 13:02:17 +01:00
Martin Mitas
0cb61205b1 Move raw inline HTML detection from md_analyze_lt_qt() into md_collect_marks().
Fixes #58:

For resolving raw inline HTML the function tried closer with all
potential openers, because raw HTML can have '<' inside of an attribute.

However this caused O(n^2) for input like "<><><><><><><>...".

We solved by handling raw HTML in earlier stage, directly in
md_collect_marks(), where we can scan linerary forward.

Fixes #61:

As a side effect, this also fixes the issue that MD_FLAG_NOHTMLSPANS
disabled also recognition of CommonMark autolinks.
2019-03-11 13:02:17 +01:00
Martin Mitáš
8e01a769ea
Implement task lists. (#50)
Fixes #30.
2019-02-10 22:58:42 +01:00
Martin Mitas
d32aa2e076 Fix conflict in parsing permissive autolinks and ordinary links.
The issues is caused by the fact that we do not know exact position
of permissive auto-link in time of md_collect_marks() because there
is no syntax to mark its end on the 1st place.

This causes that eventually, the closer mark in ctx->marks[] can be
out-of-order somewhat.

As a consequence, if some other mark range (e.g. ordinary link)
shadows the auto-link, the closer mark may be left outside the shadowed
range and survive till the phase when we generate the output.

We fix by using an extra mark flag to remember we did really output
the opener mark, and output the closer only in such case.

Fixes #53.
2019-02-09 10:40:52 +01:00
Martin Mitas
67401e7019 md_analyze_inlines: Resolve table cell boundaries before links.
This brings some corner cases closer to cmark-gfm.

Also fixes #51.
2019-02-06 04:31:25 +01:00
Martin Mitas
8fc692badc md_rollback: Do not touch TABLECELLBOUNDARIES chain.
This chain is not normal opener/closer inline mark chain.

Fixes #42.
2018-06-11 18:18:56 +02:00
Martin Mitas
e6e2ea4c5a md_analyze_line: Fix mixing list and table parsing.
If table header underline is not nested the same way as the preceding
line (i.e. the wannabe table header line), then it cannot form a table.

Fixes #41.
2018-06-11 11:43:47 +02:00
Martin Mitas
4ef024fbb7 md_process_inlines: Fix link/image closers spanning over multiple lines.
Fixes #40.
2018-05-29 23:30:02 +02:00
Martin Mitas
7deaccf65d md_is_link_label: Fix if the link label contains just backslash escapes.
The function did not remember the label start line index, leading to bad
consequences.

Fixes #39.
2018-05-29 18:38:51 +02:00
Martin Mitas
bf022cb656 Fix md_split_simple_pairing_mark().
When splitting a mark into two, make sure each of them gets the right
share od dummies for case that we will have to split once more.

Fixes #36.
2018-05-28 21:16:29 +02:00
Martin Mitas
e7b84d65a4 pathological_tests.py: Fix test compatibility with Windows. 2018-05-28 21:09:32 +02:00
Martin Mitas
81e2a5cac2 pathological_tests.py: Test deeply nested lists. 2018-04-12 17:04:12 +02:00
Martin Mitas
0d1a41a4d2 md_build_attr_append_substr: Fix +1 allocation error.
Fixes #33.
2018-03-28 08:21:21 +02:00
Martin Mitas
19b24bdd11 Simplify the pathological test "many references". 2017-08-16 18:16:49 +02:00
Martin Mitas
07cec7dcd6 Add regression test for #24. 2017-08-16 16:34:50 +02:00
Martin Mitas
ee3bee1a5d Upgrade to CommonMark specification 0.28. 2017-08-02 00:38:54 +02:00
Martin Mitas
938460d564 Improve/unify output of test scripts. 2017-07-25 03:25:42 +02:00
Martin Mitas
c52a50a3db pathological_tests.py: Add test for reference definition lookup. 2017-07-25 03:25:42 +02:00
Martin Mitas
c51fb31058 md_analyze_marks: Walk only required range of the marks.
This changes causes that when recursing to analysis of link contents,
only the marks between the link opener and closer are iterated in
md_analyze_marks().

Fixes #22
2017-07-24 23:33:25 +02:00
Martin Mitas
a27aefded9 pathological_tests.py: Allow short option -p as a synonym of --program. 2017-07-24 20:17:50 +02:00
Martin Mitas
f4f7b2230c pathological_tests.py: Allow Windowish line ends. 2017-07-24 20:15:09 +02:00
Martin Mitas
26f14899ed Add pathological_tests.py from cmark. 2017-07-24 20:12:13 +02:00
Martin Mitas
ad4f28bb85 md_analyze_simple_pairing_mark: Fix the "rule of three".
If the first emphasis opener is refused due the rule of three, a previous
opener is examined. However the variable opener_orig_size_module3 was not
(re)set accordingly.

Fixes #21.
2017-07-24 20:09:23 +02:00
Martin Mitas
cfbce75910 Rework ref. def. dictionary.
It now uses FNV1a and we now sort/bsearch only contents of single bucket.
Additionally we fix #20 by disabling the invalid ref. definitions during
hashtable build.
2017-07-18 18:49:52 +02:00
Martin Mitas
f2821cbd8e md_analyze_permissive_email_autolink: Make it compatible with CMark-gfm. 2017-07-14 17:10:45 +02:00
Martin Mitas
1bc7f3a84e render_url_escaped: Fix escaping of ampersand.
This affected generating href attribute if links or src attribute of
images.
2017-07-14 02:24:21 +02:00
Martin Mitas
f3f9404e53 Improve URL autolinks extension.
It is now much more compatible to Cmark-gfm.

With the flag MD_FLAG_PERMISSIVEWWWAUTOLINKS, we now also support the
WWW autolinks (when the http: scheme is omitted).
2017-07-14 02:06:23 +02:00
Martin Mitas
25a156ee1b Implement strikethrough extension. 2017-07-12 23:30:14 +02:00
Martin Mitas
8999e1844a Fix "rule of three" for emphasis resolution (issue #14). 2017-01-04 15:20:46 +01:00
Martin Mitas
c63909df8e When splitting emphasis opener mark, we have to retain 'dummy' marks available for more splitting in the future (issue #15). 2017-01-04 15:06:14 +01:00
Martin Mitas
5271238426 When parsing tables, pipes inside a link/image/code span cannot make cell boundary (issue #7). 2016-12-27 22:52:06 +01:00
Martin Mitas
f9b4cb8f6e md_process_inlines: Fix when an expanded mark shadows some nested marks (issue #11). 2016-12-15 16:47:41 +01:00
Martin Mitas
c235a02ee8 test/coverage.txt: Add some tests for higher code coverage. 2016-12-15 13:18:48 +01:00
Martin Mitas
a725fee3f6 md_enter_child_containers: Fix crash (issue #10).
Calling md_push_container_bytes() may result in ending a current block
which may result in removing some contents from ctx->block_bytes when
removing some lines with link reference definitions.

This in effect means we have to end the block explicitly before storing
the offset into the ctx->block_bytes.
2016-12-14 16:51:24 +01:00
Martin Mitas
ba29d0075e md_is_link_reference_definition: Fix handling of multiline label (issue #9). 2016-12-12 23:31:59 +01:00
Martin Mitas
09ae86095f Handle images more like links.
Remove MD_SPAN_IMG_DETAIL::alt. Instead, the contents of the image is
propagated to the renderer via MD_RENDERER::text() callback.

 * This fixes handling of entities inside the image text (issue #4).
 * It simplifies parsing and, more importantly, it better distingusshes
   what is responsibility of parser or renderer respectively.
 * This allows more flexibility on renderers side. Renderer who do not
 * really support images can just output the image content as any
   other text.

The cost is a renderer into HTML (if it wants to render image contents
into the attribute ALT of the IMG tag), has to handle images with more
care. Typically such renderer has to track whether it is inside an image,
and if so, then render span enter/leave as an empty string.
2016-12-07 23:56:47 +01:00
Martin Mitas
23312d6d65 md_is_html_tag: Fix parsing unquoted attribute value (issue #2). 2016-12-05 11:13:43 +01:00
Martin Mitas
b40d595044 Fix file permissions of python scripts. 2016-12-04 17:01:00 +01:00
Martin Mitas
be7fcc16ff Implement tables.
Note it is implemented as an extension. To enable it, the flag MD_FLAG_TABLES
must be explicitly specified.
2016-11-21 13:39:45 +01:00
Martin Mitas
809e611b3c Migrate to CommonMark pecification 0.27. 2016-11-20 00:57:32 +01:00
Martin Mitas
ef5f230ffa Implement permissive autolinks extensions.
With MD_FLAG_PERMISSIVEURLAUTOLINKS, we treat not overly complicated URLs
as autolinks even without '<' and '>'.

With MD_FLAG_PERMISSIVEEMAILAUTOLINKS, we treat not overly complicated
e-mail addresses as autolinks even without '<', '>' and without the
'mailto:' scheme.

Also expanded md2html utility and tests to cover these.
2016-10-14 19:56:05 +02:00
Martin Mitas
1cfc6a5f42 Incorporate the specification testsuite from CommonMark. 2016-10-11 01:10:11 +02:00