xref: /aosp_15_r20/external/pcre/NEWS (revision 22dc650d8ae982c6770746019a6f94af92b0f024)
1*22dc650dSSadaf EbrahimiNews about PCRE2 releases
2*22dc650dSSadaf Ebrahimi-------------------------
3*22dc650dSSadaf Ebrahimi
4*22dc650dSSadaf Ebrahimi
5*22dc650dSSadaf EbrahimiVersion 10.44 07-June-2024
6*22dc650dSSadaf Ebrahimi--------------------------
7*22dc650dSSadaf Ebrahimi
8*22dc650dSSadaf EbrahimiThis is mostly a bug-fix and tidying release. There is one new function, to set
9*22dc650dSSadaf Ebrahimia maximum size for a compiled pattern. The maximum name length for groups is
10*22dc650dSSadaf Ebrahimiincreased to 128. Some auxiliary files for building under VMS are added.
11*22dc650dSSadaf Ebrahimi
12*22dc650dSSadaf Ebrahimi
13*22dc650dSSadaf EbrahimiVersion 10.43 16-February-2024
14*22dc650dSSadaf Ebrahimi------------------------------
15*22dc650dSSadaf Ebrahimi
16*22dc650dSSadaf EbrahimiThere are quite a lot of changes in this release (see ChangeLog and git log for
17*22dc650dSSadaf Ebrahimia list). Those that are not bugfixes or code tidies are:
18*22dc650dSSadaf Ebrahimi
19*22dc650dSSadaf Ebrahimi* The JIT code no longer supports ARMv5 architecture.
20*22dc650dSSadaf Ebrahimi
21*22dc650dSSadaf Ebrahimi* A new function pcre2_get_match_data_heapframes_size() for finer heap control.
22*22dc650dSSadaf Ebrahimi
23*22dc650dSSadaf Ebrahimi* New option flags to restrict the interaction between ASCII and non-ASCII
24*22dc650dSSadaf Ebrahimi  characters for caseless matching and \d and friends. There are also new
25*22dc650dSSadaf Ebrahimi  pattern constructs to control these flags from within a pattern.
26*22dc650dSSadaf Ebrahimi
27*22dc650dSSadaf Ebrahimi* Upgrade to Unicode 15.0.0.
28*22dc650dSSadaf Ebrahimi
29*22dc650dSSadaf Ebrahimi* Treat a NULL pattern with zero length as an empty string.
30*22dc650dSSadaf Ebrahimi
31*22dc650dSSadaf Ebrahimi* Added support for limited-length variable-length lookbehind assertions, with
32*22dc650dSSadaf Ebrahimi  a default maximum length of 255 characters (same as Perl) but with a function
33*22dc650dSSadaf Ebrahimi  to adjust the limit.
34*22dc650dSSadaf Ebrahimi
35*22dc650dSSadaf Ebrahimi* Support for LoongArch in JIT.
36*22dc650dSSadaf Ebrahimi
37*22dc650dSSadaf Ebrahimi* Perl changed the meaning of (for example) {,3} which did not used to be
38*22dc650dSSadaf Ebrahimi  recognized as a quantifier. Now it means {0,3} and PCRE2 has also changed.
39*22dc650dSSadaf Ebrahimi  Note that {,} is still not a quantifier.
40*22dc650dSSadaf Ebrahimi
41*22dc650dSSadaf Ebrahimi* Following Perl, allow spaces and tabs after { and before } in all Perl-
42*22dc650dSSadaf Ebrahimi  compatible items that use braces, and also around commas in quantifiers. The
43*22dc650dSSadaf Ebrahimi  one exception in PCRE2 is \u{...}, which is from ECMAScript, not Perl, and
44*22dc650dSSadaf Ebrahimi  PCRE2 follows ECMAScript usage.
45*22dc650dSSadaf Ebrahimi
46*22dc650dSSadaf Ebrahimi* Changed the meaning of \w and its synonyms and derivatives (\b and \B) in UCP
47*22dc650dSSadaf Ebrahimi  mode to follow Perl. It now matches characters whose general categories are L
48*22dc650dSSadaf Ebrahimi  or N or whose particular categories are Mn (non-spacing mark) or Pc
49*22dc650dSSadaf Ebrahimi  (combining punctuation).
50*22dc650dSSadaf Ebrahimi
51*22dc650dSSadaf Ebrahimi* Changed the default meaning of [:xdigit:] in UCP mode to follow Perl. It now
52*22dc650dSSadaf Ebrahimi  matches the "fullwidth" versions of hex digits. PCRE2_EXTRA_ASCII_DIGIT can
53*22dc650dSSadaf Ebrahimi  be used to keep it ASCII only.
54*22dc650dSSadaf Ebrahimi
55*22dc650dSSadaf Ebrahimi* Make PCRE2_UCP the default in UTF mode in pcre2grep and add -no_ucp,
56*22dc650dSSadaf Ebrahimi  --case-restrict and --posix-digit.
57*22dc650dSSadaf Ebrahimi
58*22dc650dSSadaf Ebrahimi* Add --group-separator and --no-group-separator to pcre2grep.
59*22dc650dSSadaf Ebrahimi
60*22dc650dSSadaf Ebrahimi
61*22dc650dSSadaf EbrahimiVersion 10.42 11-December-2022
62*22dc650dSSadaf Ebrahimi------------------------------
63*22dc650dSSadaf Ebrahimi
64*22dc650dSSadaf EbrahimiThis is an unexpectedly early release to fix a problem that was introduced in
65*22dc650dSSadaf Ebrahimi10.41. ChangeLog number 19 (GitHub #139) added the default definition of
66*22dc650dSSadaf EbrahimiPCRE2_CALL_CONVENTION to pcre2posix.c instead of pcre2posix.h, which meant that
67*22dc650dSSadaf Ebrahimiprograms including pcre2posix.h but not pcre2.h couldn't compile. A new test
68*22dc650dSSadaf Ebrahimithat checks this case has been added.
69*22dc650dSSadaf Ebrahimi
70*22dc650dSSadaf EbrahimiA couple of other minor issues are also fixed, and a patch for an intermittent
71*22dc650dSSadaf EbrahimiJIT fault is also included. See ChangeLog and the Git log.
72*22dc650dSSadaf Ebrahimi
73*22dc650dSSadaf Ebrahimi
74*22dc650dSSadaf EbrahimiVersion 10.41 06-December-2022
75*22dc650dSSadaf Ebrahimi------------------------------
76*22dc650dSSadaf Ebrahimi
77*22dc650dSSadaf EbrahimiThis is another mainly bug-fixing and code-tidying release. There is one
78*22dc650dSSadaf Ebrahimisignificant upgrade to pcre2grep: it now behaves like GNU grep when matching
79*22dc650dSSadaf Ebrahimimore than one pattern and a later pattern matches at an earlier point in the
80*22dc650dSSadaf Ebrahimisubject when the matched substrings are being identified by colour or by
81*22dc650dSSadaf Ebrahimioffsets.
82*22dc650dSSadaf Ebrahimi
83*22dc650dSSadaf Ebrahimi
84*22dc650dSSadaf EbrahimiVersion 10.40 15-April-2022
85*22dc650dSSadaf Ebrahimi---------------------------
86*22dc650dSSadaf Ebrahimi
87*22dc650dSSadaf EbrahimiThis is mostly a bug-fixing and code-tidying release. However, there are some
88*22dc650dSSadaf Ebrahimiextensions to Unicode property handling:
89*22dc650dSSadaf Ebrahimi
90*22dc650dSSadaf Ebrahimi* Added support for Bidi_Class and a number of binary Unicode properties,
91*22dc650dSSadaf Ebrahimiincluding Bidi_Control.
92*22dc650dSSadaf Ebrahimi
93*22dc650dSSadaf Ebrahimi* A number of changes to script matching for \p and \P:
94*22dc650dSSadaf Ebrahimi
95*22dc650dSSadaf Ebrahimi  (a) Script extensions for a character are now coded as a bitmap instead of
96*22dc650dSSadaf Ebrahimi      a list of script numbers, which should be faster and does not need a
97*22dc650dSSadaf Ebrahimi      loop.
98*22dc650dSSadaf Ebrahimi
99*22dc650dSSadaf Ebrahimi  (b) Added the syntax \p{script:xxx} and \p{script_extensions:xxx} (synonyms
100*22dc650dSSadaf Ebrahimi      sc and scx).
101*22dc650dSSadaf Ebrahimi
102*22dc650dSSadaf Ebrahimi  (c) Changed \p{scriptname} from being the same as \p{sc:scriptname} to being
103*22dc650dSSadaf Ebrahimi      the same as \p{scx:scriptname} because this change happened in Perl at
104*22dc650dSSadaf Ebrahimi      release 5.26.
105*22dc650dSSadaf Ebrahimi
106*22dc650dSSadaf Ebrahimi  (d) The standard Unicode 4-letter abbreviations for script names are now
107*22dc650dSSadaf Ebrahimi      recognized.
108*22dc650dSSadaf Ebrahimi
109*22dc650dSSadaf Ebrahimi  (e) In accordance with Unicode and Perl's "loose matching" rules, spaces,
110*22dc650dSSadaf Ebrahimi      hyphens, and underscores are ignored in property names, which are then
111*22dc650dSSadaf Ebrahimi      matched independent of case.
112*22dc650dSSadaf Ebrahimi
113*22dc650dSSadaf EbrahimiAs always, see ChangeLog for a list of all changes (also the Git log).
114*22dc650dSSadaf Ebrahimi
115*22dc650dSSadaf Ebrahimi
116*22dc650dSSadaf EbrahimiVersion 10.39 29-October-2021
117*22dc650dSSadaf Ebrahimi-----------------------------
118*22dc650dSSadaf Ebrahimi
119*22dc650dSSadaf EbrahimiThis release is happening soon after 10.38 because the bug fix is important.
120*22dc650dSSadaf Ebrahimi
121*22dc650dSSadaf Ebrahimi1. Fix incorrect detection of alternatives in first character search in JIT.
122*22dc650dSSadaf Ebrahimi
123*22dc650dSSadaf Ebrahimi2. Update to Unicode 14.0.0.
124*22dc650dSSadaf Ebrahimi
125*22dc650dSSadaf Ebrahimi3. Some code cleanups (see ChangeLog).
126*22dc650dSSadaf Ebrahimi
127*22dc650dSSadaf Ebrahimi
128*22dc650dSSadaf EbrahimiVersion 10.38 01-October-2021
129*22dc650dSSadaf Ebrahimi-----------------------------
130*22dc650dSSadaf Ebrahimi
131*22dc650dSSadaf EbrahimiAs well as some bug fixes and tidies (as always, see ChangeLog for details),
132*22dc650dSSadaf Ebrahimithe documentation is updated to list the new URLs, following the move of the
133*22dc650dSSadaf Ebrahimisource repository to GitHub and the mailing list to Google Groups.
134*22dc650dSSadaf Ebrahimi
135*22dc650dSSadaf Ebrahimi* The CMake build system can now build both static and shared libraries in one
136*22dc650dSSadaf Ebrahimigo.
137*22dc650dSSadaf Ebrahimi
138*22dc650dSSadaf Ebrahimi* Following Perl's lead, \K is now locked out in lookaround assertions by
139*22dc650dSSadaf Ebrahimidefault, but an option is provided to re-enable the previous behaviour.
140*22dc650dSSadaf Ebrahimi
141*22dc650dSSadaf Ebrahimi
142*22dc650dSSadaf EbrahimiVersion 10.37 26-May-2021
143*22dc650dSSadaf Ebrahimi-------------------------
144*22dc650dSSadaf Ebrahimi
145*22dc650dSSadaf EbrahimiA few more bug fixes and tidies. The only change of real note is the removal of
146*22dc650dSSadaf Ebrahimithe actual POSIX names regcomp etc. from the POSIX wrapper library because
147*22dc650dSSadaf Ebrahimithese have caused issues for some applications (see 10.33 #2 below).
148*22dc650dSSadaf Ebrahimi
149*22dc650dSSadaf Ebrahimi
150*22dc650dSSadaf EbrahimiVersion 10.36 04-December-2020
151*22dc650dSSadaf Ebrahimi------------------------------
152*22dc650dSSadaf Ebrahimi
153*22dc650dSSadaf EbrahimiAgain, mainly bug fixes and tidies. The only enhancements are the addition of
154*22dc650dSSadaf EbrahimiGNU grep's -m (aka --max-count) option to pcre2grep, and also unifying the
155*22dc650dSSadaf Ebrahimihandling of substitution strings for both -O and callouts in pcre2grep, with
156*22dc650dSSadaf Ebrahimithe addition of $x{...} and $o{...} to allow for characters whose code points
157*22dc650dSSadaf Ebrahimiare greater than 255 in Unicode mode.
158*22dc650dSSadaf Ebrahimi
159*22dc650dSSadaf EbrahimiNOTE: there is an outstanding issue with JIT support for MacOS on arm64
160*22dc650dSSadaf Ebrahimihardware. For details, please see Bugzilla issue #2618.
161*22dc650dSSadaf Ebrahimi
162*22dc650dSSadaf Ebrahimi
163*22dc650dSSadaf EbrahimiVersion 10.35 15-April-2020
164*22dc650dSSadaf Ebrahimi---------------------------
165*22dc650dSSadaf Ebrahimi
166*22dc650dSSadaf EbrahimiBugfixes, tidies, and a few new enhancements.
167*22dc650dSSadaf Ebrahimi
168*22dc650dSSadaf Ebrahimi1. Capturing groups that contain recursive backreferences to themselves are no
169*22dc650dSSadaf Ebrahimilonger automatically atomic, because the restriction is no longer necessary
170*22dc650dSSadaf Ebrahimias a result of the 10.30 restructuring.
171*22dc650dSSadaf Ebrahimi
172*22dc650dSSadaf Ebrahimi2. Several new options for pcre2_substitute().
173*22dc650dSSadaf Ebrahimi
174*22dc650dSSadaf Ebrahimi3. When Unicode is supported and PCRE2_UCP is set without PCRE2_UTF, Unicode
175*22dc650dSSadaf Ebrahimicharacter properties are used for upper/lower case computations on characters
176*22dc650dSSadaf Ebrahimiwhose code points are greater than 127.
177*22dc650dSSadaf Ebrahimi
178*22dc650dSSadaf Ebrahimi4. The character tables (for low-valued characters) can now more easily be
179*22dc650dSSadaf Ebrahimisaved and restored in binary.
180*22dc650dSSadaf Ebrahimi
181*22dc650dSSadaf Ebrahimi5. Updated to Unicode 13.0.0.
182*22dc650dSSadaf Ebrahimi
183*22dc650dSSadaf Ebrahimi
184*22dc650dSSadaf EbrahimiVersion 10.34 21-November-2019
185*22dc650dSSadaf Ebrahimi------------------------------
186*22dc650dSSadaf Ebrahimi
187*22dc650dSSadaf EbrahimiAnother release with a few enhancements as well as bugfixes and tidies. The
188*22dc650dSSadaf Ebrahimimain new features are:
189*22dc650dSSadaf Ebrahimi
190*22dc650dSSadaf Ebrahimi1. There is now some support for matching in invalid UTF strings.
191*22dc650dSSadaf Ebrahimi
192*22dc650dSSadaf Ebrahimi2. Non-atomic positive lookarounds are implemented in the pcre2_match()
193*22dc650dSSadaf Ebrahimiinterpreter, but not in JIT.
194*22dc650dSSadaf Ebrahimi
195*22dc650dSSadaf Ebrahimi3. Added two new functions: pcre2_get_match_data_size() and
196*22dc650dSSadaf Ebrahimipcre2_maketables_free().
197*22dc650dSSadaf Ebrahimi
198*22dc650dSSadaf Ebrahimi4. Upgraded to Unicode 12.1.0.
199*22dc650dSSadaf Ebrahimi
200*22dc650dSSadaf Ebrahimi
201*22dc650dSSadaf EbrahimiVersion 10.33 16-April-2019
202*22dc650dSSadaf Ebrahimi---------------------------
203*22dc650dSSadaf Ebrahimi
204*22dc650dSSadaf EbrahimiYet more bugfixes, tidies, and a few enhancements, summarized here (see
205*22dc650dSSadaf EbrahimiChangeLog for the full list):
206*22dc650dSSadaf Ebrahimi
207*22dc650dSSadaf Ebrahimi1. Callouts from pcre2_substitute() are now available.
208*22dc650dSSadaf Ebrahimi
209*22dc650dSSadaf Ebrahimi2. The POSIX functions are now all called pcre2_regcomp() etc., with wrapper
210*22dc650dSSadaf Ebrahimifunctions that use the standard POSIX names. However, in pcre2posix.h the POSIX
211*22dc650dSSadaf Ebrahiminames are defined as macros. This should help avoid linking with the wrong
212*22dc650dSSadaf Ebrahimilibrary in some environments, while still exporting the POSIX names for
213*22dc650dSSadaf Ebrahimipre-existing programs that use them.
214*22dc650dSSadaf Ebrahimi
215*22dc650dSSadaf Ebrahimi3. Some new options:
216*22dc650dSSadaf Ebrahimi
217*22dc650dSSadaf Ebrahimi   (a) PCRE2_EXTRA_ESCAPED_CR_IS_LF makes \r behave as \n.
218*22dc650dSSadaf Ebrahimi
219*22dc650dSSadaf Ebrahimi   (b) PCRE2_EXTRA_ALT_BSUX enables support for ECMAScript 6's \u{hh...}
220*22dc650dSSadaf Ebrahimi       construct.
221*22dc650dSSadaf Ebrahimi
222*22dc650dSSadaf Ebrahimi   (c) PCRE2_COPY_MATCHED_SUBJECT causes a copy of a matched subject to be
223*22dc650dSSadaf Ebrahimi       made, instead of just remembering a pointer.
224*22dc650dSSadaf Ebrahimi
225*22dc650dSSadaf Ebrahimi4. Some new Perl features:
226*22dc650dSSadaf Ebrahimi
227*22dc650dSSadaf Ebrahimi   (a) Perl 5.28's experimental alphabetic names for atomic groups and
228*22dc650dSSadaf Ebrahimi       lookaround assertions, for example, (*pla:...) and (*atomic:...).
229*22dc650dSSadaf Ebrahimi
230*22dc650dSSadaf Ebrahimi   (b) The new Perl "script run" features (*script_run:...) and
231*22dc650dSSadaf Ebrahimi       (*atomic_script_run:...) aka (*sr:...) and (*asr:...).
232*22dc650dSSadaf Ebrahimi
233*22dc650dSSadaf Ebrahimi   (c) When PCRE2_UTF is set, allow non-ASCII letters and decimal digits in
234*22dc650dSSadaf Ebrahimi       capture group names.
235*22dc650dSSadaf Ebrahimi
236*22dc650dSSadaf Ebrahimi5. --disable-percent-zt disables the use of %zu and %td in formatting strings
237*22dc650dSSadaf Ebrahimiin pcre2test. They were already automatically disabled for VC and older C
238*22dc650dSSadaf Ebrahimicompilers.
239*22dc650dSSadaf Ebrahimi
240*22dc650dSSadaf Ebrahimi6. Some changes related to callouts in pcre2grep:
241*22dc650dSSadaf Ebrahimi
242*22dc650dSSadaf Ebrahimi   (a) Support for running an external program under VMS has been added, in
243*22dc650dSSadaf Ebrahimi       addition to Windows and fork() support.
244*22dc650dSSadaf Ebrahimi
245*22dc650dSSadaf Ebrahimi   (b) --disable-pcre2grep-callout-fork restricts the callout support in
246*22dc650dSSadaf Ebrahimi       to the inbuilt echo facility.
247*22dc650dSSadaf Ebrahimi
248*22dc650dSSadaf Ebrahimi
249*22dc650dSSadaf EbrahimiVersion 10.32 10-September-2018
250*22dc650dSSadaf Ebrahimi-------------------------------
251*22dc650dSSadaf Ebrahimi
252*22dc650dSSadaf EbrahimiThis is another mainly bugfix and tidying release with a few minor
253*22dc650dSSadaf Ebrahimienhancements. These are the main ones:
254*22dc650dSSadaf Ebrahimi
255*22dc650dSSadaf Ebrahimi1. pcre2grep now supports the inclusion of binary zeros in patterns that are
256*22dc650dSSadaf Ebrahimiread from files via the -f option.
257*22dc650dSSadaf Ebrahimi
258*22dc650dSSadaf Ebrahimi2. ./configure now supports --enable-jit=auto, which automatically enables JIT
259*22dc650dSSadaf Ebrahimiif the hardware supports it.
260*22dc650dSSadaf Ebrahimi
261*22dc650dSSadaf Ebrahimi3. In pcre2_dfa_match(), internal recursive calls no longer use the stack for
262*22dc650dSSadaf Ebrahimilocal workspace and local ovectors. Instead, an initial block of stack is
263*22dc650dSSadaf Ebrahimireserved, but if this is insufficient, heap memory is used. The heap limit
264*22dc650dSSadaf Ebrahimiparameter now applies to pcre2_dfa_match().
265*22dc650dSSadaf Ebrahimi
266*22dc650dSSadaf Ebrahimi4. Updated to Unicode version 11.0.0.
267*22dc650dSSadaf Ebrahimi
268*22dc650dSSadaf Ebrahimi5. (*ACCEPT:ARG), (*FAIL:ARG), and (*COMMIT:ARG) are now supported.
269*22dc650dSSadaf Ebrahimi
270*22dc650dSSadaf Ebrahimi6. Added support for \N{U+dddd}, but only in Unicode mode.
271*22dc650dSSadaf Ebrahimi
272*22dc650dSSadaf Ebrahimi7. Added support for (?^) to unset all imnsx options.
273*22dc650dSSadaf Ebrahimi
274*22dc650dSSadaf Ebrahimi
275*22dc650dSSadaf EbrahimiVersion 10.31 12-February-2018
276*22dc650dSSadaf Ebrahimi------------------------------
277*22dc650dSSadaf Ebrahimi
278*22dc650dSSadaf EbrahimiThis is mainly a bugfix and tidying release (see ChangeLog for full details).
279*22dc650dSSadaf EbrahimiHowever, there are some minor enhancements.
280*22dc650dSSadaf Ebrahimi
281*22dc650dSSadaf Ebrahimi1. New pcre2_config() options: PCRE2_CONFIG_NEVER_BACKSLASH_C and
282*22dc650dSSadaf EbrahimiPCRE2_CONFIG_COMPILED_WIDTHS.
283*22dc650dSSadaf Ebrahimi
284*22dc650dSSadaf Ebrahimi2. New pcre2_pattern_info() option PCRE2_INFO_EXTRAOPTIONS to retrieve the
285*22dc650dSSadaf Ebrahimiextra compile time options.
286*22dc650dSSadaf Ebrahimi
287*22dc650dSSadaf Ebrahimi3. There are now public names for all the pcre2_compile() error numbers.
288*22dc650dSSadaf Ebrahimi
289*22dc650dSSadaf Ebrahimi4. Added PCRE2_CALLOUT_STARTMATCH and PCRE2_CALLOUT_BACKTRACK bits to a new
290*22dc650dSSadaf Ebrahimifield callout_flags in callout blocks.
291*22dc650dSSadaf Ebrahimi
292*22dc650dSSadaf Ebrahimi
293*22dc650dSSadaf EbrahimiVersion 10.30 14-August-2017
294*22dc650dSSadaf Ebrahimi----------------------------
295*22dc650dSSadaf Ebrahimi
296*22dc650dSSadaf EbrahimiThe full list of changes that includes bugfixes and tidies is, as always, in
297*22dc650dSSadaf EbrahimiChangeLog. These are the most important new features:
298*22dc650dSSadaf Ebrahimi
299*22dc650dSSadaf Ebrahimi1. The main interpreter, pcre2_match(), has been refactored into a new version
300*22dc650dSSadaf Ebrahimithat does not use recursive function calls (and therefore the system stack) for
301*22dc650dSSadaf Ebrahimiremembering backtracking positions. This makes --disable-stack-for-recursion a
302*22dc650dSSadaf EbrahimiNOOP. The new implementation allows backtracking into recursive group calls in
303*22dc650dSSadaf Ebrahimipatterns, making it more compatible with Perl, and also fixes some other
304*22dc650dSSadaf Ebrahimipreviously hard-to-do issues. For patterns that have a lot of backtracking, the
305*22dc650dSSadaf Ebrahimiheap is now used, and there is an explicit limit on the amount, settable by
306*22dc650dSSadaf Ebrahimipcre2_set_heap_limit() or (*LIMIT_HEAP=xxx). The "recursion limit" is retained,
307*22dc650dSSadaf Ebrahimibut is renamed as "depth limit" (though the old names remain for
308*22dc650dSSadaf Ebrahimicompatibility).
309*22dc650dSSadaf Ebrahimi
310*22dc650dSSadaf EbrahimiThere is also a change in the way callouts from pcre2_match() are handled. The
311*22dc650dSSadaf Ebrahimioffset_vector field in the callout block is no longer a pointer to the
312*22dc650dSSadaf Ebrahimiactual ovector that was passed to the matching function in the match data
313*22dc650dSSadaf Ebrahimiblock. Instead it points to an internal ovector of a size large enough to hold
314*22dc650dSSadaf Ebrahimiall possible captured substrings in the pattern.
315*22dc650dSSadaf Ebrahimi
316*22dc650dSSadaf Ebrahimi2. The new option PCRE2_ENDANCHORED insists that a pattern match must end at
317*22dc650dSSadaf Ebrahimithe end of the subject.
318*22dc650dSSadaf Ebrahimi
319*22dc650dSSadaf Ebrahimi3. The new option PCRE2_EXTENDED_MORE implements Perl's /xx feature, and
320*22dc650dSSadaf Ebrahimipcre2test is upgraded to support it. Setting within the pattern by (?xx) is
321*22dc650dSSadaf Ebrahimialso supported.
322*22dc650dSSadaf Ebrahimi
323*22dc650dSSadaf Ebrahimi4. (?n) can be used to set PCRE2_NO_AUTO_CAPTURE, because Perl now has this.
324*22dc650dSSadaf Ebrahimi
325*22dc650dSSadaf Ebrahimi5. Additional compile options in the compile context are now available, and the
326*22dc650dSSadaf Ebrahimifirst two are: PCRE2_EXTRA_ALLOW_SURROGATE_ESCAPES and
327*22dc650dSSadaf EbrahimiPCRE2_EXTRA_BAD_ESCAPE_IS_LITERAL.
328*22dc650dSSadaf Ebrahimi
329*22dc650dSSadaf Ebrahimi6. The newline type PCRE2_NEWLINE_NUL is now available.
330*22dc650dSSadaf Ebrahimi
331*22dc650dSSadaf Ebrahimi7. The match limit value now also applies to pcre2_dfa_match() as there are
332*22dc650dSSadaf Ebrahimipatterns that can use up a lot of resources without necessarily recursing very
333*22dc650dSSadaf Ebrahimideeply.
334*22dc650dSSadaf Ebrahimi
335*22dc650dSSadaf Ebrahimi8. The option REG_PEND (a GNU extension) is now available for the POSIX
336*22dc650dSSadaf Ebrahimiwrapper. Also there is a new option PCRE2_LITERAL which is used to support
337*22dc650dSSadaf EbrahimiREG_NOSPEC.
338*22dc650dSSadaf Ebrahimi
339*22dc650dSSadaf Ebrahimi9. PCRE2_EXTRA_MATCH_LINE and PCRE2_EXTRA_MATCH_WORD are implemented for the
340*22dc650dSSadaf Ebrahimibenefit of pcre2grep, and pcre2grep's -F, -w, and -x options are re-implemented
341*22dc650dSSadaf Ebrahimiusing PCRE2_LITERAL, PCRE2_EXTRA_MATCH_WORD, and PCRE2_EXTRA_MATCH_LINE. This
342*22dc650dSSadaf Ebrahimiis tidier and also fixes some bugs.
343*22dc650dSSadaf Ebrahimi
344*22dc650dSSadaf Ebrahimi10. The Unicode tables are upgraded from Unicode 8.0.0 to Unicode 10.0.0.
345*22dc650dSSadaf Ebrahimi
346*22dc650dSSadaf Ebrahimi11. There are some experimental functions for converting foreign patterns
347*22dc650dSSadaf Ebrahimi(globs and POSIX patterns) into PCRE2 patterns.
348*22dc650dSSadaf Ebrahimi
349*22dc650dSSadaf Ebrahimi
350*22dc650dSSadaf EbrahimiVersion 10.23 14-February-2017
351*22dc650dSSadaf Ebrahimi------------------------------
352*22dc650dSSadaf Ebrahimi
353*22dc650dSSadaf Ebrahimi1. ChangeLog has the details of a lot of bug fixes and tidies.
354*22dc650dSSadaf Ebrahimi
355*22dc650dSSadaf Ebrahimi2. There has been a major re-factoring of the pcre2_compile.c file. Most syntax
356*22dc650dSSadaf Ebrahimichecking is now done in the pre-pass that identifies capturing groups. This has
357*22dc650dSSadaf Ebrahimireduced the amount of duplication and made the code tidier. While doing this,
358*22dc650dSSadaf Ebrahimisome minor bugs and Perl incompatibilities were fixed (see ChangeLog for
359*22dc650dSSadaf Ebrahimidetails.)
360*22dc650dSSadaf Ebrahimi
361*22dc650dSSadaf Ebrahimi3. Back references are now permitted in lookbehind assertions when there are
362*22dc650dSSadaf Ebrahimino duplicated group numbers (that is, (?| has not been used), and, if the
363*22dc650dSSadaf Ebrahimireference is by name, there is only one group of that name. The referenced
364*22dc650dSSadaf Ebrahimigroup must, of course be of fixed length.
365*22dc650dSSadaf Ebrahimi
366*22dc650dSSadaf Ebrahimi4. \g{+<number>} (e.g. \g{+2} ) is now supported. It is a "forward back
367*22dc650dSSadaf Ebrahimireference" and can be useful in repetitions (compare \g{-<number>} ). Perl does
368*22dc650dSSadaf Ebrahiminot recognize this syntax.
369*22dc650dSSadaf Ebrahimi
370*22dc650dSSadaf Ebrahimi5. pcre2grep now automatically expands its buffer up to a maximum set by
371*22dc650dSSadaf Ebrahimi--max-buffer-size.
372*22dc650dSSadaf Ebrahimi
373*22dc650dSSadaf Ebrahimi6. The -t option (grand total) has been added to pcre2grep.
374*22dc650dSSadaf Ebrahimi
375*22dc650dSSadaf Ebrahimi7. A new function called pcre2_code_copy_with_tables() exists to copy a
376*22dc650dSSadaf Ebrahimicompiled pattern along with a private copy of the character tables that is
377*22dc650dSSadaf Ebrahimiuses.
378*22dc650dSSadaf Ebrahimi
379*22dc650dSSadaf Ebrahimi8. A user supplied a number of patches to upgrade pcre2grep under Windows and
380*22dc650dSSadaf Ebrahimitidy the code.
381*22dc650dSSadaf Ebrahimi
382*22dc650dSSadaf Ebrahimi9. Several updates have been made to pcre2test and test scripts (see
383*22dc650dSSadaf EbrahimiChangeLog).
384*22dc650dSSadaf Ebrahimi
385*22dc650dSSadaf Ebrahimi
386*22dc650dSSadaf EbrahimiVersion 10.22 29-July-2016
387*22dc650dSSadaf Ebrahimi--------------------------
388*22dc650dSSadaf Ebrahimi
389*22dc650dSSadaf Ebrahimi1. ChangeLog has the details of a number of bug fixes.
390*22dc650dSSadaf Ebrahimi
391*22dc650dSSadaf Ebrahimi2. The POSIX wrapper function regcomp() did not used to support back references
392*22dc650dSSadaf Ebrahimiand subroutine calls if called with the REG_NOSUB option. It now does.
393*22dc650dSSadaf Ebrahimi
394*22dc650dSSadaf Ebrahimi3. A new function, pcre2_code_copy(), is added, to make a copy of a compiled
395*22dc650dSSadaf Ebrahimipattern.
396*22dc650dSSadaf Ebrahimi
397*22dc650dSSadaf Ebrahimi4. Support for string callouts is added to pcre2grep.
398*22dc650dSSadaf Ebrahimi
399*22dc650dSSadaf Ebrahimi5. Added the PCRE2_NO_JIT option to pcre2_match().
400*22dc650dSSadaf Ebrahimi
401*22dc650dSSadaf Ebrahimi6. The pcre2_get_error_message() function now returns with a negative error
402*22dc650dSSadaf Ebrahimicode if the error number it is given is unknown.
403*22dc650dSSadaf Ebrahimi
404*22dc650dSSadaf Ebrahimi7. Several updates have been made to pcre2test and test scripts (see
405*22dc650dSSadaf EbrahimiChangeLog).
406*22dc650dSSadaf Ebrahimi
407*22dc650dSSadaf Ebrahimi
408*22dc650dSSadaf EbrahimiVersion 10.21 12-January-2016
409*22dc650dSSadaf Ebrahimi-----------------------------
410*22dc650dSSadaf Ebrahimi
411*22dc650dSSadaf Ebrahimi1. Many bugs have been fixed. A large number of them were provoked only by very
412*22dc650dSSadaf Ebrahimistrange pattern input, and were discovered by fuzzers. Some others were
413*22dc650dSSadaf Ebrahimidiscovered by code auditing. See ChangeLog for details.
414*22dc650dSSadaf Ebrahimi
415*22dc650dSSadaf Ebrahimi2. The Unicode tables have been updated to Unicode version 8.0.0.
416*22dc650dSSadaf Ebrahimi
417*22dc650dSSadaf Ebrahimi3. For Perl compatibility in EBCDIC environments, ranges such as a-z in a
418*22dc650dSSadaf Ebrahimiclass, where both values are literal letters in the same case, omit the
419*22dc650dSSadaf Ebrahiminon-letter EBCDIC code points within the range.
420*22dc650dSSadaf Ebrahimi
421*22dc650dSSadaf Ebrahimi4. There have been a number of enhancements to the pcre2_substitute() function,
422*22dc650dSSadaf Ebrahimigiving more flexibility to replacement facilities. It is now also possible to
423*22dc650dSSadaf Ebrahimicause the function to return the needed buffer size if the one given is too
424*22dc650dSSadaf Ebrahimismall.
425*22dc650dSSadaf Ebrahimi
426*22dc650dSSadaf Ebrahimi5. The PCRE2_ALT_VERBNAMES option causes the "name" parts of special verbs such
427*22dc650dSSadaf Ebrahimias (*THEN:name) to be processed for backslashes and to take note of
428*22dc650dSSadaf EbrahimiPCRE2_EXTENDED.
429*22dc650dSSadaf Ebrahimi
430*22dc650dSSadaf Ebrahimi6. PCRE2_INFO_HASBACKSLASHC makes it possible for a client to find out if a
431*22dc650dSSadaf Ebrahimipattern uses \C, and --never-backslash-C makes it possible to compile a version
432*22dc650dSSadaf EbrahimiPCRE2 in which the use of \C is always forbidden.
433*22dc650dSSadaf Ebrahimi
434*22dc650dSSadaf Ebrahimi7. A limit to the length of pattern that can be handled can now be set by
435*22dc650dSSadaf Ebrahimicalling pcre2_set_max_pattern_length().
436*22dc650dSSadaf Ebrahimi
437*22dc650dSSadaf Ebrahimi8. When matching an unanchored pattern, a match can be required to begin within
438*22dc650dSSadaf Ebrahimia given number of code units after the start of the subject by calling
439*22dc650dSSadaf Ebrahimipcre2_set_offset_limit().
440*22dc650dSSadaf Ebrahimi
441*22dc650dSSadaf Ebrahimi9. The pcre2test program has been extended to test new facilities, and it can
442*22dc650dSSadaf Ebrahiminow run the tests when LF on its own is not a valid newline sequence.
443*22dc650dSSadaf Ebrahimi
444*22dc650dSSadaf Ebrahimi10. The RunTest script has also been updated to enable more tests to be run.
445*22dc650dSSadaf Ebrahimi
446*22dc650dSSadaf Ebrahimi11. There have been some minor performance enhancements.
447*22dc650dSSadaf Ebrahimi
448*22dc650dSSadaf Ebrahimi
449*22dc650dSSadaf EbrahimiVersion 10.20 30-June-2015
450*22dc650dSSadaf Ebrahimi--------------------------
451*22dc650dSSadaf Ebrahimi
452*22dc650dSSadaf Ebrahimi1. Callouts with string arguments and the pcre2_callout_enumerate() function
453*22dc650dSSadaf Ebrahimihave been implemented.
454*22dc650dSSadaf Ebrahimi
455*22dc650dSSadaf Ebrahimi2. The PCRE2_NEVER_BACKSLASH_C option, which locks out the use of \C, is added.
456*22dc650dSSadaf Ebrahimi
457*22dc650dSSadaf Ebrahimi3. The PCRE2_ALT_CIRCUMFLEX option lets ^ match after a newline at the end of a
458*22dc650dSSadaf Ebrahimisubject in multiline mode.
459*22dc650dSSadaf Ebrahimi
460*22dc650dSSadaf Ebrahimi4. The way named subpatterns are handled has been refactored. The previous
461*22dc650dSSadaf Ebrahimiapproach had several bugs.
462*22dc650dSSadaf Ebrahimi
463*22dc650dSSadaf Ebrahimi5. The handling of \c in EBCDIC environments has been changed to conform to the
464*22dc650dSSadaf Ebrahimiperlebcdic document. This is an incompatible change.
465*22dc650dSSadaf Ebrahimi
466*22dc650dSSadaf Ebrahimi6. Bugs have been mended, many of them discovered by fuzzers.
467*22dc650dSSadaf Ebrahimi
468*22dc650dSSadaf Ebrahimi
469*22dc650dSSadaf EbrahimiVersion 10.10 06-March-2015
470*22dc650dSSadaf Ebrahimi---------------------------
471*22dc650dSSadaf Ebrahimi
472*22dc650dSSadaf Ebrahimi1. Serialization and de-serialization functions have been added to the API,
473*22dc650dSSadaf Ebrahimimaking it possible to save and restore sets of compiled patterns, though
474*22dc650dSSadaf Ebrahimirestoration must be done in the same environment that was used for compilation.
475*22dc650dSSadaf Ebrahimi
476*22dc650dSSadaf Ebrahimi2. The (*NO_JIT) feature has been added; this makes it possible for a pattern
477*22dc650dSSadaf Ebrahimicreator to specify that JIT is not to be used.
478*22dc650dSSadaf Ebrahimi
479*22dc650dSSadaf Ebrahimi3. A number of bugs have been fixed. In particular, bugs that caused building
480*22dc650dSSadaf Ebrahimion Windows using CMake to fail have been mended.
481*22dc650dSSadaf Ebrahimi
482*22dc650dSSadaf Ebrahimi
483*22dc650dSSadaf EbrahimiVersion 10.00 05-January-2015
484*22dc650dSSadaf Ebrahimi-----------------------------
485*22dc650dSSadaf Ebrahimi
486*22dc650dSSadaf EbrahimiVersion 10.00 is the first release of PCRE2, a revised API for the PCRE
487*22dc650dSSadaf Ebrahimilibrary. Changes prior to 10.00 are logged in the ChangeLog file for the old
488*22dc650dSSadaf EbrahimiAPI, up to item 20 for release 8.36. New programs are recommended to use the
489*22dc650dSSadaf Ebrahiminew library. Programs that use the original (PCRE1) API will need changing
490*22dc650dSSadaf Ebrahimibefore linking with the new library.
491*22dc650dSSadaf Ebrahimi
492*22dc650dSSadaf Ebrahimi****
493