[PATCH] [zeus] aspell: CVE-2019-20433

From: Stefan Ghinea @ 2020-03-12  9:23 UTC
  To: openembedded-core

libaspell.a in GNU Aspell before 0.60.8 has a buffer over-read for a string
ending with a single '\0' byte, if the encoding is set to ucs-2 or ucs-4
outside of the application, as demonstrated by the ASPELL_CONF environment
variable.
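
As a rough illustration of why a single '\0' byte is not enough for a wide
encoding, here is a minimal sketch. The helper ucs2_length_bounded is
hypothetical and not part of Aspell; it looks for a 16-bit zero terminator
but, unlike the vulnerable code path, never reads past the caller's buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper (not an Aspell function): count UCS-2 code units
// before a 16-bit zero terminator without reading past buf_bytes.
// Returns -1 when no 16-bit terminator exists inside the buffer.
long ucs2_length_bounded(const void *buf, std::size_t buf_bytes) {
  assert(buf != nullptr);
  const unsigned char *p = static_cast<const unsigned char *>(buf);
  for (std::size_t i = 0; i + 1 < buf_bytes; i += 2) {
    std::uint16_t unit;
    std::memcpy(&unit, p + i, sizeof unit);  // avoids unaligned reads
    if (unit == 0)
      return static_cast<long>(i / 2);
  }
  return -1;  // an unbounded scan would keep reading here
}
```

A narrow string such as "a" occupies the bytes 'a' and '\0', which form one
non-zero 16-bit unit, so the helper reports that no terminator was found. An
unbounded scan for a 16-bit zero would continue into adjacent memory, which
is the over-read described above.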

References:
https://nvd.nist.gov/vuln/detail/CVE-2019-20433

Upstream patches:
https://github.com/GNUAspell/aspell/commit/de29341638833ba7717bd6b5e6850998454b044b
https://github.com/GNUAspell/aspell/commit/cefd447e5528b08bb0cd6656bc52b4255692cefc
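
The size check that the first upstream patch adds for the original
(non-wide) API can be sketched roughly as follows. This is a simplified
stand-in for get_correct_size() in common/convert.hpp, assuming a build
without --enable-sloppy-null-term-strings; the name checked_size and the
bool/out-parameter shape are illustrative only, since the real function
returns PosibErr<int>.

```cpp
// Simplified sketch of the rule, not the Aspell implementation.
// conv_type_width: bytes per character of the configured encoding
//                  (1 for 8-bit encodings, 2 for ucs-2, 4 for ucs-4).
// size:            byte length of the string; a negative value means
//                  "null terminated".
// Returns false when a null-terminated string is passed with a negative
// size that does not match the encoding width (for example -1 with ucs-2).
bool checked_size(int conv_type_width, int size, int &out) {
  if (size <= -1 && size != -conv_type_width)
    return false;  // refuse to convert instead of over-reading
  out = size;
  return true;
}
```

With an 8-bit encoding a size of -1 is still accepted, while ucs-2 accepts
only -2 and ucs-4 only -4 as the "null terminated" marker. Non-negative
byte lengths pass through unchanged.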

Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
---
 .../aspell/aspell/CVE-2019-20433-0001.patch   | 999 ++++++++++++++++++
 .../aspell/aspell/CVE-2019-20433-0002.patch   |  68 ++
 meta/recipes-support/aspell/aspell_0.60.7.bb  |   2 +
 3 files changed, 1069 insertions(+)
 create mode 100644 meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch
 create mode 100644 meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch

diff --git a/meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch
new file mode 100644
index 0000000000..fd68461e32
--- /dev/null
+++ b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch
@@ -0,0 +1,999 @@
+From de29341638833ba7717bd6b5e6850998454b044b Mon Sep 17 00:00:00 2001
+From: Kevin Atkinson <kevina@gnu.org>
+Date: Sat, 17 Aug 2019 17:06:53 -0400
+Subject: [PATCH 1/2] Don't allow null-terminated UCS-2/4 strings using the
+ original API.
+
+Detect if the encoding is UCS-2/4 and the length is -1 in affected API
+functions and refuse to convert the string.  If the string ends up
+being converted somehow, abort with an error message in DecodeDirect
+and ConvDirect.  To convert a null terminated string in
+Decode/ConvDirect, a negative number corresponding to the width of the
+underlying character type for the encoding is expected; for example,
+if the encoding is "ucs-2" then the size is expected to be -2.
+
+Also fix a 1-3 byte over-read in DecodeDirect when reading UCS-2/4
+strings when a size is provided (found by OSS-Fuzz).
+
+Also fix a bug in DecodeDirect that caused DocumentChecker to return
+the wrong offsets when working with UCS-2/4 strings.
+
+CVE: CVE-2019-20433
+Upstream-Status: Backport [https://github.com/GNUAspell/aspell/commit/de29341638833ba7717bd6b5e6850998454b044b]
+
+[SG: - adjusted context
+     - discarded test changes as test framework is not available
+     - discarded manual entry changes for features that aren't backported]
+Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
+---
+ auto/MkSrc/CcHelper.pm      | 99 ++++++++++++++++++++++++++++++++++---
+ auto/MkSrc/Create.pm        |  5 +-
+ auto/MkSrc/Info.pm          |  5 +-
+ auto/MkSrc/ProcCc.pm        | 24 +++++----
+ auto/MkSrc/ProcImpl.pm      | 57 +++++++++++++++------
+ auto/MkSrc/Read.pm          |  4 +-
+ auto/mk-src.in              | 44 +++++++++++++++--
+ common/convert.cpp          | 39 ++++++++++++---
+ common/convert.hpp          | 38 +++++++++++++-
+ common/document_checker.cpp | 17 ++++++-
+ common/document_checker.hpp |  1 +
+ common/version.cpp          | 15 ++++--
+ configure.ac                |  8 +++
+ manual/aspell.texi          | 58 ++++++++++++++++------
+ manual/readme.texi          | 70 +++++++++++++++++++++-----
+ 15 files changed, 409 insertions(+), 75 deletions(-)
+
+diff --git a/auto/MkSrc/CcHelper.pm b/auto/MkSrc/CcHelper.pm
+index f2de991..0044335 100644
+--- a/auto/MkSrc/CcHelper.pm
++++ b/auto/MkSrc/CcHelper.pm
+@@ -10,8 +10,8 @@ BEGIN {
+   use Exporter;
+   our @ISA = qw(Exporter);
+   our @EXPORT = qw(to_c_return_type c_error_cond
+-		   to_type_name make_desc make_func call_func
+-		   make_c_method call_c_method form_c_method
++		   to_type_name make_desc make_func call_func get_c_func_name
++		   make_c_method make_wide_macro call_c_method form_c_method
+ 		   make_cxx_method);
+ }
+ 
+@@ -90,6 +90,69 @@ sub make_func ( $ \@ $ ; \% ) {
+ 	   ')'));
+ }
+ 
++=item make_wide_version NAME @TYPES PARMS ; %ACCUM
++
++Creates the wide character version of the function if needed
++
++=cut
++
++sub make_wide_version ( $ \@ $ ; \% ) {
++  my ($name, $d, $p, $accum) = @_;
++  my @d = @$d;
++  shift @d;
++  return '' unless grep {$_->{type} eq 'encoded string'} @d;
++  $accum->{sys_headers}{'stddef.h'} = true;
++  $accum->{suffix}[5] = <<'---';
++
++/******************* private implemantion details *********************/
++
++#ifdef __cplusplus
++#  define aspell_cast_(type, expr) (static_cast<type>(expr))
++#  define aspell_cast_from_wide_(str) (static_cast<const void *>(str))
++#else
++#  define aspell_cast_(type, expr) ((type)(expr))
++#  define aspell_cast_from_wide_(str) ((const char *)(str))
++#endif
++---
++  my @parms = map {$_->{type} eq 'encoded string'
++                       ? ($_->{name}, $_->{name}.'_size')
++                       : $_->{name}} @d;
++  $name = to_lower $name;
++  $accum->{suffix}[0] = <<'---';
++/**********************************************************************/
++
++#ifdef ASPELL_ENCODE_SETTING_SECURE
++---
++  $accum->{suffix}[2] = "#endif\n";
++  my @args = map  {$_->{type} eq 'encoded string'
++                       ? ($_->{name}, "$_->{name}_size", '-1')
++                       : $_->{name}} @d;
++  $accum->{suffix}[1] .=
++      (join '',
++       "#define $name",
++       '(', join(', ', @parms), ')',
++       "\\\n    ",
++       $name, '_wide',
++       '(', join(', ', @args), ')',
++       "\n");
++  @args = map  {$_->{type} eq 'encoded string'
++                    ? ("aspell_cast_from_wide_($_->{name})",
++                       "$_->{name}_size*aspell_cast_(int,sizeof(*($_->{name})))",
++                       "sizeof(*($_->{name}))")
++                    : $_->{name}} @d;
++  return (join '',
++          "\n",
++          "/* version of $name that is safe to use with (null terminated) wide characters */\n",
++          '#define ',
++          $name, '_w',
++          '(', join(', ', @parms), ')', 
++          "\\\n    ",
++          $name, '_wide',
++          '(', join(', ', @args), ')',
++          "\n");
++}
++
++
+ =item call_func NAME @TYPES PARMS ; %ACCUM
+ 
+ Return a string to call a func.  Will prefix the function with return
+@@ -103,7 +166,6 @@ Parms can be any of:
+ 
+ sub call_func ( $ \@ $ ; \% ) {
+   my ($name, $d, $p, $accum) = @_;
+-  $accum = {} unless defined $accum;
+   my @d = @$d;
+   my $func_ret = to_type_name(shift @d, {%$p,pos=>'return'}, %$accum);
+   return (join '',
+@@ -148,8 +210,14 @@ sub to_type_name ( $ $ ; \% ) {
+   my $name = $t->{name};
+   my $type = $t->{type};
+ 
+-  return ( (to_type_name {%$d, type=>'string'}, $p, %$accum) ,
+-	   (to_type_name {%$d, type=>'int', name=>"$d->{name}_size"}, $p, %$accum) )
++  if ($name eq 'encoded string' && $is_cc && $pos eq 'parm') {
++    my @types = ((to_type_name {%$d, type=>($p->{wide}?'const void pointer':'string')}, $p, %$accum),
++                 (to_type_name {%$d, type=>'int', name=>"$d->{name}_size"}, $p, %$accum));
++    push @types, (to_type_name {%$d, type=>'int', name=>"$d->{name}_type_width"}, $p, %$accum) if $p->{wide};
++    return @types;
++  }
++  return ( (to_type_name {%$d, type=>($p->{wide}?'const void pointer':'string')}, $p, %$accum) ,
++           (to_type_name {%$d, type=>'int', name=>"$d->{name}_size"}, $p, %$accum) )
+       if $name eq 'encoded string' && $is_cc && $pos eq 'parm';
+ 
+   my $str;
+@@ -174,7 +242,7 @@ sub to_type_name ( $ $ ; \% ) {
+ 	$str .= "String";
+       }
+     } elsif ($name eq 'encoded string') {
+-      $str .= "const char *";
++      $str .= $p->{wide} ? "const void *" : "const char *";
+     } elsif ($name eq '') {
+       $str .= "void";
+     } elsif ($name eq 'bool' && $is_cc) {
+@@ -186,7 +254,7 @@ sub to_type_name ( $ $ ; \% ) {
+       if ($t->{pointer}) {
+ 	$accum->{types}->{$name} = $t;
+       } else {
+-	$accum->{headers}->{$t->{created_in}} = true;
++        $accum->{headers}->{$t->{created_in}} = true unless $mode eq 'cc';
+       }
+       $str .= "$c_type Aspell" if $mode eq 'cc';
+       $str .= to_mixed($name);
+@@ -214,6 +282,7 @@ sub to_type_name ( $ $ ; \% ) {
+   return $str;
+ }
+ 
++
+ =item make_desc DESC ; LEVEL
+ 
+ Make a C comment out of DESC optionally indenting it LEVEL spaces.
+@@ -286,6 +355,7 @@ sub form_c_method ($ $ $ ; \% )
+     } else {
+       $func = "aspell $class $name";
+     }
++    $func .= " wide" if $p->{wide};
+     if (exists $d->{'const'}) {
+       splice @data, 1, 0, {type => "const $class", name=> $this_name};
+     } else {
+@@ -306,6 +376,21 @@ sub make_c_method ($ $ $ ; \%)
+   return &make_func(@ret);
+ }
+ 
++sub get_c_func_name ($ $ $)
++{
++  my @ret = &form_c_method(@_);
++  return undef unless @ret > 0;
++  return to_lower $ret[0];
++}
++
++sub make_wide_macro ($ $ $ ; \%)
++{
++  my @ret = &form_c_method(@_);
++  return undef unless @ret > 0;
++  my $str = &make_wide_version(@ret);
++  return $str;
++}
++
+ sub call_c_method ($ $ $ ; \%)
+ {
+   my @ret = &form_c_method(@_);
+diff --git a/auto/MkSrc/Create.pm b/auto/MkSrc/Create.pm
+index d39b60e..630ede5 100644
+--- a/auto/MkSrc/Create.pm
++++ b/auto/MkSrc/Create.pm
+@@ -77,8 +77,10 @@ sub create_cc_file ( % )  {
+   $file .= "#include \"aspell.h\"\n" if $p{type} eq 'cxx';
+   $file .= "#include \"settings.h\"\n" if $p{type} eq 'native_impl' && $p{name} eq 'errors';
+   $file .= "#include \"gettext.h\"\n" if $p{type} eq 'native_impl' && $p{name} eq 'errors';
++  $file .= cmap {"#include <$_>\n"} sort keys %{$accum{sys_headers}};
+   $file .= cmap {"#include \"".to_lower($_).".hpp\"\n"} sort keys %{$accum{headers}};
+-  $file .= "#ifdef __cplusplus\nextern \"C\" {\n#endif\n" if $p{header} && !$p{cxx};
++  $file .= "\n#ifdef __cplusplus\nextern \"C\" {\n#endif\n" if $p{header} && !$p{cxx};
++  $file .= join('', grep {defined $_} @{$accum{prefix}});
+   $file .= "\nnamespace $p{namespace} {\n\n" if $p{cxx};
+   if (defined $info{forward}{proc}{$p{type}}) {
+     my @types = sort {$a->{name} cmp $b->{name}} (values %{$accum{types}});
+@@ -86,6 +88,7 @@ sub create_cc_file ( % )  {
+   }
+   $file .= "\n";
+   $file .= $body;
++  $file .= join('', grep {defined $_} @{$accum{suffix}});
+   $file .= "\n\n}\n\n" if $p{cxx};
+   $file .= "#ifdef __cplusplus\n}\n#endif\n" if $p{header} && !$p{cxx};
+   $file .= "#endif /* $hm */\n" if $p{header};
+diff --git a/auto/MkSrc/Info.pm b/auto/MkSrc/Info.pm
+index c644028..ace8e21 100644
+--- a/auto/MkSrc/Info.pm
++++ b/auto/MkSrc/Info.pm
+@@ -60,6 +60,7 @@ each proc sub should take the following argv
+     the object from which it is a member of
+   no native: do not attempt to create a native implementation
+   treat as object: treat as a object rather than a pointer
+  no conv: do not convert an encoded string
+ 
+ The %info structure is initialized as follows:
+ 
+@@ -104,8 +105,8 @@ The %info structure is initialized as follows:
+   errors => {}, # possible errors
+   method => {
+     # A class method
+-    options => ['desc', 'posib err', 'c func', 'const',
+-		'c only', 'c impl', 'cxx impl'],
++    options => ['desc', 'posib err', 'c func', 'const', 'no conv', 'on conv error',
++		'c only', 'c impl', 'cxx impl', 'cc extra'],
+     groups => undef},
+   constructor => {
+     # A class constructor
+diff --git a/auto/MkSrc/ProcCc.pm b/auto/MkSrc/ProcCc.pm
+index 47c4338..98cc435 100644
+--- a/auto/MkSrc/ProcCc.pm
++++ b/auto/MkSrc/ProcCc.pm
+@@ -23,7 +23,7 @@ use MkSrc::Info;
+ sub make_c_object ( $ @ );
+ 
+ $info{group}{proc}{cc} = sub {
+-  my ($data) = @_;
++  my ($data,@rest) = @_;
+   my $ret;
+   my $stars = (70 - length $data->{name})/2;
+   $ret .= "/";
+@@ -33,14 +33,14 @@ $info{group}{proc}{cc} = sub {
+   $ret .= "/\n";
+   foreach my $d (@{$data->{data}}) {
+     $ret .= "\n\n";
+-    $ret .= $info{$d->{type}}{proc}{cc}->($d);
++    $ret .= $info{$d->{type}}{proc}{cc}->($d,@rest);
+   }
+   $ret .= "\n\n";
+   return $ret;
+ };
+ 
+ $info{enum}{proc}{cc} = sub {
+-  my ($d) = @_;
++  my ($d,@rest) = @_;
+   my $n = "Aspell".to_mixed($d->{name});
+   return ("\n".
+ 	  make_desc($d->{desc}).
+@@ -58,21 +58,26 @@ $info{struct}{proc}{cc} = sub {
+ };
+ 
+ $info{union}{proc}{cc} = sub {
+-  return make_c_object "union", $_[0];
++  return make_c_object "union", @_;
+ };
+ 
+ $info{class}{proc}{cc} = sub {
+-  my ($d) = @_;
++  my ($d,$accum) = @_;
+   my $class = $d->{name};
+   my $classname = "Aspell".to_mixed($class);
+   my $ret = "";
+   $ret .= "typedef struct $classname $classname;\n\n";
+   foreach (@{$d->{data}}) {
+-    my $s = make_c_method($class, $_, {mode=>'cc'});
++    my $s = make_c_method($class, $_, {mode=>'cc'}, %$accum);
+     next unless defined $s;
+     $ret .= "\n";
+     $ret .= make_desc($_->{desc});
+-    $ret .= make_c_method($class, $_, {mode=>'cc'}).";\n";
++    $ret .= make_c_method($class, $_, {mode=>'cc'}, %$accum).";\n";
++    if (grep {$_->{type} eq 'encoded string'} @{$_->{data}}) {
++      $ret .= make_c_method($class, $_, {mode=>'cc', wide=>true}, %$accum).";\n";
++      $ret .= make_wide_macro($class, $_, {mode=>'cc'}, %$accum);
++    }
++    $ret .= "\n".$_->{'cc extra'}."\n" if defined $_->{'cc extra'};
+   }
+   $ret .= "\n";
+   return $ret;
+@@ -105,7 +110,8 @@ $info{errors}{proc}{cc} = sub {
+ };
+ 
+ sub make_c_object ( $ @ ) {
+-  my ($t, $d) = @_;
++  my ($t, $d, $accum) = @_;
++  $accum = {} unless defined $accum;
+   my $struct;
+   $struct .= "Aspell";
+   $struct .= to_mixed($d->{name});
+@@ -120,7 +126,7 @@ sub make_c_object ( $ @ ) {
+ 		"\n};\n"),
+ 	  "typedef $t $struct $struct;",
+ 	  join ("\n",
+-		map {make_c_method($d->{name}, $_, {mode=>'cc'}).";"}
++		map {make_c_method($d->{name}, $_, {mode=>'cc'}, %$accum).";"}
+ 		grep {$_->{type} eq 'method'}
+ 		@{$d->{data}})
+ 	  )."\n";
+diff --git a/auto/MkSrc/ProcImpl.pm b/auto/MkSrc/ProcImpl.pm
+index b8628fd..3d0f220 100644
+--- a/auto/MkSrc/ProcImpl.pm
++++ b/auto/MkSrc/ProcImpl.pm
+@@ -45,10 +45,13 @@ $info{class}{proc}{impl} = sub {
+   foreach (grep {$_ ne ''} split /\s*,\s*/, $data->{'c impl headers'}) {
+     $accum->{headers}{$_} = true;
+   }
+-  foreach my $d (@{$data->{data}}) {
++  my @d = @{$data->{data}};
++  while (@d) {
++    my $d = shift @d;
++    my $need_wide = false;
+     next unless one_of $d->{type}, qw(method constructor destructor);
+     my @parms = @{$d->{data}} if exists $d->{data};
+-    my $m = make_c_method $data->{name}, $d, {mode=>'cc_cxx', use_name=>true}, %$accum;
++    my $m = make_c_method $data->{name}, $d, {mode=>'cc_cxx', use_name=>true, wide=>$d->{wide}}, %$accum;
+     next unless defined $m;
+     $ret .= "extern \"C\" $m\n";
+     $ret .= "{\n";
+@@ -57,24 +60,49 @@ $info{class}{proc}{impl} = sub {
+     } else {
+       if ($d->{type} eq 'method') {
+ 	my $ret_type = shift @parms;
+-	my $ret_native = to_type_name $ret_type, {mode=>'native_no_err', pos=>'return'}, %$accum;
++	my $ret_native = to_type_name $ret_type, {mode=>'native_no_err', pos=>'return', wide=>$d->{wide}}, %$accum;
+ 	my $snum = 0;
++        my $call_fun = $d->{name};
++        my @call_parms;
+ 	foreach (@parms) {
+ 	  my $n = to_lower($_->{name});
+-	  if ($_->{type} eq 'encoded string') {
+-	    $accum->{headers}{'mutable string'} = true;
+-	    $accum->{headers}{'convert'} = true;
+-	    $ret .= "  ths->temp_str_$snum.clear();\n";
+-	    $ret .= "  ths->to_internal_->convert($n, ${n}_size, ths->temp_str_$snum);\n";
+-	    $ret .= "  unsigned int s$snum = ths->temp_str_$snum.size();\n";
+-	    $_ = "MutableString(ths->temp_str_$snum.mstr(), s$snum)";
+-	    $snum++;
++	  if ($_->{type} eq 'encoded string' && !exists($d->{'no conv'})) {
++            $need_wide = true unless $d->{wide};
++            die unless exists $d->{'posib err'};
++            $accum->{headers}{'mutable string'} = true;
++            $accum->{headers}{'convert'} = true;
++            my $name = get_c_func_name $data->{name}, $d, {mode=>'cc_cxx', use_name=>true, wide=>$d->{wide}};
++            $ret .= "  ths->temp_str_$snum.clear();\n";
++            if ($d->{wide}) {
++              $ret .= "  ${n}_size = get_correct_size(\"$name\", ths->to_internal_->in_type_width(), ${n}_size, ${n}_type_width);\n";
++            } else {
++              $ret .= "  PosibErr<int> ${n}_fixed_size = get_correct_size(\"$name\", ths->to_internal_->in_type_width(), ${n}_size);\n";
++              if (exists($d->{'on conv error'})) {
++                $ret .= "  if (${n}_fixed_size.get_err()) {\n";
++                $ret .= "    ".$d->{'on conv error'}."\n";
++                $ret .= "  } else {\n";
++                $ret .= "    ${n}_size = ${n}_fixed_size;\n";
++                $ret .= "  }\n";
++              } else {
++                $ret .= "  ths->err_.reset(${n}_fixed_size.release_err());\n";
++                $ret .= "  if (ths->err_ != 0) return ".(c_error_cond $ret_type).";\n";
++              }
++            }
++            $ret .= "  ths->to_internal_->convert($n, ${n}_size, ths->temp_str_$snum);\n";
++            $ret .= "  unsigned int s$snum = ths->temp_str_$snum.size();\n";
++            push @call_parms, "MutableString(ths->temp_str_$snum.mstr(), s$snum)";
++            $snum++;
++          } elsif ($_->{type} eq 'encoded string') {
++            $need_wide = true unless $d->{wide};
++            push @call_parms, $n, "${n}_size";
++            push @call_parms, "${n}_type_width" if $d->{wide};
++            $call_fun .= " wide" if $d->{wide};
+ 	  } else {
+-	    $_ = $n;
++	    push @call_parms, $n;
+ 	  }
+ 	}
+-	my $parms = '('.(join ', ', @parms).')';
+-	my $exp = "ths->".to_lower($d->{name})."$parms";
++	my $parms = '('.(join ', ', @call_parms).')';
++	my $exp = "ths->".to_lower($call_fun)."$parms";
+ 	if (exists $d->{'posib err'}) {
+ 	  $accum->{headers}{'posib err'} = true;
+ 	  $ret .= "  PosibErr<$ret_native> ret = $exp;\n";
+@@ -118,6 +146,7 @@ $info{class}{proc}{impl} = sub {
+       }
+     }
+     $ret .= "}\n\n";
++    unshift @d,{%$d, wide=>true} if $need_wide;
+   }
+   return $ret;
+ };
+diff --git a/auto/MkSrc/Read.pm b/auto/MkSrc/Read.pm
+index 4b3d1d0..4bf640e 100644
+--- a/auto/MkSrc/Read.pm
++++ b/auto/MkSrc/Read.pm
+@@ -88,13 +88,13 @@ sub advance ( ) {
+     $in_pod = $1 if $line =~ /^\=(\w+)/;
+     $line = '' if $in_pod;
+     $in_pod = undef if $in_pod && $in_pod eq 'cut';
+-    $line =~ s/\#.*$//;
++    $line =~ s/(?<!\\)\#.*$//;
+     $line =~ s/^(\t*)//;
+     $level = $base_level + length($1);
+       $line =~ s/\s*$//;
+     ++$base_level if $line =~ s/^\{$//;
+     --$base_level if $line =~ s/^\}$//;
+-    $line =~ s/\\([{}])/$1/g;
++    $line =~ s/\\([{}#\\])/$1/g;
+   } while ($line eq '');
+   #print "$level:$line\n";
+ }
+diff --git a/auto/mk-src.in b/auto/mk-src.in
+index 0e7833a..eb3353f 100644
+--- a/auto/mk-src.in
++++ b/auto/mk-src.in
+@@ -608,6 +608,7 @@ errors:
+ 		invalid expression
+ 			mesg => "%expression" is not a valid regular expression.
+ 			parms => expression
++
+ }
+ group: speller
+ {
+@@ -650,6 +651,7 @@ class: speller
+ 		posib err
+ 		desc => Returns 0 if it is not in the dictionary,
+ 			1 if it is, or -1 on error.
++		on conv error => return 0;
+ 		/
+ 		bool
+ 		encoded string: word
+@@ -715,6 +717,8 @@ class: speller
+ 		desc => Return NULL on error.
+ 			The word list returned by suggest is only
+ 			valid until the next call to suggest.
++		on conv error =>
++			word = NULL; word_size = 0;
+ 		/
+ 		const word list
+ 		encoded string: word
+@@ -840,7 +844,6 @@ class: document checker
+ 		void
+ 
+ 	method: process
+-
+ 		desc => Process a string.
+ 			The string passed in should only be split on
+ 			white space characters.  Furthermore, between
+@@ -849,10 +852,10 @@ class: document checker
+ 			in the document.  Passing in strings out of
+ 			order, skipping strings or passing them in
+ 			more than once may lead to undefined results.
++		no conv
+ 		/
+ 		void
+-		string: str
+-		int: size
++		encoded string: str
+ 
+ 	method: next misspelling
+ 
+@@ -860,9 +863,23 @@ class: document checker
+ 			processed string.  If there are no more
+ 			misspelled words, then token.word will be
+ 			NULL and token.size will be 0
++		cc extra =>
++			\#define aspell_document_checker_next_misspelling_w(type, ths) \\
++			    aspell_document_checker_next_misspelling_adj(ths, sizeof(type))
+ 		/
+ 		token object
+ 
++	method: next misspelling adj
++		desc => internal: do not use
++		c impl =>
++			Token res = ths->next_misspelling();
++			res.offset /= type_width;
++			res.len /= type_width;
++			return res;
++		/
++		token object
++		int: type_width
++
+ 	method: filter
+ 
+ 		desc => Returns the underlying filter class.
+@@ -922,9 +939,30 @@ class: string enumeration
+ 			  ths->from_internal_->append_null(ths->temp_str);
+ 			  return ths->temp_str.data();
+ 			\}
++		cc extra =>
++			\#define aspell_string_enumeration_next_w(type, ths) \\
++			    aspell_cast_(const type *, aspell_string_enumeration_next_wide(ths, sizeof(type)))
+ 		/
+ 		const string
+ 
++	method: next wide
++		c impl =>
++			const char * s = ths->next();
++			if (s == 0) {
++			  return s;
++			} else if (ths->from_internal_ == 0) \{
++			  assert(type_width == 1);
++			  return s;
++			\} else \{
++			  assert(type_width == ths->from_internal_->out_type_width());
++			  ths->temp_str.clear();
++			  ths->from_internal_->convert(s,-1,ths->temp_str);
++			  ths->from_internal_->append_null(ths->temp_str);
++			  return ths->temp_str.data();
++			\}
++		/
++		const void pointer
++		int: type_width
+ }
+ group: info
+ {
+diff --git a/common/convert.cpp b/common/convert.cpp
+index 1add95a..7ae0317 100644
+--- a/common/convert.cpp
++++ b/common/convert.cpp
+@@ -541,18 +541,25 @@ namespace acommon {
+   // Trivial Conversion
+   //
+ 
++  const char * unsupported_null_term_wide_string_msg =
++    "Null-terminated wide-character strings unsupported when used this way.";
++
+   template <typename Chr>
+   struct DecodeDirect : public Decode 
+   {
++    DecodeDirect() {type_width = sizeof(Chr);}
+     void decode(const char * in0, int size, FilterCharVector & out) const {
+       const Chr * in = reinterpret_cast<const Chr *>(in0);
+-      if (size == -1) {
++      if (size == -sizeof(Chr)) {
+         for (;*in; ++in)
+-          out.append(*in);
++          out.append(*in, sizeof(Chr));
++      } else if (size <= -1) {
++        fprintf(stderr, "%s\n", unsupported_null_term_wide_string_msg);
++        abort();
+       } else {
+-        const Chr * stop = reinterpret_cast<const Chr *>(in0 +size);
++        const Chr * stop = reinterpret_cast<const Chr *>(in0) + size/sizeof(Chr);
+         for (;in != stop; ++in)
+-          out.append(*in);
++          out.append(*in, sizeof(Chr));
+       }
+     }
+     PosibErr<void> decode_ec(const char * in0, int size, 
+@@ -565,6 +572,7 @@ namespace acommon {
+   template <typename Chr>
+   struct EncodeDirect : public Encode
+   {
++    EncodeDirect() {type_width = sizeof(Chr);}
+     void encode(const FilterChar * in, const FilterChar * stop, 
+                 CharVector & out) const {
+       for (; in != stop; ++in) {
+@@ -594,11 +602,15 @@ namespace acommon {
+   template <typename Chr>
+   struct ConvDirect : public DirectConv
+   {
++    ConvDirect() {type_width = sizeof(Chr);}
+     void convert(const char * in0, int size, CharVector & out) const {
+-      if (size == -1) {
++      if (size == -sizeof(Chr)) {
+         const Chr * in = reinterpret_cast<const Chr *>(in0);
+         for (;*in != 0; ++in)
+           out.append(in, sizeof(Chr));
++      } else if (size <= -1) {
++        fprintf(stderr, "%s\n", unsupported_null_term_wide_string_msg);
++        abort();
+       } else {
+         out.append(in0, size);
+       }
+@@ -1121,5 +1133,20 @@ namespace acommon {
+     }
+     return 0;
+   }
+-  
++
++  PosibErr<void> unsupported_null_term_wide_string_err_(const char * func) {
++    static bool reported_to_stderr = false;
++    PosibErr<void> err = make_err(other_error, unsupported_null_term_wide_string_msg);
++    if (!reported_to_stderr) {
++      CERR.printf("ERROR: %s: %s\n", func, unsupported_null_term_wide_string_msg);
++      reported_to_stderr = true;
++    }
++    return err;
++  }
++
++  void unsupported_null_term_wide_string_abort_(const char * func) {
+    CERR.printf("%s: %s\n", func, unsupported_null_term_wide_string_msg);
++    abort();
++  }
++ 
+ }
+diff --git a/common/convert.hpp b/common/convert.hpp
+index 76332ee..c948973 100644
+--- a/common/convert.hpp
++++ b/common/convert.hpp
+@@ -7,6 +7,8 @@
+ #ifndef ASPELL_CONVERT__HPP
+ #define ASPELL_CONVERT__HPP
+ 
++#include "settings.h"
++
+ #include "string.hpp"
+ #include "posib_err.hpp"
+ #include "char_vector.hpp"
+@@ -25,8 +27,9 @@ namespace acommon {
+     typedef const Config CacheConfig;
+     typedef const char * CacheKey;
+     String key;
++    int type_width; // type width in bytes
+     bool cache_key_eq(const char * l) const  {return key == l;}
+-    ConvBase() {}
++    ConvBase() : type_width(1) {}
+   private:
+     ConvBase(const ConvBase &);
+     void operator=(const ConvBase &);
+@@ -56,6 +59,8 @@ namespace acommon {
+     virtual ~Encode() {}
+   };
+   struct DirectConv { // convert directly from in_code to out_code.
++    int type_width; // type width in bytes
++    DirectConv() : type_width(1) {}
+     // should not take ownership of decode and encode.
+     // decode and encode guaranteed to stick around for the life
+     // of the object.
+@@ -126,6 +131,9 @@ namespace acommon {
+     const char * in_code() const   {return decode_->key.c_str();}
+     const char * out_code() const  {return encode_->key.c_str();}
+ 
++    int in_type_width() const {return decode_->type_width;}
++    int out_type_width() const {return encode_->type_width;}
++
+     void append_null(CharVector & out) const
+     {
+       const char nul[4] = {0,0,0,0}; // 4 should be enough
+@@ -191,6 +199,10 @@ namespace acommon {
+       }
+     }
+ 
++    void convert(const void * in, int size, CharVector & out) {
++      convert(static_cast<const char *>(in), size, out);
++    }
++
+     void generic_convert(const char * in, int size, CharVector & out);
+     
+   };
+@@ -412,6 +424,30 @@ namespace acommon {
+       return operator()(str, str + byte_size);}
+   };
+ 
++#ifdef SLOPPY_NULL_TERM_STRINGS
++  static const bool sloppy_null_term_strings = true;
++#else
++  static const bool sloppy_null_term_strings = false;
++#endif
++  
++  PosibErr<void> unsupported_null_term_wide_string_err_(const char * func);
++  void unsupported_null_term_wide_string_abort_(const char * func);
++    
++  static inline PosibErr<int> get_correct_size(const char * func, int conv_type_width, int size) {
++    if (sloppy_null_term_strings && size <= -1)
++      return -conv_type_width;
++    if (size <= -1 && -conv_type_width != size)
++      return unsupported_null_term_wide_string_err_(func);
++    return size;
++  }
++  static inline int get_correct_size(const char * func, int conv_type_width, int size, int type_width) {
++    if ((sloppy_null_term_strings || type_width <= -1) && size <= -1)
++      return -conv_type_width;
++    if (size <= -1 && conv_type_width != type_width)
++      unsupported_null_term_wide_string_abort_(func);
++    return size;
++  }
++
+ }
+ 
+ #endif
+diff --git a/common/document_checker.cpp b/common/document_checker.cpp
+index 5e510c4..0ccf1cd 100644
+--- a/common/document_checker.cpp
++++ b/common/document_checker.cpp
+@@ -44,7 +44,9 @@ namespace acommon {
+   void DocumentChecker::process(const char * str, int size)
+   {
+     proc_str_.clear();
+-    conv_->decode(str, size, proc_str_);
++    PosibErr<int> fixed_size = get_correct_size("aspell_document_checker_process", conv_->in_type_width(), size);
++    if (!fixed_size.has_err())
++      conv_->decode(str, fixed_size, proc_str_);
+     proc_str_.append(0);
+     FilterChar * begin = proc_str_.pbegin();
+     FilterChar * end   = proc_str_.pend() - 1;
+@@ -53,6 +55,19 @@ namespace acommon {
+     tokenizer_->reset(begin, end);
+   }
+ 
++  void DocumentChecker::process_wide(const void * str, int size, int type_width)
++  {
++    proc_str_.clear();
++    int fixed_size = get_correct_size("aspell_document_checker_process", conv_->in_type_width(), size, type_width);
++    conv_->decode(static_cast<const char *>(str), fixed_size, proc_str_);
++    proc_str_.append(0);
++    FilterChar * begin = proc_str_.pbegin();
++    FilterChar * end   = proc_str_.pend() - 1;
++    if (filter_)
++      filter_->process(begin, end);
++    tokenizer_->reset(begin, end);
++  }
++  
+   Token DocumentChecker::next_misspelling()
+   {
+     bool correct;
+diff --git a/common/document_checker.hpp b/common/document_checker.hpp
+index d35bb88..11a3c73 100644
+--- a/common/document_checker.hpp
++++ b/common/document_checker.hpp
+@@ -36,6 +36,7 @@ namespace acommon {
+     PosibErr<void> setup(Tokenizer *, Speller *, Filter *);
+     void reset();
+     void process(const char * str, int size);
++    void process_wide(const void * str, int size, int type_width);
+     Token next_misspelling();
+     
+     Filter * filter() {return filter_;}
+diff --git a/common/version.cpp b/common/version.cpp
+index 414d938..9e60b75 100644
+--- a/common/version.cpp
++++ b/common/version.cpp
+@@ -1,8 +1,17 @@
+ #include "settings.h"
+ 
+-extern "C" const char * aspell_version_string() {
+ #ifdef NDEBUG
+-  return VERSION " NDEBUG";
++#  define NDEBUG_STR " NDEBUG"
++#else
++#  define NDEBUG_STR
++#endif
++
++#ifdef SLOPPY_NULL_TERM_STRINGS
++#  define SLOPPY_STR " SLOPPY"
++#else
++#  define SLOPPY_STR
+ #endif
+-  return VERSION;
++
++extern "C" const char * aspell_version_string() {
++  return VERSION NDEBUG_STR SLOPPY_STR;
+ }
+diff --git a/configure.ac b/configure.ac
+index 60e3b39..a5d51e3 100644
+--- a/configure.ac
++++ b/configure.ac
+@@ -73,6 +73,9 @@ AC_ARG_ENABLE(filter-version-control,
+ AC_ARG_ENABLE(32-bit-hash-fun,
+   AS_HELP_STRING([--enable-32-bit-hash-fun],[use 32-bit hash function for compiled dictionaries]))
+ 
++AC_ARG_ENABLE(sloppy-null-term-strings,
+  AS_HELP_STRING([--enable-sloppy-null-term-strings],[always allow null-terminated UCS-2 and UCS-4 strings]))
++
+ AC_ARG_ENABLE(pspell-compatibility,
+   AS_HELP_STRING([--disable-pspell-compatibility],[don't install pspell compatibility libraries]))
+ 
+@@ -141,6 +144,11 @@ then
+   AC_DEFINE(USE_32_BIT_HASH_FUN, 1, [Defined if 32-bit hash function should be used for compiled dictionaries.])
+ fi
+ 
++if test "$enable_sloppy_null_term_strings" = "yes"
++then
++  AC_DEFINE(SLOPPY_NULL_TERM_STRINGS, 1, [Defined if null-terminated UCS-2 and UCS-4 strings should always be allowed.])
++fi
++
+ AM_CONDITIONAL(PSPELL_COMPATIBILITY,  
+   [test "$enable_pspell_compatibility" != "no"])
+ AM_CONDITIONAL(INCREMENTED_SONAME,    
+diff --git a/manual/aspell.texi b/manual/aspell.texi
+index 45fa091..f400e06 100644
+--- a/manual/aspell.texi
++++ b/manual/aspell.texi
+@@ -158,7 +158,8 @@ Installing
+ 
+ * Generic Install Instructions::  
+ * HTML Manuals and "make clean"::  
+-* Curses Notes::                
++* Curses Notes::
++* Upgrading from Aspell 0.60.7::
+ * Loadable Filter Notes::       
+ * Upgrading from Aspell 0.50::  
+ * Upgrading from Aspell .33/Pspell .12::  
+@@ -2206,18 +2207,26 @@ int correct = aspell_speller_check(spell_checker, @var{word}, @var{size});
+ @end smallexample
+ 
+ @noindent
+-@var{word} is expected to be a @code{const char *} character
+-string.  If the encoding is set to be @code{ucs-2} or
+-@code{ucs-4} @var{word} is expected to be a cast
+-from either @code{const u16int *} or @code{const u32int *}
+-respectively.  @code{u16int} and @code{u32int} are generally
+-@code{unsigned short} and @code{unsigned int} respectively.
+-@var{size} is the length of the string or @code{-1} if the string
+-is null terminated.  If the string is a cast from @code{const u16int
+-*} or @code{const u32int *} then @code{@i{size}} is the amount of
+-space in bytes the string takes up after being cast to @code{const
+-char *} and not the true size of the string.  @code{sspell_speller_check}
+-will return @code{0} if it is not found and non-zero otherwise.
++@var{word} is expected to be a @code{const char *} character string.
++@var{size} is the length of the string or @code{-1} if the string is
++null terminated.  @code{aspell_speller_check} will return @code{0} if it is not found
++and non-zero otherwise.
++
++If you are using the @code{ucs-2} or @code{ucs-4} encoding then the
++string is expected to be an array of 2- or 4-byte wide integers
++(respectively) and the @code{_w} macro version should be used:
++
++@smallexample
++int correct = aspell_speller_check_w(spell_checker, @var{word}, @var{size});
++@end smallexample
++
++The macro will cast the string to the correct type, convert
++@var{size} into bytes for you, and then call the special wide version of the
++function that will make sure the encoding is correct for the type
++passed in.  For compatibility with older versions of Aspell the normal
++non-wide functions can still be used provided that the size of the
++string, in bytes, is also passed in.  Null-terminated @code{ucs-2} or
++@code{ucs-4} strings are no longer supported when using the non-wide functions.
+ 
+ If the word is not correct, then the @code{suggest} method can be used
+ to come up with likely replacements.
+@@ -2236,7 +2245,28 @@ delete_aspell_string_enumeration(elements);
+ 
+ Notice how @code{elements} is deleted but @code{suggestions} is not.
+ The value returned by @code{suggestions} is only valid to the next
+-call to @code{suggest}.  Once a replacement is made the
++call to @code{suggest}.
++
++If you are using the @code{ucs-2} or @code{ucs-4} encoding then, in
++addition to using the @code{_w} macro for the @code{suggest} method, you
++should also use the @code{_w} macro with the @code{next} method which
++will cast the string to the correct type for you.  For example, if you
++are using the @code{ucs-2} encoding and the string is a @code{const
++uint16_t *} then you should use:
++
++@smallexample
++AspellWordList * suggestions = aspell_speller_suggest_w(spell_checker,
++                                                        @var{word}, @var{size});
++AspellStringEnumeration * elements = aspell_word_list_elements(suggestions);
++const uint16_t * word;
++while ( (word = aspell_string_enumeration_next_w(uint16_t, elements)) != NULL )
++@{
++  // add to suggestion list
++@}
++delete_aspell_string_enumeration(elements);
++@end smallexample
++
++Once a replacement is made the
+ @code{store_repl} method should be used to communicate the replacement
+ pair back to the spell checker (for the reason, @pxref{Notes on
+ Storing Replacement Pairs}).  Its usage is as follows:
+diff --git a/manual/readme.texi b/manual/readme.texi
+index 669ab8e..531721f 100644
+--- a/manual/readme.texi
++++ b/manual/readme.texi
+@@ -15,15 +15,16 @@ The latest version can always be found at GNU Aspell's home page at
+ @uref{http://aspell.net}.
+ 
+ @menu
+-* Generic Install Instructions::  
+-* HTML Manuals and "make clean"::  
+-* Curses Notes::                
+-* Loadable Filter Notes::       
+-* Using 32-Bit Dictionaries on a 64-Bit System::  
+-* Upgrading from Aspell 0.50::  
+-* Upgrading from Aspell .33/Pspell .12::  
+-* Upgrading from a Pre-0.50 snapshot::  
+-* WIN32 Notes::                 
++* Generic Install Instructions::
++* HTML Manuals and "make clean"::
++* Curses Notes::
++* Upgrading from Aspell 0.60.7::
++* Loadable Filter Notes::
++* Using 32-Bit Dictionaries on a 64-Bit System::
++* Upgrading from Aspell 0.50::
++* Upgrading from Aspell .33/Pspell .12::
++* Upgrading from a Pre-0.50 snapshot::
++* WIN32 Notes::
+ @end menu
+ 
+ @node Generic Install Instructions
+@@ -121,17 +122,62 @@ In addition your system must also support the @code{mblen} function.
+ Although this function was defined in the ISO C89 standard (ANSI
+ X3.159-1989), not all systems have it.
+ 
++@node Upgrading from Aspell 0.60.7
++@appendixsec Upgrading from Aspell 0.60.7
++
++To prevent a potentially unbounded buffer over-read, Aspell no longer
++supports null-terminated UCS-2 and UCS-4 encoded strings with the
++original C API.  Null-terminated 8-bit or UTF-8 encoded strings are
++still supported, as are UCS-2 and UCS-4 encoded strings when the
++length is passed in.
++
++As of Aspell 0.60.8 a function from the original API that expects an
++encoded string as a parameter will return meaningless results (or an
++error code) if the string is null terminated and the encoding is set to
++@code{ucs-2} or @code{ucs-4}.  In addition, a single message:
++@example
++ERROR: aspell_speller_check: Null-terminated wide-character strings unsupported when used this way.
++@end example
++will be printed to standard error the first time one of those
++functions is called.
++
++Applications that use null-terminated UCS-2/4 strings should either (1)
++use the interface intended for working with wide characters
++(@pxref{Through the C API}); or (2) define
++@code{ASPELL_ENCODE_SETTING_SECURE} before including @code{aspell.h}.
++In the latter case it is important that the application explicitly
++sets the encoding to a known value.  Defining
++@code{ASPELL_ENCODE_SETTING_SECURE} and not setting the encoding
++explicitly or allowing users of the application to set the encoding
++could result in an unbounded buffer over-read.
++
++If it is necessary to preserve binary compatibility with older
++versions of Aspell, the easiest thing would be to determine the length
++of the UCS-2/4 string---in bytes---and pass that in.  Due to an
++implementation detail, existing API functions can be made to work with
++null-terminated UCS-2/4 strings safely by passing in either @code{-2}
++or @code{-4} (corresponding to the width of the character type) as the
++size.  Doing so, however, will cause a buffer over-read for unpatched
++versions of Aspell.  To avoid this it will be necessary to parse the
++version string to determine the correct value to use.  However, no
++official support will be provided for the latter method.
++
++If the application can not be recompiled, then Aspell can be configured
++to preserve the old behavior by passing
++@option{--enable-sloppy-null-term-strings} to @command{configure}.  When Aspell
++is compiled this way the version string will include the string
++@samp{ SLOPPY}.
++
+ @node Loadable Filter Notes
+ @appendixsec Loadable Filter Notes
+-
++             
+ Support for being able to load additional filter modules at run-time
+ has only been verified to work on Linux platforms.  If you get linker
+ errors when trying to use a filter, then it is likely that loadable
+ filter support is not working yet on your platform.  Thus, in order to
+ get Aspell to work correctly you will need to avoid compiling the
+ filters as individual modules by using the
+-@option{--enable-compile-in-filters} when configuring Aspell with
+-@command{./configure}.
++@option{--enable-compile-in-filters} @command{configure} option.
+ 
+ @node Using 32-Bit Dictionaries on a 64-Bit System
+ @appendixsec Using 32-Bit Dictionaries on a 64-Bit System
+-- 
+2.17.1
+
diff --git a/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch
new file mode 100644
index 0000000000..9569ddeebe
--- /dev/null
+++ b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch
@@ -0,0 +1,68 @@
+From cefd447e5528b08bb0cd6656bc52b4255692cefc Mon Sep 17 00:00:00 2001
+From: Kevin Atkinson <kevina@gnu.org>
+Date: Sat, 17 Aug 2019 20:25:21 -0400
+Subject: [PATCH 2/2] Increment library version to reflect API changes.
+
+CVE: CVE-2019-20433
+Upstream-Status: Backport [https://github.com/GNUAspell/aspell/commit/cefd447e5528b08bb0cd6656bc52b4255692cefc]
+
+Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
+---
+ Makefile.am | 31 +++++++++++++++++--------------
+ 1 file changed, 17 insertions(+), 14 deletions(-)
+
+diff --git a/Makefile.am b/Makefile.am
+index 7e15851..19dc044 100644
+--- a/Makefile.am
++++ b/Makefile.am
+@@ -94,18 +94,25 @@ libaspell_la_SOURCES =\
+ 
+ libaspell_la_LIBADD =  $(LTLIBINTL) $(PTHREAD_LIB)
+ 
+-## Libtool to so name
+-## C:R:A => (C-A).(A).(R)
+-## 16:5:0 => 16.0.5
+-## 16:5:1 => 15.1.5
+-## 18:0:2 => 16.2.0 
+-## 17:0:2 => 15.2.0
+-
++## The version string is current[:revision[:age]]
++##
++## Before a release that has changed the source code at all
++## increment revision.
++##
++## After merging changes that have changed the API in a backwards
++## compatible way set revision to 0 and bump both current and age.
++##
++## Do not change the API in a backwards incompatible way.
++##
++## See "Libtool: Updating version info"
++## (https://www.gnu.org/software/libtool/manual/html_node/Updating-version-info.html)
++## for more info.
++##
+ if INCREMENTED_SONAME
+-libaspell_la_LDFLAGS = -version-info 18:0:2 -no-undefined
++libaspell_la_LDFLAGS = -version-info 19:0:3 -no-undefined
+ else
+ ## Use C-1:R:A 
+-libaspell_la_LDFLAGS = -version-info 17:0:2 -no-undefined
++libaspell_la_LDFLAGS = -version-info 18:0:3 -no-undefined
+ endif
+ 
+ if PSPELL_COMPATIBILITY
+@@ -113,11 +120,7 @@ libpspell_la_SOURCES = lib/dummy.cpp
+ 
+ libpspell_la_LIBADD = libaspell.la
+ 
+-if INCREMENTED_SONAME
+-libpspell_la_LDFLAGS = -version-info 18:0:2 -no-undefined
+-else
+-libpspell_la_LDFLAGS = -version-info 17:0:2 -no-undefined
+-endif
++libpspell_la_LDFLAGS = $(libaspell_la_LDFLAGS)
+ 
+ endif
+ 
+-- 
+2.17.1
+
diff --git a/meta/recipes-support/aspell/aspell_0.60.7.bb b/meta/recipes-support/aspell/aspell_0.60.7.bb
index b565cb3c6e..1e104c263c 100644
--- a/meta/recipes-support/aspell/aspell_0.60.7.bb
+++ b/meta/recipes-support/aspell/aspell_0.60.7.bb
@@ -8,6 +8,8 @@ PR = "r1"
 
 SRC_URI = "${GNU_MIRROR}/aspell/aspell-${PV}.tar.gz \
            file://0001-Fix-various-bugs-found-by-OSS-Fuze.patch \
+           file://CVE-2019-20433-0001.patch \
+           file://CVE-2019-20433-0002.patch \
           "
 SRC_URI[md5sum] = "8ef2252609c511cd2bb26f3a3932ef28"
 SRC_URI[sha256sum] = "5ca8fc8cb0370cc6c9eb5b64c6d1bc5d57b3750dbf17887726c3407d833b70e4"
-- 
2.17.1
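
For callers that have to stay on the original (non-wide) API, the new
readme text suggests determining the length of the UCS-2/4 string in
bytes and passing that in instead of -1. A minimal sketch of that idea
in C, assuming the UCS-2 text is held as a NUL-terminated uint16_t
buffer; the helper name ucs2_byte_len is hypothetical and is not part of
Aspell:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not an Aspell function): length in bytes of a
 * NUL-terminated UCS-2 string, excluding the 16-bit terminator. */
static int ucs2_byte_len(const uint16_t *s)
{
    assert(s != NULL);
    size_t n = 0;
    while (s[n] != 0)   /* compare whole 16-bit units, not single bytes */
        n++;
    return (int)(n * sizeof *s);
}

/* Assumed call site, needs <aspell.h> and a speller configured for ucs-2:
 *
 *   int ok = aspell_speller_check(speller,
 *                                 (const char *)word,
 *                                 ucs2_byte_len(word));
 */
```

With the patched headers the aspell_speller_check_w() macro described in
the manual hunk is the documented alternative; the explicit byte length
is only the fallback for code that must also build against older Aspell.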



^ permalink raw reply related	[flat|nested] 7+ messages in thread
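
The second patch changes the libtool -version-info values. The comment
it removes from Makefile.am gives the mapping C:R:A => (C-A).(A).(R), so
the effect on the installed library version can be worked out by hand. A
small sketch in C of that arithmetic; the struct and function names are
made up for illustration, and the mapping is the one from the removed
comment (as used by libtool on Linux):

```c
#include <assert.h>

/* Library version numbers derived from libtool's current:revision:age. */
struct so_version {
    int major;  /* current - age */
    int minor;  /* age */
    int patch;  /* revision */
};

/* Hypothetical helper applying C:R:A => (C-A).(A).(R). */
static struct so_version so_version_from_cra(int current, int revision, int age)
{
    assert(age >= 0 && revision >= 0 && current >= age);
    struct so_version v = { current - age, age, revision };
    return v;
}
```

Under that mapping 18:0:2 gives 16.2.0 and 19:0:3 gives 16.3.0, while
17:0:2 gives 15.2.0 and 18:0:3 gives 15.3.0, so the leading number is
unchanged in both branches of the INCREMENTED_SONAME conditional.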

* Re: [PATCH] [zeus] aspell: CVE-2019-20433
  2020-03-12  9:23 [PATCH] [zeus] aspell: CVE-2019-20433 Stefan Ghinea
@ 2020-03-12 12:25 ` Mittal, Anuj
  2020-03-12 12:34   ` Mikko.Rapeli
  2020-03-12 14:35   ` Stefan Robert Ghinea
  0 siblings, 2 replies; 7+ messages in thread
From: Mittal, Anuj @ 2020-03-12 12:25 UTC (permalink / raw)
  To: openembedded-core@lists.openembedded.org,
	stefan.ghinea@windriver.com

It looks like this is changing the API. I wonder if this would need any
other change or break something elsewhere in OE-core, meta-oe?

http://aspell.net/buffer-overread-ucs.txt

Thanks,

Anuj

On Thu, 2020-03-12 at 11:23 +0200, Stefan Ghinea wrote:
> libaspell.a in GNU Aspell before 0.60.8 has a buffer over-read for a
> string
> ending with a single '\0' byte, if the encoding is set to ucs-2 or
> ucs-4
> outside of the application, as demonstrated by the ASPELL_CONF
> environment
> variable.
> 
> References:
> https://nvd.nist.gov/vuln/detail/CVE-2019-20433
> 
> Upstream patches:
> https://github.com/GNUAspell/aspell/commit/de29341638833ba7717bd6b5e6850998454b044b
> https://github.com/GNUAspell/aspell/commit/cefd447e5528b08bb0cd6656bc52b4255692cefc
> 
> Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
> ---
>  .../aspell/aspell/CVE-2019-20433-0001.patch   | 999
> ++++++++++++++++++
>  .../aspell/aspell/CVE-2019-20433-0002.patch   |  68 ++
>  meta/recipes-support/aspell/aspell_0.60.7.bb  |   2 +
>  3 files changed, 1069 insertions(+)
>  create mode 100644 meta/recipes-support/aspell/aspell/CVE-2019-
> 20433-0001.patch
>  create mode 100644 meta/recipes-support/aspell/aspell/CVE-2019-
> 20433-0002.patch
> 
> diff --git a/meta/recipes-support/aspell/aspell/CVE-2019-20433-
> 0001.patch b/meta/recipes-support/aspell/aspell/CVE-2019-20433-
> 0001.patch
> new file mode 100644
> index 0000000000..fd68461e32
> --- /dev/null
> +++ b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch
> @@ -0,0 +1,999 @@
> +From de29341638833ba7717bd6b5e6850998454b044b Mon Sep 17 00:00:00
> 2001
> +From: Kevin Atkinson <kevina@gnu.org>
> +Date: Sat, 17 Aug 2019 17:06:53 -0400
> +Subject: [PATCH 1/2] Don't allow null-terminated UCS-2/4 strings
> using the
> + original API.
> +
> +Detect if the encoding is UCS-2/4 and the length is -1 in affected
> API
> +functions and refuse to convert the string.  If the string ends up
> +being converted somehow, abort with an error message in DecodeDirect
> +and ConvDirect.  To convert a null terminated string in
> +Decode/ConvDirect, a negative number corresponding to the width of
> the
> +underlying character type for the encoding is expected; for example,
> +if the encoding is "ucs-2" then a the size is expected to be -2.
> +
> +Also fix a 1-3 byte over-read in DecodeDirect when reading UCS-2/4
> +strings when a size is provided (found by OSS-Fuzz).
> +
> +Also fix a bug in DecodeDirect that caused DocumentChecker to return
> +the wrong offsets when working with UCS-2/4 strings.
> +
> +CVE: CVE-2019-20433
> +Upstream-Status: Backport [
> https://github.com/GNUAspell/aspell/commit/de29341638833ba7717bd6b5e6850998454b044b
> ]
> +
> +[SG: - adjusted context
> +     - discarded test changes as test framework is not available
> +     - discarded manual entry changes for features that aren't
> backported]
> +Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
> +---
> + auto/MkSrc/CcHelper.pm      | 99
> ++++++++++++++++++++++++++++++++++---
> + auto/MkSrc/Create.pm        |  5 +-
> + auto/MkSrc/Info.pm          |  5 +-
> + auto/MkSrc/ProcCc.pm        | 24 +++++----
> + auto/MkSrc/ProcImpl.pm      | 57 +++++++++++++++------
> + auto/MkSrc/Read.pm          |  4 +-
> + auto/mk-src.in              | 44 +++++++++++++++--
> + common/convert.cpp          | 39 ++++++++++++---
> + common/convert.hpp          | 38 +++++++++++++-
> + common/document_checker.cpp | 17 ++++++-
> + common/document_checker.hpp |  1 +
> + common/version.cpp          | 15 ++++--
> + configure.ac                |  8 +++
> + manual/aspell.texi          | 58 ++++++++++++++++------
> + manual/readme.texi          | 70 +++++++++++++++++++++-----
> + 15 files changed, 409 insertions(+), 75 deletions(-)
> +
> +diff --git a/auto/MkSrc/CcHelper.pm b/auto/MkSrc/CcHelper.pm
> +index f2de991..0044335 100644
> +--- a/auto/MkSrc/CcHelper.pm
> ++++ b/auto/MkSrc/CcHelper.pm
> +@@ -10,8 +10,8 @@ BEGIN {
> +   use Exporter;
> +   our @ISA = qw(Exporter);
> +   our @EXPORT = qw(to_c_return_type c_error_cond
> +-		   to_type_name make_desc make_func call_func
> +-		   make_c_method call_c_method form_c_method
> ++		   to_type_name make_desc make_func call_func
> get_c_func_name
> ++		   make_c_method make_wide_macro call_c_method
> form_c_method
> + 		   make_cxx_method);
> + }
> + 
> +@@ -90,6 +90,69 @@ sub make_func ( $ \@ $ ; \% ) {
> + 	   ')'));
> + }
> + 
> ++=item make_wide_version NAME @TYPES PARMS ; %ACCUM
> ++
> ++Creates the wide character version of the function if needed
> ++
> ++=cut
> ++
> ++sub make_wide_version ( $ \@ $ ; \% ) {
> ++  my ($name, $d, $p, $accum) = @_;
> ++  my @d = @$d;
> ++  shift @d;
> ++  return '' unless grep {$_->{type} eq 'encoded string'} @d;
> ++  $accum->{sys_headers}{'stddef.h'} = true;
> ++  $accum->{suffix}[5] = <<'---';
> ++
> ++/******************* private implemantion details
> *********************/
> ++
> ++#ifdef __cplusplus
> ++#  define aspell_cast_(type, expr) (static_cast<type>(expr))
> ++#  define aspell_cast_from_wide_(str) (static_cast<const void
> *>(str))
> ++#else
> ++#  define aspell_cast_(type, expr) ((type)(expr))
> ++#  define aspell_cast_from_wide_(str) ((const char *)(str))
> ++#endif
> ++---
> ++  my @parms = map {$_->{type} eq 'encoded string'
> ++                       ? ($_->{name}, $_->{name}.'_size')
> ++                       : $_->{name}} @d;
> ++  $name = to_lower $name;
> ++  $accum->{suffix}[0] = <<'---';
> ++/******************************************************************
> ****/
> ++
> ++#ifdef ASPELL_ENCODE_SETTING_SECURE
> ++---
> ++  $accum->{suffix}[2] = "#endif\n";
> ++  my @args = map  {$_->{type} eq 'encoded string'
> ++                       ? ($_->{name}, "$_->{name}_size", '-1')
> ++                       : $_->{name}} @d;
> ++  $accum->{suffix}[1] .=
> ++      (join '',
> ++       "#define $name",
> ++       '(', join(', ', @parms), ')',
> ++       "\\\n    ",
> ++       $name, '_wide',
> ++       '(', join(', ', @args), ')',
> ++       "\n");
> ++  @args = map  {$_->{type} eq 'encoded string'
> ++                    ? ("aspell_cast_from_wide_($_->{name})",
> ++                       "$_-
> >{name}_size*aspell_cast_(int,sizeof(*($_->{name})))",
> ++                       "sizeof(*($_->{name}))")
> ++                    : $_->{name}} @d;
> ++  return (join '',
> ++          "\n",
> ++          "/* version of $name that is safe to use with (null
> terminated) wide characters */\n",
> ++          '#define ',
> ++          $name, '_w',
> ++          '(', join(', ', @parms), ')', 
> ++          "\\\n    ",
> ++          $name, '_wide',
> ++          '(', join(', ', @args), ')',
> ++          "\n");
> ++}
> ++
> ++
> + =item call_func NAME @TYPES PARMS ; %ACCUM
> + 
> + Return a string to call a func.  Will prefix the function with
> return
> +@@ -103,7 +166,6 @@ Parms can be any of:
> + 
> + sub call_func ( $ \@ $ ; \% ) {
> +   my ($name, $d, $p, $accum) = @_;
> +-  $accum = {} unless defined $accum;
> +   my @d = @$d;
> +   my $func_ret = to_type_name(shift @d, {%$p,pos=>'return'},
> %$accum);
> +   return (join '',
> +@@ -148,8 +210,14 @@ sub to_type_name ( $ $ ; \% ) {
> +   my $name = $t->{name};
> +   my $type = $t->{type};
> + 
> +-  return ( (to_type_name {%$d, type=>'string'}, $p, %$accum) ,
> +-	   (to_type_name {%$d, type=>'int', name=>"$d->{name}_size"},
> $p, %$accum) )
> ++  if ($name eq 'encoded string' && $is_cc && $pos eq 'parm') {
> ++    my @types = ((to_type_name {%$d, type=>($p->{wide}?'const void
> pointer':'string')}, $p, %$accum),
> ++                 (to_type_name {%$d, type=>'int', name=>"$d-
> >{name}_size"}, $p, %$accum));
> ++    push @types, (to_type_name {%$d, type=>'int', name=>"$d-
> >{name}_type_width"}, $p, %$accum) if $p->{wide};
> ++    return @types;
> ++  }
> ++  return ( (to_type_name {%$d, type=>($p->{wide}?'const void
> pointer':'string')}, $p, %$accum) ,
> ++           (to_type_name {%$d, type=>'int', name=>"$d-
> >{name}_size"}, $p, %$accum) )
> +       if $name eq 'encoded string' && $is_cc && $pos eq 'parm';
> + 
> +   my $str;
> +@@ -174,7 +242,7 @@ sub to_type_name ( $ $ ; \% ) {
> + 	$str .= "String";
> +       }
> +     } elsif ($name eq 'encoded string') {
> +-      $str .= "const char *";
> ++      $str .= $p->{wide} ? "const void *" : "const char *";
> +     } elsif ($name eq '') {
> +       $str .= "void";
> +     } elsif ($name eq 'bool' && $is_cc) {
> +@@ -186,7 +254,7 @@ sub to_type_name ( $ $ ; \% ) {
> +       if ($t->{pointer}) {
> + 	$accum->{types}->{$name} = $t;
> +       } else {
> +-	$accum->{headers}->{$t->{created_in}} = true;
> ++        $accum->{headers}->{$t->{created_in}} = true unless $mode
> eq 'cc';
> +       }
> +       $str .= "$c_type Aspell" if $mode eq 'cc';
> +       $str .= to_mixed($name);
> +@@ -214,6 +282,7 @@ sub to_type_name ( $ $ ; \% ) {
> +   return $str;
> + }
> + 
> ++
> + =item make_desc DESC ; LEVEL
> + 
> + Make a C comment out of DESC optionally indenting it LEVEL spaces.
> +@@ -286,6 +355,7 @@ sub form_c_method ($ $ $ ; \% )
> +     } else {
> +       $func = "aspell $class $name";
> +     }
> ++    $func .= " wide" if $p->{wide};
> +     if (exists $d->{'const'}) {
> +       splice @data, 1, 0, {type => "const $class", name=>
> $this_name};
> +     } else {
> +@@ -306,6 +376,21 @@ sub make_c_method ($ $ $ ; \%)
> +   return &make_func(@ret);
> + }
> + 
> ++sub get_c_func_name ($ $ $)
> ++{
> ++  my @ret = &form_c_method(@_);
> ++  return undef unless @ret > 0;
> ++  return to_lower $ret[0];
> ++}
> ++
> ++sub make_wide_macro ($ $ $ ; \%)
> ++{
> ++  my @ret = &form_c_method(@_);
> ++  return undef unless @ret > 0;
> ++  my $str = &make_wide_version(@ret);
> ++  return $str;
> ++}
> ++
> + sub call_c_method ($ $ $ ; \%)
> + {
> +   my @ret = &form_c_method(@_);
> +diff --git a/auto/MkSrc/Create.pm b/auto/MkSrc/Create.pm
> +index d39b60e..630ede5 100644
> +--- a/auto/MkSrc/Create.pm
> ++++ b/auto/MkSrc/Create.pm
> +@@ -77,8 +77,10 @@ sub create_cc_file ( % )  {
> +   $file .= "#include \"aspell.h\"\n" if $p{type} eq 'cxx';
> +   $file .= "#include \"settings.h\"\n" if $p{type} eq 'native_impl'
> && $p{name} eq 'errors';
> +   $file .= "#include \"gettext.h\"\n" if $p{type} eq 'native_impl'
> && $p{name} eq 'errors';
> ++  $file .= cmap {"#include <$_>\n"} sort keys
> %{$accum{sys_headers}};
> +   $file .= cmap {"#include \"".to_lower($_).".hpp\"\n"} sort keys
> %{$accum{headers}};
> +-  $file .= "#ifdef __cplusplus\nextern \"C\" {\n#endif\n" if
> $p{header} && !$p{cxx};
> ++  $file .= "\n#ifdef __cplusplus\nextern \"C\" {\n#endif\n" if
> $p{header} && !$p{cxx};
> ++  $file .= join('', grep {defined $_} @{$accum{prefix}});
> +   $file .= "\nnamespace $p{namespace} {\n\n" if $p{cxx};
> +   if (defined $info{forward}{proc}{$p{type}}) {
> +     my @types = sort {$a->{name} cmp $b->{name}} (values
> %{$accum{types}});
> +@@ -86,6 +88,7 @@ sub create_cc_file ( % )  {
> +   }
> +   $file .= "\n";
> +   $file .= $body;
> ++  $file .= join('', grep {defined $_} @{$accum{suffix}});
> +   $file .= "\n\n}\n\n" if $p{cxx};
> +   $file .= "#ifdef __cplusplus\n}\n#endif\n" if $p{header} &&
> !$p{cxx};
> +   $file .= "#endif /* $hm */\n" if $p{header};
> +diff --git a/auto/MkSrc/Info.pm b/auto/MkSrc/Info.pm
> +index c644028..ace8e21 100644
> +--- a/auto/MkSrc/Info.pm
> ++++ b/auto/MkSrc/Info.pm
> +@@ -60,6 +60,7 @@ each proc sub should take the following argv
> +     the object from which it is a member of
> +   no native: do not attempt to create a native implementation
> +   treat as object: treat as a object rather than a pointer
> ++  no conv: do not converted an encoded string
> + 
> + The %info structure is initialized as follows:
> + 
> +@@ -104,8 +105,8 @@ The %info structure is initialized as follows:
> +   errors => {}, # possible errors
> +   method => {
> +     # A class method
> +-    options => ['desc', 'posib err', 'c func', 'const',
> +-		'c only', 'c impl', 'cxx impl'],
> ++    options => ['desc', 'posib err', 'c func', 'const', 'no conv',
> 'on conv error',
> ++		'c only', 'c impl', 'cxx impl', 'cc extra'],
> +     groups => undef},
> +   constructor => {
> +     # A class constructor
> +diff --git a/auto/MkSrc/ProcCc.pm b/auto/MkSrc/ProcCc.pm
> +index 47c4338..98cc435 100644
> +--- a/auto/MkSrc/ProcCc.pm
> ++++ b/auto/MkSrc/ProcCc.pm
> +@@ -23,7 +23,7 @@ use MkSrc::Info;
> + sub make_c_object ( $ @ );
> + 
> + $info{group}{proc}{cc} = sub {
> +-  my ($data) = @_;
> ++  my ($data,@rest) = @_;
> +   my $ret;
> +   my $stars = (70 - length $data->{name})/2;
> +   $ret .= "/";
> +@@ -33,14 +33,14 @@ $info{group}{proc}{cc} = sub {
> +   $ret .= "/\n";
> +   foreach my $d (@{$data->{data}}) {
> +     $ret .= "\n\n";
> +-    $ret .= $info{$d->{type}}{proc}{cc}->($d);
> ++    $ret .= $info{$d->{type}}{proc}{cc}->($d,@rest);
> +   }
> +   $ret .= "\n\n";
> +   return $ret;
> + };
> + 
> + $info{enum}{proc}{cc} = sub {
> +-  my ($d) = @_;
> ++  my ($d,@rest) = @_;
> +   my $n = "Aspell".to_mixed($d->{name});
> +   return ("\n".
> + 	  make_desc($d->{desc}).
> +@@ -58,21 +58,26 @@ $info{struct}{proc}{cc} = sub {
> + };
> + 
> + $info{union}{proc}{cc} = sub {
> +-  return make_c_object "union", $_[0];
> ++  return make_c_object "union", @_;
> + };
> + 
> + $info{class}{proc}{cc} = sub {
> +-  my ($d) = @_;
> ++  my ($d,$accum) = @_;
> +   my $class = $d->{name};
> +   my $classname = "Aspell".to_mixed($class);
> +   my $ret = "";
> +   $ret .= "typedef struct $classname $classname;\n\n";
> +   foreach (@{$d->{data}}) {
> +-    my $s = make_c_method($class, $_, {mode=>'cc'});
> ++    my $s = make_c_method($class, $_, {mode=>'cc'}, %$accum);
> +     next unless defined $s;
> +     $ret .= "\n";
> +     $ret .= make_desc($_->{desc});
> +-    $ret .= make_c_method($class, $_, {mode=>'cc'}).";\n";
> ++    $ret .= make_c_method($class, $_, {mode=>'cc'}, %$accum).";\n";
> ++    if (grep {$_->{type} eq 'encoded string'} @{$_->{data}}) {
> ++      $ret .= make_c_method($class, $_, {mode=>'cc', wide=>true},
> %$accum).";\n";
> ++      $ret .= make_wide_macro($class, $_, {mode=>'cc'}, %$accum);
> ++    }
> ++    $ret .= "\n".$_->{'cc extra'}."\n" if defined $_->{'cc extra'};
> +   }
> +   $ret .= "\n";
> +   return $ret;
> +@@ -105,7 +110,8 @@ $info{errors}{proc}{cc} = sub {
> + };
> + 
> + sub make_c_object ( $ @ ) {
> +-  my ($t, $d) = @_;
> ++  my ($t, $d, $accum) = @_;
> ++  $accum = {} unless defined $accum;
> +   my $struct;
> +   $struct .= "Aspell";
> +   $struct .= to_mixed($d->{name});
> +@@ -120,7 +126,7 @@ sub make_c_object ( $ @ ) {
> + 		"\n};\n"),
> + 	  "typedef $t $struct $struct;",
> + 	  join ("\n",
> +-		map {make_c_method($d->{name}, $_, {mode=>'cc'}).";"}
> ++		map {make_c_method($d->{name}, $_, {mode=>'cc'},
> %$accum).";"}
> + 		grep {$_->{type} eq 'method'}
> + 		@{$d->{data}})
> + 	  )."\n";
> +diff --git a/auto/MkSrc/ProcImpl.pm b/auto/MkSrc/ProcImpl.pm
> +index b8628fd..3d0f220 100644
> +--- a/auto/MkSrc/ProcImpl.pm
> ++++ b/auto/MkSrc/ProcImpl.pm
> +@@ -45,10 +45,13 @@ $info{class}{proc}{impl} = sub {
> +   foreach (grep {$_ ne ''} split /\s*,\s*/, $data->{'c impl
> headers'}) {
> +     $accum->{headers}{$_} = true;
> +   }
> +-  foreach my $d (@{$data->{data}}) {
> ++  my @d = @{$data->{data}};
> ++  while (@d) {
> ++    my $d = shift @d;
> ++    my $need_wide = false;
> +     next unless one_of $d->{type}, qw(method constructor
> destructor);
> +     my @parms = @{$d->{data}} if exists $d->{data};
> +-    my $m = make_c_method $data->{name}, $d, {mode=>'cc_cxx',
> use_name=>true}, %$accum;
> ++    my $m = make_c_method $data->{name}, $d, {mode=>'cc_cxx',
> use_name=>true, wide=>$d->{wide}}, %$accum;
> +     next unless defined $m;
> +     $ret .= "extern \"C\" $m\n";
> +     $ret .= "{\n";
> +@@ -57,24 +60,49 @@ $info{class}{proc}{impl} = sub {
> +     } else {
> +       if ($d->{type} eq 'method') {
> + 	my $ret_type = shift @parms;
> +-	my $ret_native = to_type_name $ret_type,
> {mode=>'native_no_err', pos=>'return'}, %$accum;
> ++	my $ret_native = to_type_name $ret_type,
> {mode=>'native_no_err', pos=>'return', wide=>$d->{wide}}, %$accum;
> + 	my $snum = 0;
> ++        my $call_fun = $d->{name};
> ++        my @call_parms;
> + 	foreach (@parms) {
> + 	  my $n = to_lower($_->{name});
> +-	  if ($_->{type} eq 'encoded string') {
> +-	    $accum->{headers}{'mutable string'} = true;
> +-	    $accum->{headers}{'convert'} = true;
> +-	    $ret .= "  ths->temp_str_$snum.clear();\n";
> +-	    $ret .= "  ths->to_internal_->convert($n, ${n}_size, ths-
> >temp_str_$snum);\n";
> +-	    $ret .= "  unsigned int s$snum = ths-
> >temp_str_$snum.size();\n";
> +-	    $_ = "MutableString(ths->temp_str_$snum.mstr(), s$snum)";
> +-	    $snum++;
> ++	  if ($_->{type} eq 'encoded string' && !exists($d->{'no
> conv'})) {
> ++            $need_wide = true unless $d->{wide};
> ++            die unless exists $d->{'posib err'};
> ++            $accum->{headers}{'mutable string'} = true;
> ++            $accum->{headers}{'convert'} = true;
> ++            my $name = get_c_func_name $data->{name}, $d,
> {mode=>'cc_cxx', use_name=>true, wide=>$d->{wide}};
> ++            $ret .= "  ths->temp_str_$snum.clear();\n";
> ++            if ($d->{wide}) {
> ++              $ret .= "  ${n}_size = get_correct_size(\"$name\",
> ths->to_internal_->in_type_width(), ${n}_size, ${n}_type_width);\n";
> ++            } else {
> ++              $ret .= "  PosibErr<int> ${n}_fixed_size =
> get_correct_size(\"$name\", ths->to_internal_->in_type_width(),
> ${n}_size);\n";
> ++              if (exists($d->{'on conv error'})) {
> ++                $ret .= "  if (${n}_fixed_size.get_err()) {\n";
> ++                $ret .= "    ".$d->{'on conv error'}."\n";
> ++                $ret .= "  } else {\n";
> ++                $ret .= "    ${n}_size = ${n}_fixed_size;\n";
> ++                $ret .= "  }\n";
> ++              } else {
> ++                $ret .= "  ths-
> >err_.reset(${n}_fixed_size.release_err());\n";
> ++                $ret .= "  if (ths->err_ != 0) return
> ".(c_error_cond $ret_type).";\n";
> ++              }
> ++            }
> ++            $ret .= "  ths->to_internal_->convert($n, ${n}_size,
> ths->temp_str_$snum);\n";
> ++            $ret .= "  unsigned int s$snum = ths-
> >temp_str_$snum.size();\n";
> ++            push @call_parms, "MutableString(ths-
> >temp_str_$snum.mstr(), s$snum)";
> ++            $snum++;
> ++          } elsif ($_->{type} eq 'encoded string') {
> ++            $need_wide = true unless $d->{wide};
> ++            push @call_parms, $n, "${n}_size";
> ++            push @call_parms, "${n}_type_width" if $d->{wide};
> ++            $call_fun .= " wide" if $d->{wide};
> + 	  } else {
> +-	    $_ = $n;
> ++	    push @call_parms, $n;
> + 	  }
> + 	}
> +-	my $parms = '('.(join ', ', @parms).')';
> +-	my $exp = "ths->".to_lower($d->{name})."$parms";
> ++	my $parms = '('.(join ', ', @call_parms).')';
> ++	my $exp = "ths->".to_lower($call_fun)."$parms";
> + 	if (exists $d->{'posib err'}) {
> + 	  $accum->{headers}{'posib err'} = true;
> + 	  $ret .= "  PosibErr<$ret_native> ret = $exp;\n";
> +@@ -118,6 +146,7 @@ $info{class}{proc}{impl} = sub {
> +       }
> +     }
> +     $ret .= "}\n\n";
> ++    unshift @d,{%$d, wide=>true} if $need_wide;
> +   }
> +   return $ret;
> + };
> +diff --git a/auto/MkSrc/Read.pm b/auto/MkSrc/Read.pm
> +index 4b3d1d0..4bf640e 100644
> +--- a/auto/MkSrc/Read.pm
> ++++ b/auto/MkSrc/Read.pm
> +@@ -88,13 +88,13 @@ sub advance ( ) {
> +     $in_pod = $1 if $line =~ /^\=(\w+)/;
> +     $line = '' if $in_pod;
> +     $in_pod = undef if $in_pod && $in_pod eq 'cut';
> +-    $line =~ s/\#.*$//;
> ++    $line =~ s/(?<!\\)\#.*$//;
> +     $line =~ s/^(\t*)//;
> +     $level = $base_level + length($1);
> +       $line =~ s/\s*$//;
> +     ++$base_level if $line =~ s/^\{$//;
> +     --$base_level if $line =~ s/^\}$//;
> +-    $line =~ s/\\([{}])/$1/g;
> ++    $line =~ s/\\([{}#\\])/$1/g;
> +   } while ($line eq '');
> +   #print "$level:$line\n";
> + }
> +diff --git a/auto/mk-src.in b/auto/mk-src.in
> +index 0e7833a..eb3353f 100644
> +--- a/auto/mk-src.in
> ++++ b/auto/mk-src.in
> +@@ -608,6 +608,7 @@ errors:
> + 		invalid expression
> + 			mesg => "%expression" is not a valid regular
> expression.
> + 			parms => expression
> ++
> + }
> + group: speller
> + {
> +@@ -650,6 +651,7 @@ class: speller
> + 		posib err
> + 		desc => Returns 0 if it is not in the dictionary,
> + 			1 if it is, or -1 on error.
> ++		on conv error => return 0;
> + 		/
> + 		bool
> + 		encoded string: word
> +@@ -715,6 +717,8 @@ class: speller
> + 		desc => Return NULL on error.
> + 			The word list returned by suggest is only
> + 			valid until the next call to suggest.
> ++		on conv error =>
> ++			word = NULL; word_size = 0;
> + 		/
> + 		const word list
> + 		encoded string: word
> +@@ -840,7 +844,6 @@ class: document checker
> + 		void
> + 
> + 	method: process
> +-
> + 		desc => Process a string.
> + 			The string passed in should only be split on
> + 			white space characters.  Furthermore, between
> +@@ -849,10 +852,10 @@ class: document checker
> + 			in the document.  Passing in strings out of
> + 			order, skipping strings or passing them in
> + 			more than once may lead to undefined results.
> ++		no conv
> + 		/
> + 		void
> +-		string: str
> +-		int: size
> ++		encoded string: str
> + 
> + 	method: next misspelling
> + 
> +@@ -860,9 +863,23 @@ class: document checker
> + 			processed string.  If there are no more
> + 			misspelled words, then token.word will be
> + 			NULL and token.size will be 0
> ++		cc extra =>
> ++			\#define aspell_document_checker_next_misspelling_w(type, ths) \\
> ++			    aspell_document_checker_next_misspelling_adj(ths, sizeof(type))
> + 		/
> + 		token object
> + 
> ++	method: next misspelling adj
> ++		desc => internal: do not use
> ++		c impl =>
> ++			Token res = ths->next_misspelling();
> ++			res.offset /= type_width;
> ++			res.len /= type_width;
> ++			return res;
> ++		/
> ++		token object
> ++		int: type_width
> ++
> + 	method: filter
> + 
> + 		desc => Returns the underlying filter class.
> +@@ -922,9 +939,30 @@ class: string enumeration
> + 			  ths->from_internal_->append_null(ths->temp_str);
> + 			  return ths->temp_str.data();
> + 			\}
> ++		cc extra =>
> ++			\#define aspell_string_enumeration_next_w(type, ths) \\
> ++			    aspell_cast_(const type *, aspell_string_enumeration_next_wide(ths, sizeof(type)))
> + 		/
> + 		const string
> + 
> ++	method: next wide
> ++		c impl =>
> ++			const char * s = ths->next();
> ++			if (s == 0) {
> ++			  return s;
> ++			} else if (ths->from_internal_ == 0) \{
> ++			  assert(type_width == 1);
> ++			  return s;
> ++			\} else \{
> ++			  assert(type_width == ths->from_internal_->out_type_width());
> ++			  ths->temp_str.clear();
> ++			  ths->from_internal_->convert(s,-1,ths->temp_str);
> ++			  ths->from_internal_->append_null(ths->temp_str);
> ++			  return ths->temp_str.data();
> ++			\}
> ++		/
> ++		const void pointer
> ++		int: type_width
> + }
> + group: info
> + {
> +diff --git a/common/convert.cpp b/common/convert.cpp
> +index 1add95a..7ae0317 100644
> +--- a/common/convert.cpp
> ++++ b/common/convert.cpp
> +@@ -541,18 +541,25 @@ namespace acommon {
> +   // Trivial Conversion
> +   //
> + 
> ++  const char * unsupported_null_term_wide_string_msg =
> ++    "Null-terminated wide-character strings unsupported when used this way.";
> ++
> +   template <typename Chr>
> +   struct DecodeDirect : public Decode 
> +   {
> ++    DecodeDirect() {type_width = sizeof(Chr);}
> +     void decode(const char * in0, int size, FilterCharVector & out) const {
> +       const Chr * in = reinterpret_cast<const Chr *>(in0);
> +-      if (size == -1) {
> ++      if (size == -sizeof(Chr)) {
> +         for (;*in; ++in)
> +-          out.append(*in);
> ++          out.append(*in, sizeof(Chr));
> ++      } else if (size <= -1) {
> ++        fprintf(stderr, "%s\n", unsupported_null_term_wide_string_msg);
> ++        abort();
> +       } else {
> +-        const Chr * stop = reinterpret_cast<const Chr *>(in0 +size);
> ++        const Chr * stop = reinterpret_cast<const Chr *>(in0) + size/sizeof(Chr);
> +         for (;in != stop; ++in)
> +-          out.append(*in);
> ++          out.append(*in, sizeof(Chr));
> +       }
> +     }
> +     PosibErr<void> decode_ec(const char * in0, int size, 
> +@@ -565,6 +572,7 @@ namespace acommon {
> +   template <typename Chr>
> +   struct EncodeDirect : public Encode
> +   {
> ++    EncodeDirect() {type_width = sizeof(Chr);}
> +     void encode(const FilterChar * in, const FilterChar * stop, 
> +                 CharVector & out) const {
> +       for (; in != stop; ++in) {
> +@@ -594,11 +602,15 @@ namespace acommon {
> +   template <typename Chr>
> +   struct ConvDirect : public DirectConv
> +   {
> ++    ConvDirect() {type_width = sizeof(Chr);}
> +     void convert(const char * in0, int size, CharVector & out) const {
> +-      if (size == -1) {
> ++      if (size == -sizeof(Chr)) {
> +         const Chr * in = reinterpret_cast<const Chr *>(in0);
> +         for (;*in != 0; ++in)
> +           out.append(in, sizeof(Chr));
> ++      } else if (size <= -1) {
> ++        fprintf(stderr, "%s\n", unsupported_null_term_wide_string_msg);
> ++        abort();
> +       } else {
> +         out.append(in0, size);
> +       }
> +@@ -1121,5 +1133,20 @@ namespace acommon {
> +     }
> +     return 0;
> +   }
> +-  
> ++
> ++  PosibErr<void> unsupported_null_term_wide_string_err_(const char * func) {
> ++    static bool reported_to_stderr = false;
> ++    PosibErr<void> err = make_err(other_error, unsupported_null_term_wide_string_msg);
> ++    if (!reported_to_stderr) {
> ++      CERR.printf("ERROR: %s: %s\n", func, unsupported_null_term_wide_string_msg);
> ++      reported_to_stderr = true;
> ++    }
> ++    return err;
> ++  }
> ++
> ++  void unsupported_null_term_wide_string_abort_(const char * func) {
> ++    CERR.printf("%s: %s\n", unsupported_null_term_wide_string_msg);
> ++    abort();
> ++  }
> ++ 
> + }
> +diff --git a/common/convert.hpp b/common/convert.hpp
> +index 76332ee..c948973 100644
> +--- a/common/convert.hpp
> ++++ b/common/convert.hpp
> +@@ -7,6 +7,8 @@
> + #ifndef ASPELL_CONVERT__HPP
> + #define ASPELL_CONVERT__HPP
> + 
> ++#include "settings.h"
> ++
> + #include "string.hpp"
> + #include "posib_err.hpp"
> + #include "char_vector.hpp"
> +@@ -25,8 +27,9 @@ namespace acommon {
> +     typedef const Config CacheConfig;
> +     typedef const char * CacheKey;
> +     String key;
> ++    int type_width; // type width in bytes
> +     bool cache_key_eq(const char * l) const  {return key == l;}
> +-    ConvBase() {}
> ++    ConvBase() : type_width(1) {}
> +   private:
> +     ConvBase(const ConvBase &);
> +     void operator=(const ConvBase &);
> +@@ -56,6 +59,8 @@ namespace acommon {
> +     virtual ~Encode() {}
> +   };
> +   struct DirectConv { // convert directly from in_code to out_code.
> ++    int type_width; // type width in bytes
> ++    DirectConv() : type_width(1) {}
> +     // should not take ownership of decode and encode.
> +     // decode and encode guaranteed to stick around for the life
> +     // of the object.
> +@@ -126,6 +131,9 @@ namespace acommon {
> +     const char * in_code() const   {return decode_->key.c_str();}
> +     const char * out_code() const  {return encode_->key.c_str();}
> + 
> ++    int in_type_width() const {return decode_->type_width;}
> ++    int out_type_width() const {return encode_->type_width;}
> ++
> +     void append_null(CharVector & out) const
> +     {
> +       const char nul[4] = {0,0,0,0}; // 4 should be enough
> +@@ -191,6 +199,10 @@ namespace acommon {
> +       }
> +     }
> + 
> ++    void convert(const void * in, int size, CharVector & out) {
> ++      convert(static_cast<const char *>(in), size, out);
> ++    }
> ++
> +     void generic_convert(const char * in, int size, CharVector & out);
> +     
> +   };
> +@@ -412,6 +424,30 @@ namespace acommon {
> +       return operator()(str, str + byte_size);}
> +   };
> + 
> ++#ifdef SLOPPY_NULL_TERM_STRINGS
> ++  static const bool sloppy_null_term_strings = true;
> ++#else
> ++  static const bool sloppy_null_term_strings = false;
> ++#endif
> ++  
> ++  PosibErr<void> unsupported_null_term_wide_string_err_(const char * func);
> ++  void unsupported_null_term_wide_string_abort_(const char * func);
> ++    
> ++  static inline PosibErr<int> get_correct_size(const char * func, int conv_type_width, int size) {
> ++    if (sloppy_null_term_strings && size <= -1)
> ++      return -conv_type_width;
> ++    if (size <= -1 && -conv_type_width != size)
> ++      return unsupported_null_term_wide_string_err_(func);
> ++    return size;
> ++  }
> ++  static inline int get_correct_size(const char * func, int conv_type_width, int size, int type_width) {
> ++    if ((sloppy_null_term_strings || type_width <= -1) && size <= -1)
> ++      return -conv_type_width;
> ++    if (size <= -1 && conv_type_width != type_width)
> ++      unsupported_null_term_wide_string_abort_(func);
> ++    return size;
> ++  }
> ++
> + }
> + 
> + #endif
> +diff --git a/common/document_checker.cpp b/common/document_checker.cpp
> +index 5e510c4..0ccf1cd 100644
> +--- a/common/document_checker.cpp
> ++++ b/common/document_checker.cpp
> +@@ -44,7 +44,9 @@ namespace acommon {
> +   void DocumentChecker::process(const char * str, int size)
> +   {
> +     proc_str_.clear();
> +-    conv_->decode(str, size, proc_str_);
> ++    PosibErr<int> fixed_size = get_correct_size("aspell_document_checker_process", conv_->in_type_width(), size);
> ++    if (!fixed_size.has_err())
> ++      conv_->decode(str, fixed_size, proc_str_);
> +     proc_str_.append(0);
> +     FilterChar * begin = proc_str_.pbegin();
> +     FilterChar * end   = proc_str_.pend() - 1;
> +@@ -53,6 +55,19 @@ namespace acommon {
> +     tokenizer_->reset(begin, end);
> +   }
> + 
> ++  void DocumentChecker::process_wide(const void * str, int size, int type_width)
> ++  {
> ++    proc_str_.clear();
> ++    int fixed_size = get_correct_size("aspell_document_checker_process", conv_->in_type_width(), size, type_width);
> ++    conv_->decode(static_cast<const char *>(str), fixed_size, proc_str_);
> ++    proc_str_.append(0);
> ++    FilterChar * begin = proc_str_.pbegin();
> ++    FilterChar * end   = proc_str_.pend() - 1;
> ++    if (filter_)
> ++      filter_->process(begin, end);
> ++    tokenizer_->reset(begin, end);
> ++  }
> ++  
> +   Token DocumentChecker::next_misspelling()
> +   {
> +     bool correct;
> +diff --git a/common/document_checker.hpp b/common/document_checker.hpp
> +index d35bb88..11a3c73 100644
> +--- a/common/document_checker.hpp
> ++++ b/common/document_checker.hpp
> +@@ -36,6 +36,7 @@ namespace acommon {
> +     PosibErr<void> setup(Tokenizer *, Speller *, Filter *);
> +     void reset();
> +     void process(const char * str, int size);
> ++    void process_wide(const void * str, int size, int type_width);
> +     Token next_misspelling();
> +     
> +     Filter * filter() {return filter_;}
> +diff --git a/common/version.cpp b/common/version.cpp
> +index 414d938..9e60b75 100644
> +--- a/common/version.cpp
> ++++ b/common/version.cpp
> +@@ -1,8 +1,17 @@
> + #include "settings.h"
> + 
> +-extern "C" const char * aspell_version_string() {
> + #ifdef NDEBUG
> +-  return VERSION " NDEBUG";
> ++#  define NDEBUG_STR " NDEBUG"
> ++#else
> ++#  define NDEBUG_STR
> ++#endif
> ++
> ++#ifdef SLOPPY_NULL_TERM_STRINGS
> ++#  define SLOPPY_STR " SLOPPY"
> ++#else
> ++#  define SLOPPY_STR
> + #endif
> +-  return VERSION;
> ++
> ++extern "C" const char * aspell_version_string() {
> ++  return VERSION NDEBUG_STR SLOPPY_STR;
> + }
> +diff --git a/configure.ac b/configure.ac
> +index 60e3b39..a5d51e3 100644
> +--- a/configure.ac
> ++++ b/configure.ac
> +@@ -73,6 +73,9 @@ AC_ARG_ENABLE(filter-version-control,
> + AC_ARG_ENABLE(32-bit-hash-fun,
> +   AS_HELP_STRING([--enable-32-bit-hash-fun],[use 32-bit hash function for compiled dictionaries]))
> + 
> ++AC_ARG_ENABLE(sloppy-null-term-strings,
> ++  AS_HELP_STRING([--enable-sloppy-null-term-strings],[allows allow null terminated UCS-2 and UCS-4 strings]))
> ++
> + AC_ARG_ENABLE(pspell-compatibility,
> +   AS_HELP_STRING([--disable-pspell-compatibility],[don't install pspell compatibility libraries]))
> + 
> +@@ -141,6 +144,11 @@ then
> +   AC_DEFINE(USE_32_BIT_HASH_FUN, 1, [Defined if 32-bit hash function should be used for compiled dictionaries.])
> + fi
> + 
> ++if test "$enable_sloppy_null_term_strings" = "yes"
> ++then
> ++  AC_DEFINE(SLOPPY_NULL_TERM_STRINGS, 1, [Defined if null-terminated UCS-2 and UCS-4 strings should always be allowed.])
> ++fi
> ++
> + AM_CONDITIONAL(PSPELL_COMPATIBILITY,  
> +   [test "$enable_pspell_compatibility" != "no"])
> + AM_CONDITIONAL(INCREMENTED_SONAME,    
> +diff --git a/manual/aspell.texi b/manual/aspell.texi
> +index 45fa091..f400e06 100644
> +--- a/manual/aspell.texi
> ++++ b/manual/aspell.texi
> +@@ -158,7 +158,8 @@ Installing
> + 
> + * Generic Install Instructions::  
> + * HTML Manuals and "make clean"::  
> +-* Curses Notes::                
> ++* Curses Notes::
> ++* Upgrading from Aspell 0.60.7::
> + * Loadable Filter Notes::       
> + * Upgrading from Aspell 0.50::  
> + * Upgrading from Aspell .33/Pspell .12::  
> +@@ -2206,18 +2207,26 @@ int correct = aspell_speller_check(spell_checker, @var{word}, @var{size});
> + @end smallexample
> + 
> + @noindent
> +-@var{word} is expected to be a @code{const char *} character
> +-string.  If the encoding is set to be @code{ucs-2} or
> +-@code{ucs-4} @var{word} is expected to be a cast
> +-from either @code{const u16int *} or @code{const u32int *}
> +-respectively.  @code{u16int} and @code{u32int} are generally
> +-@code{unsigned short} and @code{unsigned int} respectively.
> +-@var{size} is the length of the string or @code{-1} if the string
> +-is null terminated.  If the string is a cast from @code{const u16int
> +-*} or @code{const u32int *} then @code{@i{size}} is the amount of
> +-space in bytes the string takes up after being cast to @code{const
> +-char *} and not the true size of the string.  @code{sspell_speller_check}
> +-will return @code{0} if it is not found and non-zero otherwise.
> ++@var{word} is expected to be a @code{const char *} character string.
> ++@var{size} is the length of the string or @code{-1} if the string is
> ++null terminated.  @code{aspell_speller_check} will return @code{0} if it is not found
> ++and non-zero otherwise.
> ++
> ++If you are using the @code{ucs-2} or @code{ucs-4} encoding then the
> ++string is expected to be either a 2 or 4 byte wide integer
> ++(respectively) and the @code{_w} macro vesion should be used:
> ++
> ++@smallexample
> ++int correct = aspell_speller_check_w(spell_checker, @var{word}, @var{size});
> ++@end smallexample
> ++
> ++The macro will cast the string to to the correct type and convert
> ++@var{size} into bytes for you and then a call the special wide version of the
> ++function that will make sure the encoding is correct for the type
> ++passed in.  For compatibility with older versions of Aspell the normal
> ++non-wide functions can still be used provided that the size of the
> ++string, in bytes, is also passed in.  Null terminated @code{ucs-2} or
> ++@code{ucs-4} are no longer supported when using the non-wide functions.
> + 
> + If the word is not correct, then the @code{suggest} method can be used
> + to come up with likely replacements.
> +@@ -2236,7 +2245,28 @@ delete_aspell_string_enumeration(elements);
> + 
> + Notice how @code{elements} is deleted but @code{suggestions} is not.
> + The value returned by @code{suggestions} is only valid to the next
> +-call to @code{suggest}.  Once a replacement is made the
> ++call to @code{suggest}.
> ++
> ++If you are using the @code{ucs-2} or @code{ucs-4} encoding then, in
> ++addition to using the @code{_w} macro for the @code{suggest} method, you
> ++should also use the @code{_w} macro with the @code{next} method which
> ++will cast the string to the correct type for you.  For example, if you
> ++are using the @code{ucs-2} encoding and the string is a @code{const
> ++uint16_t *} then you should use:
> ++
> ++@smallexample
> ++AspellWordList * suggestions = aspell_speller_suggest_w(spell_checker,
> ++                                                        @var{word}, @var{size});
> ++AspellStringEnumeration * elements = aspell_word_list_elements(suggestions);
> ++const uint16_t * word;
> ++while ( (word = aspell_string_enumeration_next_w(uint16_t, aspell_elements)) != NULL )
> ++@{
> ++  // add to suggestion list
> ++@}
> ++delete_aspell_string_enumeration(elements);
> ++@end smallexample
> ++
> ++Once a replacement is made the
> + @code{store_repl} method should be used to communicate the replacement
> + pair back to the spell checker (for the reason, @pxref{Notes on
> + Storing Replacement Pairs}).  Its usage is as follows:
> +diff --git a/manual/readme.texi b/manual/readme.texi
> +index 669ab8e..531721f 100644
> +--- a/manual/readme.texi
> ++++ b/manual/readme.texi
> +@@ -15,15 +15,16 @@ The latest version can always be found at GNU Aspell's home page at
> + @uref{http://aspell.net}.
> + 
> + @menu
> +-* Generic Install Instructions::  
> +-* HTML Manuals and "make clean"::  
> +-* Curses Notes::                
> +-* Loadable Filter Notes::       
> +-* Using 32-Bit Dictionaries on a 64-Bit System::  
> +-* Upgrading from Aspell 0.50::  
> +-* Upgrading from Aspell .33/Pspell .12::  
> +-* Upgrading from a Pre-0.50 snapshot::  
> +-* WIN32 Notes::                 
> ++* Generic Install Instructions::
> ++* HTML Manuals and "make clean"::
> ++* Curses Notes::
> ++* Upgrading from Aspell 0.60.7::
> ++* Loadable Filter Notes::
> ++* Using 32-Bit Dictionaries on a 64-Bit System::
> ++* Upgrading from Aspell 0.50::
> ++* Upgrading from Aspell .33/Pspell .12::
> ++* Upgrading from a Pre-0.50 snapshot::
> ++* WIN32 Notes::
> + @end menu
> + 
> + @node Generic Install Instructions
> +@@ -121,17 +122,62 @@ In addition your system must also support the @code{mblen} function.
> + Although this function was defined in the ISO C89 standard (ANSI
> + X3.159-1989), not all systems have it.
> + 
> ++@node Upgrading from Aspell 0.60.7
> ++@appendixsec Upgrading from Aspell 0.60.7
> ++
> ++To prevent a potentially unbounded buffer over-read, Aspell no longer
> ++supports null-terminated UCS-2 and UCS-4 encoded strings with the
> ++original C API.  Null-termianted 8-bit or UTF-8 encoded strings are
> ++still supported, as are UCS-2 and UCS-4 encoded strings when the
> ++length is passed in.
> ++
> ++As of Aspell 0.60.8 a function from the original API that expects an
> ++encoded string as a parameter will return meaningless results (or an
> ++error code) if string is null terminated and the encoding is set to
> ++@code{ucs-2} or @code{ucs-4}.  In addition, a single:
> ++@example
> ++ERROR: aspell_speller_check: Null-terminated wide-character strings unsupported when used this way.
> ++@end example
> ++will be printed to standard error the first time one of those
> ++functions is called.
> ++
> ++Application that use null-terminated UCS-2/4 strings should either (1)
> ++use the interface intended for working with wide-characters
> ++(@xref{Through the C API}); or (2) define
> ++@code{ASPELL_ENCODE_SETTING_SECURE} before including @code{aspell.h}.
> ++In the latter case is is important that the application explicitly
> ++sets the encoding to a known value.  Defining
> ++@code{ASPELL_ENCODE_SETTING_SECURE} and not setting the encoding
> ++explicitly or allowing user of the application to set the encoding
> ++could result in an unbounded buffer over-read.
> ++
> ++If it is necessary to preserve binary compatibility with older
> ++versions of Aspell, the easiest thing would be to determine the length
> ++of the UCS-2/4 string---in bytes---and pass that in.  Due to an
> ++implemenation detail, existing API functions can be made to work with
> ++null-terminated UCS-2/4 strings safely by passing in either @code{-2}
> ++or @code{-4} (corresponding to the width of the character type) as the
> ++size.  Doing so, however, will cause a buffer over-read for unpatched
> ++version of Aspell.  To avoid this it will be necessary to parse the
> ++version string to determine the correct value to use.  However, no
> ++official support will be provided for the latter method.
> ++
> ++If the application can not be recompiled, then Aspell can be configured
> ++to preserve the old behavior by passing
> ++@option{--enable-sloppy-null-term-strings} to @command{configure}.  When Aspell
> ++is compiled this way the version string will include the string
> ++@samp{ SLOPPY}.
> ++
> + @node Loadable Filter Notes
> + @appendixsec Loadable Filter Notes
> +-
> ++             
> + Support for being able to load additional filter modules at run-time
> + has only been verified to work on Linux platforms.  If you get linker
> + errors when trying to use a filter, then it is likely that loadable
> + filter support is not working yet on your platform.  Thus, in order to
> + get Aspell to work correctly you will need to avoid compiling the
> + filters as individual modules by using the
> +-@option{--enable-compile-in-filters} when configuring Aspell with
> +-@command{./configure}.
> ++@option{--enable-compile-in-filters} @command{configure} option.
> + 
> + @node Using 32-Bit Dictionaries on a 64-Bit System
> + @appendixsec Using 32-Bit Dictionaries on a 64-Bit System
> +-- 
> +2.17.1
> +
> diff --git a/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch
> new file mode 100644
> index 0000000000..9569ddeebe
> --- /dev/null
> +++ b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch
> @@ -0,0 +1,68 @@
> +From cefd447e5528b08bb0cd6656bc52b4255692cefc Mon Sep 17 00:00:00 2001
> +From: Kevin Atkinson <kevina@gnu.org>
> +Date: Sat, 17 Aug 2019 20:25:21 -0400
> +Subject: [PATCH 2/2] Increment library version to reflect API changes.
> +
> +CVE: CVE-2019-20433
> +Upstream-Status: Backport [https://github.com/GNUAspell/aspell/commit/cefd447e5528b08bb0cd6656bc52b4255692cefc]
> +
> +Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
> +---
> + Makefile.am | 31 +++++++++++++++++--------------
> + 1 file changed, 17 insertions(+), 14 deletions(-)
> +
> +diff --git a/Makefile.am b/Makefile.am
> +index 7e15851..19dc044 100644
> +--- a/Makefile.am
> ++++ b/Makefile.am
> +@@ -94,18 +94,25 @@ libaspell_la_SOURCES =\
> + 
> + libaspell_la_LIBADD =  $(LTLIBINTL) $(PTHREAD_LIB)
> + 
> +-## Libtool to so name
> +-## C:R:A => (C-A).(A).(R)
> +-## 16:5:0 => 16.0.5
> +-## 16:5:1 => 15.1.5
> +-## 18:0:2 => 16.2.0 
> +-## 17:0:2 => 15.2.0
> +-
> ++## The version string is current[:revision[:age]]
> ++##
> ++## Before a release that has changed the source code at all
> ++## increment revision.
> ++##
> ++## After merging changes that have changed the API in a backwards
> ++## comptable way set revision to 0 and bump both current and age.
> ++##
> ++## Do not change the API in a backwards incompatible way.
> ++##
> ++## See "Libtool: Updating version info"
> ++## (https://www.gnu.org/software/libtool/manual/html_node/Updating-version-info.html)
> ++## for more into
> ++##
> + if INCREMENTED_SONAME
> +-libaspell_la_LDFLAGS = -version-info 18:0:2 -no-undefined
> ++libaspell_la_LDFLAGS = -version-info 19:0:3 -no-undefined
> + else
> + ## Use C-1:R:A 
> +-libaspell_la_LDFLAGS = -version-info 17:0:2 -no-undefined
> ++libaspell_la_LDFLAGS = -version-info 18:0:3 -no-undefined
> + endif
> + 
> + if PSPELL_COMPATIBILITY
> +@@ -113,11 +120,7 @@ libpspell_la_SOURCES = lib/dummy.cpp
> + 
> + libpspell_la_LIBADD = libaspell.la
> + 
> +-if INCREMENTED_SONAME
> +-libpspell_la_LDFLAGS = -version-info 18:0:2 -no-undefined
> +-else
> +-libpspell_la_LDFLAGS = -version-info 17:0:2 -no-undefined
> +-endif
> ++libpspell_la_LDFLAGS = $(libaspell_la_LDFLAGS)
> + 
> + endif
> + 
> +-- 
> +2.17.1
> +
> diff --git a/meta/recipes-support/aspell/aspell_0.60.7.bb b/meta/recipes-support/aspell/aspell_0.60.7.bb
> index b565cb3c6e..1e104c263c 100644
> --- a/meta/recipes-support/aspell/aspell_0.60.7.bb
> +++ b/meta/recipes-support/aspell/aspell_0.60.7.bb
> @@ -8,6 +8,8 @@ PR = "r1"
>  
>  SRC_URI = "${GNU_MIRROR}/aspell/aspell-${PV}.tar.gz \
>             file://0001-Fix-various-bugs-found-by-OSS-Fuze.patch \
> +           file://CVE-2019-20433-0001.patch \
> +           file://CVE-2019-20433-0002.patch \
>            "
>  SRC_URI[md5sum] = "8ef2252609c511cd2bb26f3a3932ef28"
> SRC_URI[sha256sum] = "5ca8fc8cb0370cc6c9eb5b64c6d1bc5d57b3750dbf17887726c3407d833b70e4"
> -- 
> 2.17.1
> 

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [zeus] aspell: CVE-2019-20433
  2020-03-12 12:25 ` Mittal, Anuj
@ 2020-03-12 12:34   ` Mikko.Rapeli
  2020-03-12 12:49     ` Adrian Bunk
  2020-03-12 13:04     ` Mittal, Anuj
  2020-03-12 14:35   ` Stefan Robert Ghinea
  1 sibling, 2 replies; 7+ messages in thread
From: Mikko.Rapeli @ 2020-03-12 12:34 UTC (permalink / raw)
  To: anuj.mittal; +Cc: openembedded-core

On Thu, Mar 12, 2020 at 12:25:21PM +0000, Mittal, Anuj wrote:
> It looks like this is changing the API. I wonder if this would need any
> other change or break something elsewhere in OE-core, meta-oe?
> 
> http://aspell.net/buffer-overread-ucs.txt

Debian classified the issue as minor and fixed it only by updating
to 0.60.8:

https://security-tracker.debian.org/tracker/CVE-2019-20433

https://metadata.ftp-master.debian.org/changelogs//main/a/aspell/aspell_0.60.8-1_changelog

Maybe whitelist for stable branches and update to new version on master?

Cheers,

-Mikko

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [zeus] aspell: CVE-2019-20433
  2020-03-12 12:34   ` Mikko.Rapeli
@ 2020-03-12 12:49     ` Adrian Bunk
  2020-03-12 13:25       ` Mikko.Rapeli
  2020-03-12 13:04     ` Mittal, Anuj
  1 sibling, 1 reply; 7+ messages in thread
From: Adrian Bunk @ 2020-03-12 12:49 UTC (permalink / raw)
  To: Mikko.Rapeli; +Cc: openembedded-core

On Thu, Mar 12, 2020 at 12:34:19PM +0000, Mikko.Rapeli@bmw.de wrote:
> On Thu, Mar 12, 2020 at 12:25:21PM +0000, Mittal, Anuj wrote:
> > It looks like this is changing the API. I wonder if this would need any
> > other change or break something elsewhere in OE-core, meta-oe?
> > 
> > http://aspell.net/buffer-overread-ucs.txt
> 
> > Debian classified the issue as minor and fixed it only by updating
> > to 0.60.8:
> 
> https://security-tracker.debian.org/tracker/CVE-2019-20433
> 
> https://metadata.ftp-master.debian.org/changelogs//main/a/aspell/aspell_0.60.8-1_changelog
> 
> Maybe whitelist for stable branches and update to new version on master?

master already has the new version.

IMHO whitelisting is wrong unless there is a clear and documented
policy on what kinds of vulnerabilities get whitelisted.

But even then "Base Score: 9.1 CRITICAL"[1] would make whitelisting 
unlikely in this case.

> Cheers,
> 
> -Mikko

cu
Adrian

[1] https://nvd.nist.gov/vuln/detail/CVE-2019-20433


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [zeus] aspell: CVE-2019-20433
  2020-03-12 12:34   ` Mikko.Rapeli
  2020-03-12 12:49     ` Adrian Bunk
@ 2020-03-12 13:04     ` Mittal, Anuj
  1 sibling, 0 replies; 7+ messages in thread
From: Mittal, Anuj @ 2020-03-12 13:04 UTC (permalink / raw)
  To: Mikko.Rapeli@bmw.de; +Cc: openembedded-core@lists.openembedded.org



> -----Original Message-----
> From: Mikko.Rapeli@bmw.de <Mikko.Rapeli@bmw.de>
> Sent: Thursday, March 12, 2020 08:34 PM
> To: Mittal, Anuj <anuj.mittal@intel.com>
> Cc: openembedded-core@lists.openembedded.org; stefan.ghinea@windriver.com
> Subject: Re: [OE-core] [PATCH] [zeus] aspell: CVE-2019-20433
> 
> On Thu, Mar 12, 2020 at 12:25:21PM +0000, Mittal, Anuj wrote:
> > It looks like this is changing the API. I wonder if this would need
> > any other change or break something elsewhere in OE-core, meta-oe?
> >
> > http://aspell.net/buffer-overread-ucs.txt
> 
> > Debian classified the issue as minor and fixed it only by updating to 0.60.8:

They were applied to 0.60.7:

https://salsa.debian.org/debian/aspell/-/commit/ab3214b1e758646c5a995d277ac80f6d04566149

https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=935128

I think that "minor" categorization is for versions where it wasn't fixed. The NVD severity at the top says medium and it has been assigned a score of 9.1.

> 
> https://security-tracker.debian.org/tracker/CVE-2019-20433
> 
> https://metadata.ftp-master.debian.org/changelogs//main/a/aspell/aspell_0.60.8-
> 1_changelog
> 
> Maybe whitelist for stable branches and update to new version on master?
> 

Whitelisting doesn't sound like the right thing to do here, especially since this is a valid problem.

Thanks,

Anuj


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [zeus] aspell: CVE-2019-20433
  2020-03-12 12:49     ` Adrian Bunk
@ 2020-03-12 13:25       ` Mikko.Rapeli
  0 siblings, 0 replies; 7+ messages in thread
From: Mikko.Rapeli @ 2020-03-12 13:25 UTC (permalink / raw)
  To: bunk, anuj.mittal; +Cc: openembedded-core

Yes, you are correct. White listing isn't right either.

-Mikko

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] [zeus] aspell: CVE-2019-20433
  2020-03-12 12:25 ` Mittal, Anuj
  2020-03-12 12:34   ` Mikko.Rapeli
@ 2020-03-12 14:35   ` Stefan Robert Ghinea
  1 sibling, 0 replies; 7+ messages in thread
From: Stefan Robert Ghinea @ 2020-03-12 14:35 UTC (permalink / raw)
  To: Mittal, Anuj, openembedded-core@lists.openembedded.org

[-- Attachment #1: Type: text/plain, Size: 47583 bytes --]

I looked for dependent packages in oe-core and in meta-oe with grep and found only enchant
and enchant2. I was able to build both of them with the aspell patch applied.
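
A successful build does not exercise the new runtime check, so for reference here is a
minimal standalone sketch of the size rule the patch adds in common/convert.hpp
(the three-argument get_correct_size). It is not the upstream code: the name
check_size, the explicit "sloppy" parameter and the use of std::optional in place of
PosibErr<int> are assumptions made only for illustration.

```cpp
#include <cassert>
#include <optional>

// Hypothetical stand-in for the patched get_correct_size(); not the Aspell API.
// conv_type_width: bytes per character of the configured encoding
//                  (1 for 8-bit/UTF-8, 2 for ucs-2, 4 for ucs-4).
// size:            byte length supplied by the caller, or a negative
//                  value meaning "null terminated".
// sloppy:          models a build with --enable-sloppy-null-term-strings.
// Returns the size to hand to the converter, or std::nullopt where the
// patched library would report an error instead of converting.
std::optional<int> check_size(int conv_type_width, int size, bool sloppy = false) {
  if (sloppy && size <= -1)
    return -conv_type_width;      // old behaviour: trust the terminator
  if (size <= -1 && -conv_type_width != size)
    return std::nullopt;          // e.g. size == -1 while encoding is ucs-2
  return size;                    // explicit length, or matching -width
}
```

With this rule, check_size(1, -1) and check_size(2, 10) pass through unchanged, while
check_size(2, -1) is rejected. Under that reading, a caller is only affected at run
time if it passes -1 as the size while the encoding is ucs-2 or ucs-4.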

Best regards,
Stefan Ghinea

On 3/12/20 14:25, Mittal, Anuj wrote:
> It looks like this is changing the API. I wonder if this would need any
> other change or break something elsewhere in OE-core, meta-oe?
>
> http://aspell.net/buffer-overread-ucs.txt
>
> Thanks,
>
> Anuj
>
> On Thu, 2020-03-12 at 11:23 +0200, Stefan Ghinea wrote:
>> libaspell.a in GNU Aspell before 0.60.8 has a buffer over-read for a string
>> ending with a single '\0' byte, if the encoding is set to ucs-2 or ucs-4
>> outside of the application, as demonstrated by the ASPELL_CONF environment
>> variable.
>>
>> References:
>> https://nvd.nist.gov/vuln/detail/CVE-2019-20433
>>
>> Upstream patches:
>> https://github.com/GNUAspell/aspell/commit/de29341638833ba7717bd6b5e6850998454b044b
>> https://github.com/GNUAspell/aspell/commit/cefd447e5528b08bb0cd6656bc52b4255692cefc
>>
>> Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
>> ---
>>   .../aspell/aspell/CVE-2019-20433-0001.patch   | 999 ++++++++++++++++++
>>   .../aspell/aspell/CVE-2019-20433-0002.patch   |  68 ++
>>   meta/recipes-support/aspell/aspell_0.60.7.bb  |   2 +
>>   3 files changed, 1069 insertions(+)
>>   create mode 100644 meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch
>>   create mode 100644 meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch
>>
>> diff --git a/meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch
>> new file mode 100644
>> index 0000000000..fd68461e32
>> --- /dev/null
>> +++ b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0001.patch
>> @@ -0,0 +1,999 @@
>> +From de29341638833ba7717bd6b5e6850998454b044b Mon Sep 17 00:00:00 2001
>> +From: Kevin Atkinson <kevina@gnu.org>
>> +Date: Sat, 17 Aug 2019 17:06:53 -0400
>> +Subject: [PATCH 1/2] Don't allow null-terminated UCS-2/4 strings using the
>> + original API.
>> +
>> +Detect if the encoding is UCS-2/4 and the length is -1 in affected API
>> +functions and refuse to convert the string.  If the string ends up
>> +being converted somehow, abort with an error message in DecodeDirect
>> +and ConvDirect.  To convert a null terminated string in
>> +Decode/ConvDirect, a negative number corresponding to the width of the
>> +underlying character type for the encoding is expected; for example,
>> +if the encoding is "ucs-2" then a the size is expected to be -2.
>> +
>> +Also fix a 1-3 byte over-read in DecodeDirect when reading UCS-2/4
>> +strings when a size is provided (found by OSS-Fuzz).
>> +
>> +Also fix a bug in DecodeDirect that caused DocumentChecker to return
>> +the wrong offsets when working with UCS-2/4 strings.
>> +
>> +CVE: CVE-2019-20433
>> +Upstream-Status: Backport [https://github.com/GNUAspell/aspell/commit/de29341638833ba7717bd6b5e6850998454b044b]
>> +
>> +[SG: - adjusted context
>> +     - discarded test changes as test framework is not available
>> +     - discarded manual entry changes for features that aren't backported]
>> +Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
>> +---
>> + auto/MkSrc/CcHelper.pm      | 99 ++++++++++++++++++++++++++++++++++---
>> + auto/MkSrc/Create.pm        |  5 +-
>> + auto/MkSrc/Info.pm          |  5 +-
>> + auto/MkSrc/ProcCc.pm        | 24 +++++----
>> + auto/MkSrc/ProcImpl.pm      | 57 +++++++++++++++------
>> + auto/MkSrc/Read.pm          |  4 +-
>> + auto/mk-src.in              | 44 +++++++++++++++--
>> + common/convert.cpp          | 39 ++++++++++++---
>> + common/convert.hpp          | 38 +++++++++++++-
>> + common/document_checker.cpp | 17 ++++++-
>> + common/document_checker.hpp |  1 +
>> + common/version.cpp          | 15 ++++--
>> + configure.ac                |  8 +++
>> + manual/aspell.texi          | 58 ++++++++++++++++------
>> + manual/readme.texi          | 70 +++++++++++++++++++++-----
>> + 15 files changed, 409 insertions(+), 75 deletions(-)
>> +
>> +diff --git a/auto/MkSrc/CcHelper.pm b/auto/MkSrc/CcHelper.pm
>> +index f2de991..0044335 100644
>> +--- a/auto/MkSrc/CcHelper.pm
>> ++++ b/auto/MkSrc/CcHelper.pm
>> +@@ -10,8 +10,8 @@ BEGIN {
>> +   use Exporter;
>> +   our @ISA = qw(Exporter);
>> +   our @EXPORT = qw(to_c_return_type c_error_cond
>> +-		   to_type_name make_desc make_func call_func
>> +-		   make_c_method call_c_method form_c_method
>> ++		   to_type_name make_desc make_func call_func get_c_func_name
>> ++		   make_c_method make_wide_macro call_c_method form_c_method
>> + 		   make_cxx_method);
>> + }
>> +
>> +@@ -90,6 +90,69 @@ sub make_func ( $ \@ $ ; \% ) {
>> + 	   ')'));
>> + }
>> +
>> ++=item make_wide_version NAME @TYPES PARMS ; %ACCUM
>> ++
>> ++Creates the wide character version of the function if needed
>> ++
>> ++=cut
>> ++
>> ++sub make_wide_version ( $ \@ $ ; \% ) {
>> ++  my ($name, $d, $p, $accum) = @_;
>> ++  my @d = @$d;
>> ++  shift @d;
>> ++  return '' unless grep {$_->{type} eq 'encoded string'} @d;
>> ++  $accum->{sys_headers}{'stddef.h'} = true;
>> ++  $accum->{suffix}[5] = <<'---';
>> ++
>> ++/******************* private implementation details *********************/
>> ++
>> ++#ifdef __cplusplus
>> ++#  define aspell_cast_(type, expr) (static_cast<type>(expr))
>> ++#  define aspell_cast_from_wide_(str) (static_cast<const void *>(str))
>> ++#else
>> ++#  define aspell_cast_(type, expr) ((type)(expr))
>> ++#  define aspell_cast_from_wide_(str) ((const char *)(str))
>> ++#endif
>> ++---
>> ++  my @parms = map {$_->{type} eq 'encoded string'
>> ++                       ? ($_->{name}, $_->{name}.'_size')
>> ++                       : $_->{name}} @d;
>> ++  $name = to_lower $name;
>> ++  $accum->{suffix}[0] = <<'---';
>> ++/**********************************************************************/
>> ++
>> ++#ifdef ASPELL_ENCODE_SETTING_SECURE
>> ++---
>> ++  $accum->{suffix}[2] = "#endif\n";
>> ++  my @args = map  {$_->{type} eq 'encoded string'
>> ++                       ? ($_->{name}, "$_->{name}_size", '-1')
>> ++                       : $_->{name}} @d;
>> ++  $accum->{suffix}[1] .=
>> ++      (join '',
>> ++       "#define $name",
>> ++       '(', join(', ', @parms), ')',
>> ++       "\\\n    ",
>> ++       $name, '_wide',
>> ++       '(', join(', ', @args), ')',
>> ++       "\n");
>> ++  @args = map  {$_->{type} eq 'encoded string'
>> ++                    ? ("aspell_cast_from_wide_($_->{name})",
>> ++                       "$_->{name}_size*aspell_cast_(int,sizeof(*($_->{name})))",
>> ++                       "sizeof(*($_->{name}))")
>> ++                    : $_->{name}} @d;
>> ++  return (join '',
>> ++          "\n",
>> ++          "/* version of $name that is safe to use with (null terminated) wide characters */\n",
>> ++          '#define ',
>> ++          $name, '_w',
>> ++          '(', join(', ', @parms), ')',
>> ++          "\\\n    ",
>> ++          $name, '_wide',
>> ++          '(', join(', ', @args), ')',
>> ++          "\n");
>> ++}
>> ++
>> ++
>> + =item call_func NAME @TYPES PARMS ; %ACCUM
>> +
>> + Return a string to call a func.  Will prefix the function with return
>> +@@ -103,7 +166,6 @@ Parms can be any of:
>> +
>> + sub call_func ( $ \@ $ ; \% ) {
>> +   my ($name, $d, $p, $accum) = @_;
>> +-  $accum = {} unless defined $accum;
>> +   my @d = @$d;
>> +   my $func_ret = to_type_name(shift @d, {%$p,pos=>'return'}, %$accum);
>> +   return (join '',
>> +@@ -148,8 +210,14 @@ sub to_type_name ( $ $ ; \% ) {
>> +   my $name = $t->{name};
>> +   my $type = $t->{type};
>> +
>> +-  return ( (to_type_name {%$d, type=>'string'}, $p, %$accum) ,
>> +-	   (to_type_name {%$d, type=>'int', name=>"$d->{name}_size"}, $p, %$accum) )
>> ++  if ($name eq 'encoded string' && $is_cc && $pos eq 'parm') {
>> ++    my @types = ((to_type_name {%$d, type=>($p->{wide}?'const void pointer':'string')}, $p, %$accum),
>> ++                 (to_type_name {%$d, type=>'int', name=>"$d->{name}_size"}, $p, %$accum));
>> ++    push @types, (to_type_name {%$d, type=>'int', name=>"$d->{name}_type_width"}, $p, %$accum) if $p->{wide};
>> ++    return @types;
>> ++  }
>> ++  return ( (to_type_name {%$d, type=>($p->{wide}?'const void pointer':'string')}, $p, %$accum) ,
>> ++           (to_type_name {%$d, type=>'int', name=>"$d->{name}_size"}, $p, %$accum) )
>> +       if $name eq 'encoded string' && $is_cc && $pos eq 'parm';
>> +
>> +   my $str;
>> +@@ -174,7 +242,7 @@ sub to_type_name ( $ $ ; \% ) {
>> + 	$str .= "String";
>> +       }
>> +     } elsif ($name eq 'encoded string') {
>> +-      $str .= "const char *";
>> ++      $str .= $p->{wide} ? "const void *" : "const char *";
>> +     } elsif ($name eq '') {
>> +       $str .= "void";
>> +     } elsif ($name eq 'bool' && $is_cc) {
>> +@@ -186,7 +254,7 @@ sub to_type_name ( $ $ ; \% ) {
>> +       if ($t->{pointer}) {
>> + 	$accum->{types}->{$name} = $t;
>> +       } else {
>> +-	$accum->{headers}->{$t->{created_in}} = true;
>> ++        $accum->{headers}->{$t->{created_in}} = true unless $mode eq 'cc';
>> +       }
>> +       $str .= "$c_type Aspell" if $mode eq 'cc';
>> +       $str .= to_mixed($name);
>> +@@ -214,6 +282,7 @@ sub to_type_name ( $ $ ; \% ) {
>> +   return $str;
>> + }
>> +
>> ++
>> + =item make_desc DESC ; LEVEL
>> +
>> + Make a C comment out of DESC optionally indenting it LEVEL spaces.
>> +@@ -286,6 +355,7 @@ sub form_c_method ($ $ $ ; \% )
>> +     } else {
>> +       $func = "aspell $class $name";
>> +     }
>> ++    $func .= " wide" if $p->{wide};
>> +     if (exists $d->{'const'}) {
>> +       splice @data, 1, 0, {type => "const $class", name=> $this_name};
>> +     } else {
>> +@@ -306,6 +376,21 @@ sub make_c_method ($ $ $ ; \%)
>> +   return &make_func(@ret);
>> + }
>> +
>> ++sub get_c_func_name ($ $ $)
>> ++{
>> ++  my @ret = &form_c_method(@_);
>> ++  return undef unless @ret > 0;
>> ++  return to_lower $ret[0];
>> ++}
>> ++
>> ++sub make_wide_macro ($ $ $ ; \%)
>> ++{
>> ++  my @ret = &form_c_method(@_);
>> ++  return undef unless @ret > 0;
>> ++  my $str = &make_wide_version(@ret);
>> ++  return $str;
>> ++}
>> ++
>> + sub call_c_method ($ $ $ ; \%)
>> + {
>> +   my @ret = &form_c_method(@_);
>> +diff --git a/auto/MkSrc/Create.pm b/auto/MkSrc/Create.pm
>> +index d39b60e..630ede5 100644
>> +--- a/auto/MkSrc/Create.pm
>> ++++ b/auto/MkSrc/Create.pm
>> +@@ -77,8 +77,10 @@ sub create_cc_file ( % )  {
>> +   $file .= "#include \"aspell.h\"\n" if $p{type} eq 'cxx';
>> +   $file .= "#include \"settings.h\"\n" if $p{type} eq 'native_impl' && $p{name} eq 'errors';
>> +   $file .= "#include \"gettext.h\"\n" if $p{type} eq 'native_impl' && $p{name} eq 'errors';
>> ++  $file .= cmap {"#include <$_>\n"} sort keys %{$accum{sys_headers}};
>> +   $file .= cmap {"#include \"".to_lower($_).".hpp\"\n"} sort keys %{$accum{headers}};
>> +-  $file .= "#ifdef __cplusplus\nextern \"C\" {\n#endif\n" if $p{header} && !$p{cxx};
>> ++  $file .= "\n#ifdef __cplusplus\nextern \"C\" {\n#endif\n" if $p{header} && !$p{cxx};
>> ++  $file .= join('', grep {defined $_} @{$accum{prefix}});
>> +   $file .= "\nnamespace $p{namespace} {\n\n" if $p{cxx};
>> +   if (defined $info{forward}{proc}{$p{type}}) {
>> +     my @types = sort {$a->{name} cmp $b->{name}} (values %{$accum{types}});
>> +@@ -86,6 +88,7 @@ sub create_cc_file ( % )  {
>> +   }
>> +   $file .= "\n";
>> +   $file .= $body;
>> ++  $file .= join('', grep {defined $_} @{$accum{suffix}});
>> +   $file .= "\n\n}\n\n" if $p{cxx};
>> +   $file .= "#ifdef __cplusplus\n}\n#endif\n" if $p{header} && !$p{cxx};
>> +   $file .= "#endif /* $hm */\n" if $p{header};
>> +diff --git a/auto/MkSrc/Info.pm b/auto/MkSrc/Info.pm
>> +index c644028..ace8e21 100644
>> +--- a/auto/MkSrc/Info.pm
>> ++++ b/auto/MkSrc/Info.pm
>> +@@ -60,6 +60,7 @@ each proc sub should take the following argv
>> +     the object from which it is a member of
>> +   no native: do not attempt to create a native implementation
>> +   treat as object: treat as a object rather than a pointer
>> ++  no conv: do not convert an encoded string
>> +
>> + The %info structure is initialized as follows:
>> +
>> +@@ -104,8 +105,8 @@ The %info structure is initialized as follows:
>> +   errors => {}, # possible errors
>> +   method => {
>> +     # A class method
>> +-    options => ['desc', 'posib err', 'c func', 'const',
>> +-		'c only', 'c impl', 'cxx impl'],
>> ++    options => ['desc', 'posib err', 'c func', 'const', 'no conv', 'on conv error',
>> ++		'c only', 'c impl', 'cxx impl', 'cc extra'],
>> +     groups => undef},
>> +   constructor => {
>> +     # A class constructor
>> +diff --git a/auto/MkSrc/ProcCc.pm b/auto/MkSrc/ProcCc.pm
>> +index 47c4338..98cc435 100644
>> +--- a/auto/MkSrc/ProcCc.pm
>> ++++ b/auto/MkSrc/ProcCc.pm
>> +@@ -23,7 +23,7 @@ use MkSrc::Info;
>> + sub make_c_object ( $ @ );
>> +
>> + $info{group}{proc}{cc} = sub {
>> +-  my ($data) = @_;
>> ++  my ($data,@rest) = @_;
>> +   my $ret;
>> +   my $stars = (70 - length $data->{name})/2;
>> +   $ret .= "/";
>> +@@ -33,14 +33,14 @@ $info{group}{proc}{cc} = sub {
>> +   $ret .= "/\n";
>> +   foreach my $d (@{$data->{data}}) {
>> +     $ret .= "\n\n";
>> +-    $ret .= $info{$d->{type}}{proc}{cc}->($d);
>> ++    $ret .= $info{$d->{type}}{proc}{cc}->($d,@rest);
>> +   }
>> +   $ret .= "\n\n";
>> +   return $ret;
>> + };
>> +
>> + $info{enum}{proc}{cc} = sub {
>> +-  my ($d) = @_;
>> ++  my ($d,@rest) = @_;
>> +   my $n = "Aspell".to_mixed($d->{name});
>> +   return ("\n".
>> + 	  make_desc($d->{desc}).
>> +@@ -58,21 +58,26 @@ $info{struct}{proc}{cc} = sub {
>> + };
>> +
>> + $info{union}{proc}{cc} = sub {
>> +-  return make_c_object "union", $_[0];
>> ++  return make_c_object "union", @_;
>> + };
>> +
>> + $info{class}{proc}{cc} = sub {
>> +-  my ($d) = @_;
>> ++  my ($d,$accum) = @_;
>> +   my $class = $d->{name};
>> +   my $classname = "Aspell".to_mixed($class);
>> +   my $ret = "";
>> +   $ret .= "typedef struct $classname $classname;\n\n";
>> +   foreach (@{$d->{data}}) {
>> +-    my $s = make_c_method($class, $_, {mode=>'cc'});
>> ++    my $s = make_c_method($class, $_, {mode=>'cc'}, %$accum);
>> +     next unless defined $s;
>> +     $ret .= "\n";
>> +     $ret .= make_desc($_->{desc});
>> +-    $ret .= make_c_method($class, $_, {mode=>'cc'}).";\n";
>> ++    $ret .= make_c_method($class, $_, {mode=>'cc'}, %$accum).";\n";
>> ++    if (grep {$_->{type} eq 'encoded string'} @{$_->{data}}) {
>> ++      $ret .= make_c_method($class, $_, {mode=>'cc', wide=>true}, %$accum).";\n";
>> ++      $ret .= make_wide_macro($class, $_, {mode=>'cc'}, %$accum);
>> ++    }
>> ++    $ret .= "\n".$_->{'cc extra'}."\n" if defined $_->{'cc extra'};
>> +   }
>> +   $ret .= "\n";
>> +   return $ret;
>> +@@ -105,7 +110,8 @@ $info{errors}{proc}{cc} = sub {
>> + };
>> +
>> + sub make_c_object ( $ @ ) {
>> +-  my ($t, $d) = @_;
>> ++  my ($t, $d, $accum) = @_;
>> ++  $accum = {} unless defined $accum;
>> +   my $struct;
>> +   $struct .= "Aspell";
>> +   $struct .= to_mixed($d->{name});
>> +@@ -120,7 +126,7 @@ sub make_c_object ( $ @ ) {
>> + 		"\n};\n"),
>> + 	  "typedef $t $struct $struct;",
>> + 	  join ("\n",
>> +-		map {make_c_method($d->{name}, $_, {mode=>'cc'}).";"}
>> ++		map {make_c_method($d->{name}, $_, {mode=>'cc'}, %$accum).";"}
>> + 		grep {$_->{type} eq 'method'}
>> + 		@{$d->{data}})
>> + 	  )."\n";
>> +diff --git a/auto/MkSrc/ProcImpl.pm b/auto/MkSrc/ProcImpl.pm
>> +index b8628fd..3d0f220 100644
>> +--- a/auto/MkSrc/ProcImpl.pm
>> ++++ b/auto/MkSrc/ProcImpl.pm
>> +@@ -45,10 +45,13 @@ $info{class}{proc}{impl} = sub {
>> +   foreach (grep {$_ ne ''} split /\s*,\s*/, $data->{'c impl headers'}) {
>> +     $accum->{headers}{$_} = true;
>> +   }
>> +-  foreach my $d (@{$data->{data}}) {
>> ++  my @d = @{$data->{data}};
>> ++  while (@d) {
>> ++    my $d = shift @d;
>> ++    my $need_wide = false;
>> +     next unless one_of $d->{type}, qw(method constructor destructor);
>> +     my @parms = @{$d->{data}} if exists $d->{data};
>> +-    my $m = make_c_method $data->{name}, $d, {mode=>'cc_cxx', use_name=>true}, %$accum;
>> ++    my $m = make_c_method $data->{name}, $d, {mode=>'cc_cxx', use_name=>true, wide=>$d->{wide}}, %$accum;
>> +     next unless defined $m;
>> +     $ret .= "extern \"C\" $m\n";
>> +     $ret .= "{\n";
>> +@@ -57,24 +60,49 @@ $info{class}{proc}{impl} = sub {
>> +     } else {
>> +       if ($d->{type} eq 'method') {
>> + 	my $ret_type = shift @parms;
>> +-	my $ret_native = to_type_name $ret_type, {mode=>'native_no_err', pos=>'return'}, %$accum;
>> ++	my $ret_native = to_type_name $ret_type, {mode=>'native_no_err', pos=>'return', wide=>$d->{wide}}, %$accum;
>> + 	my $snum = 0;
>> ++        my $call_fun = $d->{name};
>> ++        my @call_parms;
>> + 	foreach (@parms) {
>> + 	  my $n = to_lower($_->{name});
>> +-	  if ($_->{type} eq 'encoded string') {
>> +-	    $accum->{headers}{'mutable string'} = true;
>> +-	    $accum->{headers}{'convert'} = true;
>> +-	    $ret .= "  ths->temp_str_$snum.clear();\n";
>> +-	    $ret .= "  ths->to_internal_->convert($n, ${n}_size, ths->temp_str_$snum);\n";
>> +-	    $ret .= "  unsigned int s$snum = ths->temp_str_$snum.size();\n";
>> +-	    $_ = "MutableString(ths->temp_str_$snum.mstr(), s$snum)";
>> +-	    $snum++;
>> ++	  if ($_->{type} eq 'encoded string' && !exists($d->{'no conv'})) {
>> ++            $need_wide = true unless $d->{wide};
>> ++            die unless exists $d->{'posib err'};
>> ++            $accum->{headers}{'mutable string'} = true;
>> ++            $accum->{headers}{'convert'} = true;
>> ++            my $name = get_c_func_name $data->{name}, $d, {mode=>'cc_cxx', use_name=>true, wide=>$d->{wide}};
>> ++            $ret .= "  ths->temp_str_$snum.clear();\n";
>> ++            if ($d->{wide}) {
>> ++              $ret .= "  ${n}_size = get_correct_size(\"$name\", ths->to_internal_->in_type_width(), ${n}_size, ${n}_type_width);\n";
>> ++            } else {
>> ++              $ret .= "  PosibErr<int> ${n}_fixed_size = get_correct_size(\"$name\", ths->to_internal_->in_type_width(), ${n}_size);\n";
>> ++              if (exists($d->{'on conv error'})) {
>> ++                $ret .= "  if (${n}_fixed_size.get_err()) {\n";
>> ++                $ret .= "    ".$d->{'on conv error'}."\n";
>> ++                $ret .= "  } else {\n";
>> ++                $ret .= "    ${n}_size = ${n}_fixed_size;\n";
>> ++                $ret .= "  }\n";
>> ++              } else {
>> ++                $ret .= "  ths->err_.reset(${n}_fixed_size.release_err());\n";
>> ++                $ret .= "  if (ths->err_ != 0) return ".(c_error_cond $ret_type).";\n";
>> ++              }
>> ++            }
>> ++            $ret .= "  ths->to_internal_->convert($n, ${n}_size, ths->temp_str_$snum);\n";
>> ++            $ret .= "  unsigned int s$snum = ths->temp_str_$snum.size();\n";
>> ++            push @call_parms, "MutableString(ths->temp_str_$snum.mstr(), s$snum)";
>> ++            $snum++;
>> ++          } elsif ($_->{type} eq 'encoded string') {
>> ++            $need_wide = true unless $d->{wide};
>> ++            push @call_parms, $n, "${n}_size";
>> ++            push @call_parms, "${n}_type_width" if $d->{wide};
>> ++            $call_fun .= " wide" if $d->{wide};
>> + 	  } else {
>> +-	    $_ = $n;
>> ++	    push @call_parms, $n;
>> + 	  }
>> + 	}
>> +-	my $parms = '('.(join ', ', @parms).')';
>> +-	my $exp = "ths->".to_lower($d->{name})."$parms";
>> ++	my $parms = '('.(join ', ', @call_parms).')';
>> ++	my $exp = "ths->".to_lower($call_fun)."$parms";
>> + 	if (exists $d->{'posib err'}) {
>> + 	  $accum->{headers}{'posib err'} = true;
>> + 	  $ret .= "  PosibErr<$ret_native> ret = $exp;\n";
>> +@@ -118,6 +146,7 @@ $info{class}{proc}{impl} = sub {
>> +       }
>> +     }
>> +     $ret .= "}\n\n";
>> ++    unshift @d,{%$d, wide=>true} if $need_wide;
>> +   }
>> +   return $ret;
>> + };
>> +diff --git a/auto/MkSrc/Read.pm b/auto/MkSrc/Read.pm
>> +index 4b3d1d0..4bf640e 100644
>> +--- a/auto/MkSrc/Read.pm
>> ++++ b/auto/MkSrc/Read.pm
>> +@@ -88,13 +88,13 @@ sub advance ( ) {
>> +     $in_pod = $1 if $line =~ /^\=(\w+)/;
>> +     $line = '' if $in_pod;
>> +     $in_pod = undef if $in_pod && $in_pod eq 'cut';
>> +-    $line =~ s/\#.*$//;
>> ++    $line =~ s/(?<!\\)\#.*$//;
>> +     $line =~ s/^(\t*)//;
>> +     $level = $base_level + length($1);
>> +       $line =~ s/\s*$//;
>> +     ++$base_level if $line =~ s/^\{$//;
>> +     --$base_level if $line =~ s/^\}$//;
>> +-    $line =~ s/\\([{}])/$1/g;
>> ++    $line =~ s/\\([{}#\\])/$1/g;
>> +   } while ($line eq '');
>> +   #print "$level:$line\n";
>> + }
>> +diff --git a/auto/mk-src.in b/auto/mk-src.in
>> +index 0e7833a..eb3353f 100644
>> +--- a/auto/mk-src.in
>> ++++ b/auto/mk-src.in
>> +@@ -608,6 +608,7 @@ errors:
>> + 		invalid expression
>> + 			mesg => "%expression" is not a valid regular expression.
>> + 			parms => expression
>> ++
>> + }
>> + group: speller
>> + {
>> +@@ -650,6 +651,7 @@ class: speller
>> + 		posib err
>> + 		desc => Returns 0 if it is not in the dictionary,
>> + 			1 if it is, or -1 on error.
>> ++		on conv error => return 0;
>> + 		/
>> + 		bool
>> + 		encoded string: word
>> +@@ -715,6 +717,8 @@ class: speller
>> + 		desc => Return NULL on error.
>> + 			The word list returned by suggest is only
>> + 			valid until the next call to suggest.
>> ++		on conv error =>
>> ++			word = NULL; word_size = 0;
>> + 		/
>> + 		const word list
>> + 		encoded string: word
>> +@@ -840,7 +844,6 @@ class: document checker
>> + 		void
>> +
>> + 	method: process
>> +-
>> + 		desc => Process a string.
>> + 			The string passed in should only be split on
>> + 			white space characters.  Furthermore, between
>> +@@ -849,10 +852,10 @@ class: document checker
>> + 			in the document.  Passing in strings out of
>> + 			order, skipping strings or passing them in
>> + 			more than once may lead to undefined results.
>> ++		no conv
>> + 		/
>> + 		void
>> +-		string: str
>> +-		int: size
>> ++		encoded string: str
>> +
>> + 	method: next misspelling
>> +
>> +@@ -860,9 +863,23 @@ class: document checker
>> + 			processed string.  If there are no more
>> + 			misspelled words, then token.word will be
>> + 			NULL and token.size will be 0
>> ++		cc extra =>
>> ++			\#define aspell_document_checker_next_misspelling_w(type, ths) \\
>> ++			    aspell_document_checker_next_misspelling_adj(ths, sizeof(type))
>> + 		/
>> + 		token object
>> +
>> ++	method: next misspelling adj
>> ++		desc => internal: do not use
>> ++		c impl =>
>> ++			Token res = ths->next_misspelling();
>> ++			res.offset /= type_width;
>> ++			res.len /= type_width;
>> ++			return res;
>> ++		/
>> ++		token object
>> ++		int: type_width
>> ++
>> + 	method: filter
>> +
>> + 		desc => Returns the underlying filter class.
>> +@@ -922,9 +939,30 @@ class: string enumeration
>> + 			  ths->from_internal_->append_null(ths->temp_str);
>> + 			  return ths->temp_str.data();
>> + 			\}
>> ++		cc extra =>
>> ++			\#define aspell_string_enumeration_next_w(type, ths) \\
>> ++			    aspell_cast_(const type *, aspell_string_enumeration_next_wide(ths, sizeof(type)))
>> + 		/
>> + 		const string
>> +
>> ++	method: next wide
>> ++		c impl =>
>> ++			const char * s = ths->next();
>> ++			if (s == 0) {
>> ++			  return s;
>> ++			} else if (ths->from_internal_ == 0) \{
>> ++			  assert(type_width == 1);
>> ++			  return s;
>> ++			\} else \{
>> ++			  assert(type_width == ths->from_internal_->out_type_width());
>> ++			  ths->temp_str.clear();
>> ++			  ths->from_internal_->convert(s,-1,ths->temp_str);
>> ++			  ths->from_internal_->append_null(ths->temp_str);
>> ++			  return ths->temp_str.data();
>> ++			\}
>> ++		/
>> ++		const void pointer
>> ++		int: type_width
>> + }
>> + group: info
>> + {
>> +diff --git a/common/convert.cpp b/common/convert.cpp
>> +index 1add95a..7ae0317 100644
>> +--- a/common/convert.cpp
>> ++++ b/common/convert.cpp
>> +@@ -541,18 +541,25 @@ namespace acommon {
>> +   // Trivial Conversion
>> +   //
>> +
>> ++  const char * unsupported_null_term_wide_string_msg =
>> ++    "Null-terminated wide-character strings unsupported when used this way.";
>> ++
>> +   template <typename Chr>
>> +   struct DecodeDirect : public Decode
>> +   {
>> ++    DecodeDirect() {type_width = sizeof(Chr);}
>> +     void decode(const char * in0, int size, FilterCharVector & out) const {
>> +       const Chr * in = reinterpret_cast<const Chr *>(in0);
>> +-      if (size == -1) {
>> ++      if (size == -sizeof(Chr)) {
>> +         for (;*in; ++in)
>> +-          out.append(*in);
>> ++          out.append(*in, sizeof(Chr));
>> ++      } else if (size <= -1) {
>> ++        fprintf(stderr, "%s\n", unsupported_null_term_wide_string_msg);
>> ++        abort();
>> +       } else {
>> +-        const Chr * stop = reinterpret_cast<const Chr *>(in0 +size);
>> ++        const Chr * stop = reinterpret_cast<const Chr *>(in0) + size/sizeof(Chr);
>> +         for (;in != stop; ++in)
>> +-          out.append(*in);
>> ++          out.append(*in, sizeof(Chr));
>> +       }
>> +     }
>> +     PosibErr<void> decode_ec(const char * in0, int size,
>> +@@ -565,6 +572,7 @@ namespace acommon {
>> +   template <typename Chr>
>> +   struct EncodeDirect : public Encode
>> +   {
>> ++    EncodeDirect() {type_width = sizeof(Chr);}
>> +     void encode(const FilterChar * in, const FilterChar * stop,
>> +                 CharVector & out) const {
>> +       for (; in != stop; ++in) {
>> +@@ -594,11 +602,15 @@ namespace acommon {
>> +   template <typename Chr>
>> +   struct ConvDirect : public DirectConv
>> +   {
>> ++    ConvDirect() {type_width = sizeof(Chr);}
>> +     void convert(const char * in0, int size, CharVector & out) const {
>> +-      if (size == -1) {
>> ++      if (size == -sizeof(Chr)) {
>> +         const Chr * in = reinterpret_cast<const Chr *>(in0);
>> +         for (;*in != 0; ++in)
>> +           out.append(in, sizeof(Chr));
>> ++      } else if (size <= -1) {
>> ++        fprintf(stderr, "%s\n", unsupported_null_term_wide_string_msg);
>> ++        abort();
>> +       } else {
>> +         out.append(in0, size);
>> +       }
>> +@@ -1121,5 +1133,20 @@ namespace acommon {
>> +     }
>> +     return 0;
>> +   }
>> +-
>> ++
>> ++  PosibErr<void> unsupported_null_term_wide_string_err_(const char * func) {
>> ++    static bool reported_to_stderr = false;
>> ++    PosibErr<void> err = make_err(other_error, unsupported_null_term_wide_string_msg);
>> ++    if (!reported_to_stderr) {
>> ++      CERR.printf("ERROR: %s: %s\n", func, unsupported_null_term_wide_string_msg);
>> ++      reported_to_stderr = true;
>> ++    }
>> ++    return err;
>> ++  }
>> ++
>> ++  void unsupported_null_term_wide_string_abort_(const char * func) {
>> ++    CERR.printf("%s: %s\n", func, unsupported_null_term_wide_string_msg);
>> ++    abort();
>> ++  }
>> ++
>> + }
>> +diff --git a/common/convert.hpp b/common/convert.hpp
>> +index 76332ee..c948973 100644
>> +--- a/common/convert.hpp
>> ++++ b/common/convert.hpp
>> +@@ -7,6 +7,8 @@
>> + #ifndef ASPELL_CONVERT__HPP
>> + #define ASPELL_CONVERT__HPP
>> +
>> ++#include "settings.h"
>> ++
>> + #include "string.hpp"
>> + #include "posib_err.hpp"
>> + #include "char_vector.hpp"
>> +@@ -25,8 +27,9 @@ namespace acommon {
>> +     typedef const Config CacheConfig;
>> +     typedef const char * CacheKey;
>> +     String key;
>> ++    int type_width; // type width in bytes
>> +     bool cache_key_eq(const char * l) const  {return key == l;}
>> +-    ConvBase() {}
>> ++    ConvBase() : type_width(1) {}
>> +   private:
>> +     ConvBase(const ConvBase &);
>> +     void operator=(const ConvBase &);
>> +@@ -56,6 +59,8 @@ namespace acommon {
>> +     virtual ~Encode() {}
>> +   };
>> +   struct DirectConv { // convert directly from in_code to out_code.
>> ++    int type_width; // type width in bytes
>> ++    DirectConv() : type_width(1) {}
>> +     // should not take ownership of decode and encode.
>> +     // decode and encode guaranteed to stick around for the life
>> +     // of the object.
>> +@@ -126,6 +131,9 @@ namespace acommon {
>> +     const char * in_code() const   {return decode_->key.c_str();}
>> +     const char * out_code() const  {return encode_->key.c_str();}
>> +
>> ++    int in_type_width() const {return decode_->type_width;}
>> ++    int out_type_width() const {return encode_->type_width;}
>> ++
>> +     void append_null(CharVector & out) const
>> +     {
>> +       const char nul[4] = {0,0,0,0}; // 4 should be enough
>> +@@ -191,6 +199,10 @@ namespace acommon {
>> +       }
>> +     }
>> +
>> ++    void convert(const void * in, int size, CharVector & out) {
>> ++      convert(static_cast<const char *>(in), size, out);
>> ++    }
>> ++
>> +     void generic_convert(const char * in, int size, CharVector & out);
>> +
>> +   };
>> +@@ -412,6 +424,30 @@ namespace acommon {
>> +       return operator()(str, str + byte_size);}
>> +   };
>> +
>> ++#ifdef SLOPPY_NULL_TERM_STRINGS
>> ++  static const bool sloppy_null_term_strings = true;
>> ++#else
>> ++  static const bool sloppy_null_term_strings = false;
>> ++#endif
>> ++
>> ++  PosibErr<void> unsupported_null_term_wide_string_err_(const char * func);
>> ++  void unsupported_null_term_wide_string_abort_(const char * func);
>> ++
>> ++  static inline PosibErr<int> get_correct_size(const char * func, int conv_type_width, int size) {
>> ++    if (sloppy_null_term_strings && size <= -1)
>> ++      return -conv_type_width;
>> ++    if (size <= -1 && -conv_type_width != size)
>> ++      return unsupported_null_term_wide_string_err_(func);
>> ++    return size;
>> ++  }
>> ++  static inline int get_correct_size(const char * func, int conv_type_width, int size, int type_width) {
>> ++    if ((sloppy_null_term_strings || type_width <= -1) && size <= -1)
>> ++      return -conv_type_width;
>> ++    if (size <= -1 && conv_type_width != type_width)
>> ++      unsupported_null_term_wide_string_abort_(func);
>> ++    return size;
>> ++  }
>> ++
>> + }
>> +
>> + #endif
>> +diff --git a/common/document_checker.cpp b/common/document_checker.cpp
>> +index 5e510c4..0ccf1cd 100644
>> +--- a/common/document_checker.cpp
>> ++++ b/common/document_checker.cpp
>> +@@ -44,7 +44,9 @@ namespace acommon {
>> +   void DocumentChecker::process(const char * str, int size)
>> +   {
>> +     proc_str_.clear();
>> +-    conv_->decode(str, size, proc_str_);
>> ++    PosibErr<int> fixed_size = get_correct_size("aspell_document_checker_process", conv_->in_type_width(), size);
>> ++    if (!fixed_size.has_err())
>> ++      conv_->decode(str, fixed_size, proc_str_);
>> +     proc_str_.append(0);
>> +     FilterChar * begin = proc_str_.pbegin();
>> +     FilterChar * end   = proc_str_.pend() - 1;
>> +@@ -53,6 +55,19 @@ namespace acommon {
>> +     tokenizer_->reset(begin, end);
>> +   }
>> +
>> ++  void DocumentChecker::process_wide(const void * str, int size, int type_width)
>> ++  {
>> ++    proc_str_.clear();
>> ++    int fixed_size = get_correct_size("aspell_document_checker_process", conv_->in_type_width(), size, type_width);
>> ++    conv_->decode(static_cast<const char *>(str), fixed_size, proc_str_);
>> ++    proc_str_.append(0);
>> ++    FilterChar * begin = proc_str_.pbegin();
>> ++    FilterChar * end   = proc_str_.pend() - 1;
>> ++    if (filter_)
>> ++      filter_->process(begin, end);
>> ++    tokenizer_->reset(begin, end);
>> ++  }
>> ++
>> +   Token DocumentChecker::next_misspelling()
>> +   {
>> +     bool correct;
>> +diff --git a/common/document_checker.hpp b/common/document_checker.hpp
>> +index d35bb88..11a3c73 100644
>> +--- a/common/document_checker.hpp
>> ++++ b/common/document_checker.hpp
>> +@@ -36,6 +36,7 @@ namespace acommon {
>> +     PosibErr<void> setup(Tokenizer *, Speller *, Filter *);
>> +     void reset();
>> +     void process(const char * str, int size);
>> ++    void process_wide(const void * str, int size, int type_width);
>> +     Token next_misspelling();
>> +
>> +     Filter * filter() {return filter_;}
>> +diff --git a/common/version.cpp b/common/version.cpp
>> +index 414d938..9e60b75 100644
>> +--- a/common/version.cpp
>> ++++ b/common/version.cpp
>> +@@ -1,8 +1,17 @@
>> + #include "settings.h"
>> +
>> +-extern "C" const char * aspell_version_string() {
>> + #ifdef NDEBUG
>> +-  return VERSION " NDEBUG";
>> ++#  define NDEBUG_STR " NDEBUG"
>> ++#else
>> ++#  define NDEBUG_STR
>> ++#endif
>> ++
>> ++#ifdef SLOPPY_NULL_TERM_STRINGS
>> ++#  define SLOPPY_STR " SLOPPY"
>> ++#else
>> ++#  define SLOPPY_STR
>> + #endif
>> +-  return VERSION;
>> ++
>> ++extern "C" const char * aspell_version_string() {
>> ++  return VERSION NDEBUG_STR SLOPPY_STR;
>> + }
>> +diff --git a/configure.ac b/configure.ac
>> +index 60e3b39..a5d51e3 100644
>> +--- a/configure.ac
>> ++++ b/configure.ac
>> +@@ -73,6 +73,9 @@ AC_ARG_ENABLE(filter-version-control,
>> + AC_ARG_ENABLE(32-bit-hash-fun,
>> +   AS_HELP_STRING([--enable-32-bit-hash-fun],[use 32-bit hash function for compiled dictionaries]))
>> +
>> ++AC_ARG_ENABLE(sloppy-null-term-strings,
>> ++  AS_HELP_STRING([--enable-sloppy-null-term-strings],[allows allow null terminated UCS-2 and UCS-4 strings]))
>> ++
>> + AC_ARG_ENABLE(pspell-compatibility,
>> +   AS_HELP_STRING([--disable-pspell-compatibility],[don't install pspell compatibility libraries]))
>> +
>> +@@ -141,6 +144,11 @@ then
>> +   AC_DEFINE(USE_32_BIT_HASH_FUN, 1, [Defined if 32-bit hash function should be used for compiled dictionaries.])
>> + fi
>> +
>> ++if test "$enable_sloppy_null_term_strings" = "yes"
>> ++then
>> ++  AC_DEFINE(SLOPPY_NULL_TERM_STRINGS, 1, [Defined if null-terminated UCS-2 and UCS-4 strings should always be allowed.])
>> ++fi
>> ++
>> + AM_CONDITIONAL(PSPELL_COMPATIBILITY,
>> +   [test "$enable_pspell_compatibility" != "no"])
>> + AM_CONDITIONAL(INCREMENTED_SONAME,
>> +diff --git a/manual/aspell.texi b/manual/aspell.texi
>> +index 45fa091..f400e06 100644
>> +--- a/manual/aspell.texi
>> ++++ b/manual/aspell.texi
>> +@@ -158,7 +158,8 @@ Installing
>> +
>> + * Generic Install Instructions::
>> + * HTML Manuals and "make clean"::
>> +-* Curses Notes::
>> ++* Curses Notes::
>> ++* Upgrading from Aspell 0.60.7::
>> + * Loadable Filter Notes::
>> + * Upgrading from Aspell 0.50::
>> + * Upgrading from Aspell .33/Pspell .12::
>> +@@ -2206,18 +2207,26 @@ int correct = aspell_speller_check(spell_checker, @var{word}, @var{size});
>> + @end smallexample
>> +
>> + @noindent
>> +-@var{word} is expected to be a @code{const char *} character
>> +-string.  If the encoding is set to be @code{ucs-2} or
>> +-@code{ucs-4} @var{word} is expected to be a cast
>> +-from either @code{const u16int *} or @code{const u32int *}
>> +-respectively.  @code{u16int} and @code{u32int} are generally
>> +-@code{unsigned short} and @code{unsigned int} respectively.
>> +-@var{size} is the length of the string or @code{-1} if the string
>> +-is null terminated.  If the string is a cast from @code{const u16int
>> +-*} or @code{const u32int *} then @code{@i{size}} is the amount of
>> +-space in bytes the string takes up after being cast to @code{const
>> +-char *} and not the true size of the string.  @code{sspell_speller_check}
>> +-will return @code{0} if it is not found and non-zero otherwise.
>> ++@var{word} is expected to be a @code{const char *} character string.
>> ++@var{size} is the length of the string or @code{-1} if the string is
>> ++null terminated.  @code{aspell_speller_check} will return @code{0} if it is not found
>> ++and non-zero otherwise.
>> ++
>> ++If you are using the @code{ucs-2} or @code{ucs-4} encoding then the
>> ++string is expected to be either a 2 or 4 byte wide integer
>> ++(respectively) and the @code{_w} macro version should be used:
>> ++
>> ++@smallexample
>> ++int correct = aspell_speller_check_w(spell_checker, @var{word}, @var{size});
>> ++@end smallexample
>> ++
>> ++The macro will cast the string to the correct type and convert
>> ++@var{size} into bytes for you and then call the special wide version of the
>> ++function that will make sure the encoding is correct for the type
>> ++passed in.  For compatibility with older versions of Aspell the normal
>> ++non-wide functions can still be used provided that the size of the
>> ++string, in bytes, is also passed in.  Null-terminated @code{ucs-2} or
>> ++@code{ucs-4} strings are no longer supported when using the non-wide functions.
>> +
>> + If the word is not correct, then the @code{suggest} method can be used
>> + to come up with likely replacements.
>> +@@ -2236,7 +2245,28 @@ delete_aspell_string_enumeration(elements);
>> +
>> + Notice how @code{elements} is deleted but @code{suggestions} is not.
>> + The value returned by @code{suggestions} is only valid to the next
>> +-call to @code{suggest}.  Once a replacement is made the
>> ++call to @code{suggest}.
>> ++
>> ++If you are using the @code{ucs-2} or @code{ucs-4} encoding then, in
>> ++addition to using the @code{_w} macro for the @code{suggest} method, you
>> ++should also use the @code{_w} macro with the @code{next} method which
>> ++will cast the string to the correct type for you.  For example, if you
>> ++are using the @code{ucs-2} encoding and the string is a @code{const
>> ++uint16_t *} then you should use:
>> ++
>> ++@smallexample
>> ++AspellWordList * suggestions = aspell_speller_suggest_w(spell_checker,
>> ++                                                        @var{word}, @var{size});
>> ++AspellStringEnumeration * elements = aspell_word_list_elements(suggestions);
>> ++const uint16_t * word;
>> ++while ( (word = aspell_string_enumeration_next_w(uint16_t, elements)) != NULL )
>> ++@{
>> ++  // add to suggestion list
>> ++@}
>> ++delete_aspell_string_enumeration(elements);
>> ++@end smallexample
>> ++
>> ++Once a replacement is made the
>> + @code{store_repl} method should be used to communicate the replacement
>> + pair back to the spell checker (for the reason, @pxref{Notes on
>> + Storing Replacement Pairs}).  Its usage is as follows:
>> +diff --git a/manual/readme.texi b/manual/readme.texi
>> +index 669ab8e..531721f 100644
>> +--- a/manual/readme.texi
>> ++++ b/manual/readme.texi
>> +@@ -15,15 +15,16 @@ The latest version can always be found at GNU Aspell's home page at
>> + @uref{http://aspell.net}.
>> +
>> + @menu
>> +-* Generic Install Instructions::
>> +-* HTML Manuals and "make clean"::
>> +-* Curses Notes::
>> +-* Loadable Filter Notes::
>> +-* Using 32-Bit Dictionaries on a 64-Bit System::
>> +-* Upgrading from Aspell 0.50::
>> +-* Upgrading from Aspell .33/Pspell .12::
>> +-* Upgrading from a Pre-0.50 snapshot::
>> +-* WIN32 Notes::
>> ++* Generic Install Instructions::
>> ++* HTML Manuals and "make clean"::
>> ++* Curses Notes::
>> ++* Upgrading from Aspell 0.60.7::
>> ++* Loadable Filter Notes::
>> ++* Using 32-Bit Dictionaries on a 64-Bit System::
>> ++* Upgrading from Aspell 0.50::
>> ++* Upgrading from Aspell .33/Pspell .12::
>> ++* Upgrading from a Pre-0.50 snapshot::
>> ++* WIN32 Notes::
>> + @end menu
>> +
>> + @node Generic Install Instructions
>> +@@ -121,17 +122,62 @@ In addition your system must also support the @code{mblen} function.
>> + Although this function was defined in the ISO C89 standard (ANSI
>> + X3.159-1989), not all systems have it.
>> +
>> ++@node Upgrading from Aspell 0.60.7
>> ++@appendixsec Upgrading from Aspell 0.60.7
>> ++
>> ++To prevent a potentially unbounded buffer over-read, Aspell no longer
>> ++supports null-terminated UCS-2 and UCS-4 encoded strings with the
>> ++original C API.  Null-terminated 8-bit or UTF-8 encoded strings are
>> ++still supported, as are UCS-2 and UCS-4 encoded strings when the
>> ++length is passed in.
>> ++
>> ++As of Aspell 0.60.8 a function from the original API that expects an
>> ++encoded string as a parameter will return meaningless results (or an
>> ++error code) if the string is null terminated and the encoding is set to
>> ++@code{ucs-2} or @code{ucs-4}.  In addition, a single:
>> ++@example
>> ++ERROR: aspell_speller_check: Null-terminated wide-character strings unsupported when used this way.
>> ++@end example
>> ++will be printed to standard error the first time one of those
>> ++functions is called.
>> ++
>> ++Applications that use null-terminated UCS-2/4 strings should either (1)
>> ++use the interface intended for working with wide-characters
>> ++(@pxref{Through the C API}); or (2) define
>> ++@code{ASPELL_ENCODE_SETTING_SECURE} before including @code{aspell.h}.
>> ++In the latter case it is important that the application explicitly
>> ++sets the encoding to a known value.  Defining
>> ++@code{ASPELL_ENCODE_SETTING_SECURE} and not setting the encoding
>> ++explicitly or allowing users of the application to set the encoding
>> ++could result in an unbounded buffer over-read.
>> ++
>> ++If it is necessary to preserve binary compatibility with older
>> ++versions of Aspell, the easiest thing would be to determine the length
>> ++of the UCS-2/4 string---in bytes---and pass that in.  Due to an
>> ++implementation detail, existing API functions can be made to work with
>> ++null-terminated UCS-2/4 strings safely by passing in either @code{-2}
>> ++or @code{-4} (corresponding to the width of the character type) as the
>> ++size.  Doing so, however, will cause a buffer over-read for unpatched
>> ++versions of Aspell.  To avoid this it will be necessary to parse the
>> ++version string to determine the correct value to use.  However, no
>> ++official support will be provided for the latter method.
>> ++
>> ++If the application can not be recompiled, then Aspell can be configured
>> ++to preserve the old behavior by passing
>> ++@option{--enable-sloppy-null-term-strings} to @command{configure}.  When Aspell
>> ++is compiled this way the version string will include the string
>> ++@samp{ SLOPPY}.
>> ++
>> + @node Loadable Filter Notes
>> + @appendixsec Loadable Filter Notes
>> +-
>> ++
>> + Support for being able to load additional filter modules at run-time
>> + has only been verified to work on Linux platforms.  If you get linker
>> + errors when trying to use a filter, then it is likely that loadable
>> + filter support is not working yet on your platform.  Thus, in order to
>> + get Aspell to work correctly you will need to avoid compiling the
>> + filters as individual modules by using the
>> +-@option{--enable-compile-in-filters} when configuring Aspell with
>> +-@command{./configure}.
>> ++@option{--enable-compile-in-filters} @command{configure} option.
>> +
>> + @node Using 32-Bit Dictionaries on a 64-Bit System
>> + @appendixsec Using 32-Bit Dictionaries on a 64-Bit System
>> +--
>> +2.17.1
>> +
>> diff --git a/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch
>> new file mode 100644
>> index 0000000000..9569ddeebe
>> --- /dev/null
>> +++ b/meta/recipes-support/aspell/aspell/CVE-2019-20433-0002.patch
>> @@ -0,0 +1,68 @@
>> +From cefd447e5528b08bb0cd6656bc52b4255692cefc Mon Sep 17 00:00:00 2001
>> +From: Kevin Atkinson <kevina@gnu.org>
>> +Date: Sat, 17 Aug 2019 20:25:21 -0400
>> +Subject: [PATCH 2/2] Increment library version to reflect API changes.
>> +
>> +CVE: CVE-2019-20433
>> +Upstream-Status: Backport [https://github.com/GNUAspell/aspell/commit/cefd447e5528b08bb0cd6656bc52b4255692cefc]
>> +
>> +Signed-off-by: Stefan Ghinea <stefan.ghinea@windriver.com>
>> +---
>> + Makefile.am | 31 +++++++++++++++++--------------
>> + 1 file changed, 17 insertions(+), 14 deletions(-)
>> +
>> +diff --git a/Makefile.am b/Makefile.am
>> +index 7e15851..19dc044 100644
>> +--- a/Makefile.am
>> ++++ b/Makefile.am
>> +@@ -94,18 +94,25 @@ libaspell_la_SOURCES =\
>> +
>> + libaspell_la_LIBADD =  $(LTLIBINTL) $(PTHREAD_LIB)
>> +
>> +-## Libtool to so name
>> +-## C:R:A => (C-A).(A).(R)
>> +-## 16:5:0 => 16.0.5
>> +-## 16:5:1 => 15.1.5
>> +-## 18:0:2 => 16.2.0
>> +-## 17:0:2 => 15.2.0
>> +-
>> ++## The version string is current[:revision[:age]]
>> ++##
>> ++## Before a release that has changed the source code at all
>> ++## increment revision.
>> ++##
>> ++## After merging changes that have changed the API in a backwards
>> ++## compatible way set revision to 0 and bump both current and age.
>> ++##
>> ++## Do not change the API in a backwards incompatible way.
>> ++##
>> ++## See "Libtool: Updating version info"
>> ++## (https://www.gnu.org/software/libtool/manual/html_node/Updating-version-info.html)
>> ++## for more info
>> ++##
>> + if INCREMENTED_SONAME
>> +-libaspell_la_LDFLAGS = -version-info 18:0:2 -no-undefined
>> ++libaspell_la_LDFLAGS = -version-info 19:0:3 -no-undefined
>> + else
>> + ## Use C-1:R:A
>> +-libaspell_la_LDFLAGS = -version-info 17:0:2 -no-undefined
>> ++libaspell_la_LDFLAGS = -version-info 18:0:3 -no-undefined
>> + endif
>> +
>> + if PSPELL_COMPATIBILITY
>> +@@ -113,11 +120,7 @@ libpspell_la_SOURCES = lib/dummy.cpp
>> +
>> + libpspell_la_LIBADD = libaspell.la
>> +
>> +-if INCREMENTED_SONAME
>> +-libpspell_la_LDFLAGS = -version-info 18:0:2 -no-undefined
>> +-else
>> +-libpspell_la_LDFLAGS = -version-info 17:0:2 -no-undefined
>> +-endif
>> ++libpspell_la_LDFLAGS = $(libaspell_la_LDFLAGS)
>> +
>> + endif
>> +
>> +--
>> +2.17.1
>> +
>> diff --git a/meta/recipes-support/aspell/aspell_0.60.7.bb b/meta/recipes-support/aspell/aspell_0.60.7.bb
>> index b565cb3c6e..1e104c263c 100644
>> --- a/meta/recipes-support/aspell/aspell_0.60.7.bb
>> +++ b/meta/recipes-support/aspell/aspell_0.60.7.bb
>> @@ -8,6 +8,8 @@ PR = "r1"
>>   
>>   SRC_URI = "${GNU_MIRROR}/aspell/aspell-${PV}.tar.gz \
>>              file://0001-Fix-various-bugs-found-by-OSS-Fuze.patch \
>> +           file://CVE-2019-20433-0001.patch \
>> +           file://CVE-2019-20433-0002.patch \
>>             "
>>   SRC_URI[md5sum] = "8ef2252609c511cd2bb26f3a3932ef28"
>>   SRC_URI[sha256sum] = "5ca8fc8cb0370cc6c9eb5b64c6d1bc5d57b3750dbf17887726c3407d833b70e4"
>> -- 
>> 2.17.1
>>
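A note on the "Upgrading from Aspell 0.60.7" text quoted above: it recommends determining the length of a UCS-2/4 string in bytes and passing that in instead of -1. A minimal sketch of that length calculation follows. The helper names ucs2_byte_length and ucs4_byte_length are hypothetical and are not part of the Aspell API; the sketch only assumes that the terminator is a single zero code unit of the same width as the characters.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not part of Aspell): number of bytes occupied by a
 * null-terminated UCS-2 string, excluding the terminating 16-bit zero. */
size_t ucs2_byte_length(const uint16_t *s)
{
    size_t units = 0;
    while (s[units] != 0)
        units++;
    return units * sizeof(uint16_t);
}

/* Hypothetical helper (not part of Aspell): the same calculation for a
 * null-terminated UCS-4 string with a 32-bit zero terminator. */
size_t ucs4_byte_length(const uint32_t *s)
{
    size_t units = 0;
    while (s[units] != 0)
        units++;
    return units * sizeof(uint32_t);
}
```

Going by the manual text in the patch, the result would then be handed to the non-wide function as the size argument, along the lines of aspell_speller_check(spell_checker, (const char *)word, (int)ucs2_byte_length(word)), rather than passing -1.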

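On the -version-info change in the second patch: the comment block it removes from Makefile.am documented the mapping "C:R:A => (C-A).(A).(R)". The sketch below applies that mapping so the effect of the bump from 18:0:2 to 19:0:3 can be checked by hand. The function name libtool_to_soversion is hypothetical, and the mapping is taken from the removed comment; it is assumed here to describe the Linux naming scheme only, since libtool uses other schemes on other platforms.

```c
#include <stdio.h>

/* Hypothetical helper: format the shared-library version suffix that the
 * removed Makefile.am comment describes, C:R:A => (C-A).(A).(R).
 * Returns 0 on success, -1 if age exceeds current or the buffer is too small. */
int libtool_to_soversion(unsigned current, unsigned revision, unsigned age,
                         char *buf, size_t buflen)
{
    if (age > current)
        return -1;
    int n = snprintf(buf, buflen, "%u.%u.%u", current - age, age, revision);
    return (n < 0 || (size_t)n >= buflen) ? -1 : 0;
}
```

Under that assumption, 19:0:3 (INCREMENTED_SONAME) maps to 16.3.0 and 18:0:3 maps to 15.3.0, so the leading number stays where it was with 18:0:2 (16.2.0) and 17:0:2 (15.2.0), which is consistent with a backwards-compatible API addition.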

end of thread, other threads:[~2020-03-12 14:36 UTC | newest]

Thread overview: 7+ messages
2020-03-12  9:23 [PATCH] [zeus] aspell: CVE-2019-20433 Stefan Ghinea
2020-03-12 12:25 ` Mittal, Anuj
2020-03-12 12:34   ` Mikko.Rapeli
2020-03-12 12:49     ` Adrian Bunk
2020-03-12 13:25       ` Mikko.Rapeli
2020-03-12 13:04     ` Mittal, Anuj
2020-03-12 14:35   ` Stefan Robert Ghinea
