[Logo]

Link Grammar Parser Older Changes

Summary of Older Changes

Version 4.7.14 (20 June 2013) MacOS build break

  • Fix build break on MacOSX

Version 4.7.13 (17 June 2013) Windows unicode handling fixes.

  • Remove trailing carriage return in MSVC6 filenames.
  • Performance improvement: avoid excess system timer calls.
  • Unicode fixes for Windows.
  • Fix: "... a purseful of pesos"; other currency fixes.
  • MinGW, unicode fixes from Per Larsson

Version 4.7.12 (25 May 2013) Major revision to the Russian dictionaries!

  • Large fixes to the Russian dictionaries.
  • Windows: Explicitly fail if cygwin version is too old.
  • Tweak the lt dict to work again with the modern parser.
  • Make the fat linkages code be compile-time configurable.
  • Disable fat linkages by default; mark as deprecated.
  • Fix SAT-solver build; recent changes had broken it.
  • Export read-dict.h as a public API.
  • Ongoing development of the Viterbi prototype.
  • Windows: some UTF8/widechar refactoring.
  • Java bindings: add method to set the language.
  • CMake: add version checking to the CMakefile
  • Fix: failed handling of capitalized first word for Russian.
  • Fix: stemming failures in many cases (for Russian dictionaries)
  • Add flag to suppress stem-suffix printing.
  • Windows: Fixes to MSVC6 build files.
  • Fix: hash-table bug affecting Russian dictionaries

Version 4.7.11 (23 March 2013) Emergency build break fix.

  • Fix build bug introduced recently (build_disjuncts_for_dict_node)
  • MSVC6 build environment fixes.
  • Ongoing development of the Viterbi prototype.

Version 4.7.10 (9 March 2013) Russian dictionaries!

  • New: Russian dictionaries from Sergey Protasov!
  • Improve prefix, suffix handling and stemming (needed for Russian)
  • Fix: add "gurgle.n" to dictionary.
  • Improve memory utilization (remove pointless malloc/free).
  • Fix: UTF8 multi-byte diagram printing.
  • Add UTF8 multi-byte editline support, when available.
  • Include Common Lisp bindings, from Peter Szolovits (circa 2009).
  • Ongoing work on pre-alpha Viterbi decoder.
  • Fix: Bug 55: aspell header file included even if disabled.
  • Updated MSVC6, MSVC9 project files.
  • New, improved search for java jni.h
  • Faster counting of disjuncts for the !! command.
  • The !! command now shows regex entries and stem+suffix splits.

Version 4.7.9 (3 November 2012) A large number of dictionary updates, plus some more MacOSX touchups.

  • Fix: Poor comma chocies: "The man, that you saw..." (issue #36)
  • Fix: Add more java jni.h search paths.
  • Fix: Warning in java JSON interface.
  • Fix: weren't constructions: "If it weren't for Sally, ..."
  • Fix: Better support for ellipsis ...
  • Fix: strangely: "He is behaving very strangely"
  • Fix: assorted usages of yes, no, maybe
  • Fix: "being" in subordinate clauses: "The blade being dull, he ..."
  • Fix: Build failure for Mac OSX.
  • Fix: "... this time for real".
  • Fix: "... the impudence to laugh."
  • Fix: "we two", "us two"
  • Fix: very old parse bug, using "enough"
  • Fix: assorted profanity as synonyms to "else"
  • Fix: whoever, whomever
  • Fix: decade possessives: "It's a 60's hit"
  • Fix: "... chances that ... could be ..."
  • Fix: "Do you have it ready?"
  • Fix: "What John wants are those cats"
  • Fix: greetings
  • Fix: misc verbs: wave, ink, kiss, bake, corral, crown, decant, rope, slice, ink, butter, proclaim, make, give, bid, bade, think
  • Fix: titles can behave as indefinite nouns.
  • Fix: somewhere, near, nearby, halfway, about
  • Fix: police-interrogation style questions

Version 4.7.8 (10 October 2012) Broke the build, in the previous release. Ooooops!

  • Ongoing work on pre-alpha Viterbi decoder.
  • Fix include file paths so that 'make install' doesn't cause recompile.
  • Fix build break due to bad include file paths in Makefiles.

Version 4.7.7 (6 October 2012) This version makes a number of significant changes to the dictionary. The most important of these is to indicate that expressions of the form ...a number of X act as if they were determiners; correspondingly, the subject and object links move to a new location. That is, the subject/object links now land on "X" instead of "number". Other changes to the dictionary are listed below.

  • Fix configure to not bomb if c++ is not found. (weird regex.h missing)
  • Fix configure to explicitly require c++ only if sat-solver enabled.
  • Dictionary: parse comma in "Monsters, Inc."
  • Update README about Viterbi algo.
  • Remove malloc-dbg.c from the MSVC6 project file.
  • Add missing source files to MSVC6 project file.
  • Use enums for constituent display style.
  • Move command-line parser to its own directory.
  • Portability fixes for Windows, from Bill Hayes.
  • Fix: "Show results from today"
  • Fix: conjoined imperatives.
  • Fix: qualified conditional clauses
  • Fix: conjoined irregular verbs: seem, appear, do, be, prove, have
  • Disable fat link code via conditional compile.
  • Fix: "a lot of", "a gaggle of" now treated as quantifying determiners
  • Fix: likewise: "gallons of", "grams of" volume measures as determiners
  • Update German dictionary so that it actually loads.
  • Fix: assorted "but not", "not very" constructions.
  • Pre-alpha implementation of Viterbi decoder started.
  • Fix: conjunctions of optionally ditransitive verbs.

Version 4.7.6 (26 April 2012) This version fixes a bungled header file in the last version: builds broke because the public header file was including an internal header file.

  • fast-match.c: unroll recursive call into loop (thx Valery Kholodkov).
  • Fix accidental inclusion of internal header file from public header file.

Version 4.7.5 (16 April 2012) This version includes fixes for a handful of dictionary bugs, an all new manual page, and several fixes and improvements for the build system.

  • Remove several duplicate given names.
  • Remove malloc-dbg.c from the MSVC9 project file.
  • Remove several dozen duplicated idioms.
  • New AM_SILENT_RULES for less noisy build!
  • Clean up misc minor compiler warnings.
  • Start using -O3 optimization for another 2% perf improvement.
  • Fix configure.in --enable/--disable flags (bug #53)
  • Some intransitive verbs are actually transitive in rare cases.
  • Use enum not int for the cost-model type.
  • Update the man page to reflect actual command usage.

Version 4.7.4 (13 February 2011) This version includes fixes for several bugs/crashes introduced in the previous version :-( It also includes changes that result in dramatic speed improvements on long sentences, with an average of 1.1x to 2.0x improvement on "typical" texts.

  • Dramatic speed improvement for long sentences (hash table fix).
  • Fix: crash on certain sentences, when fat-link parsing enabled.
  • Fix: fatal error introduced in last version (combinatorial explosion).
  • Fix: mem leak introduced in last version (combinatorial explosion).
  • Assorted fixes to the Boolean SAT solver.

Version 4.7.3 (7 February 2011) This version fixes a bug where a "combinatorial explosion" resulted in no parses being printed.

  • Fix: "level" as indefinite noun: "... at knee level".
  • Enable max disjunct cost as a controllable parser parameter.
  • Don't just give up on combinatorial explosion, show something!

Version 4.7.2 (6 January 2011) This version fixes a crash and an "almost infinite loop"; both might be hit when using the (now deprecated) fat-links, and certain overflow conditions are met.

  • Fix: crash on certain sentences, when fat-link parsing enabled
  • Fix: 3 corrupted nouns in words.n.1
  • Fix: minimize cpu spin if constituent processing and-list overflows.

Version 4.7.1 (11 December 2010) This version fixes several bugs introduced by the conjunction-handling rework of 4.7.0. Several Windows-related compile fixes are included as well.

  • Fix: dictionary: Add "x" as synonym for "times".
  • Fix: compilation failure due to lack of termios support on Windows.
  • Fix: if not set, then force-set locale to UTF-8 in the java library.
  • Fix: Explicitly include <stddef.h> for Windows builds.
  • Fix: "John imagines himself lost".
  • Fix: predicative adjectives w/misc verbs: "You are driving me crazy"
  • Fix: "judge innocent", etc.
  • Fix: many, many conjoined present tense verbs.
  • Fix: conjoined verbs with negations.
  • Fix: "to" as post-nominal modifier: "the inability to laugh"

Version 4.7.0 (12 September 2010) This version introduces a major change to the way that conjunctions are handled. The parse output for conjunctions is not compatbile with the old output, but should be easier to work with, solving many problems. The rework results in a performance improvement of 1.3x to 2.7x, depending on the text.

  • Fix: hunspell configuration on Fedora (bugtracker issue 47)
  • Fix: 'turn' with adjective: "She turned him green" from wingedtachikoma
  • Fix: comma-conjoined modifiers: "It tastes bitter, not sweet."
  • Fix: conjoined question words: "When and where is the party?"
  • Fix: recognize short, capitalized words (Los, La, etc.).
  • Treat colon as synonym for is: "The answer: yes."
  • Fix: begin with prepositions: "It all began in Chicago."
  • Fix: "What does it come to?" and related.
  • Fix: null infinitive: "I'd like to, I want to."
  • Fix: "Because I said so."
  • Fix: "sure" as preverbal adverb: "It sure is."
  • Fix: Gerunds with determiners: "a running of the bulls"
  • SJ link for conjoined nouns/noun phrases.
  • Sort linkages according to whether fat linkage was used.
  • Add flag to enable use of fat linkage during parsing. (Fat links now disabled by default).
  • Add male/female gender tags to misc nouns.
  • Fix: misc optionally transitive verbs: mix, paint, boot
  • Fix: word order: "look about fearfully", "look fearfully about", around
  • Fix: recognize simple fractions
  • Fix: "is" with uncountable nouns: "there is blood on your hands"
  • Fix: Roman numeral suffixes e.g. "Henry VIII"
  • Fix: regression in dates followed by punctuation. "In the 1950s, ..."
  • Fix: verbs drank, drunk are optionally transitive.
  • Fix: regression: "all the X", X can be plural or mass.
  • Fix: verbs paint, color may be ditranstive: "paint the car bright green"

Version 4.6.7 (16 April 2010)

  • Fix: configure handling of --enable-hunspell (bugtracker issue 46)
  • Fix: conjoined modifier phrases: "the black and white cat sleeps."
  • Fix: MSWindows+MingGW compilation issue.
  • Update msvc9 Microsoft Visual C 2008 project, from Boris Iordanov.

Version 4.6.6 (19 March 2010)

  • Minor fixes to handling of units.
  • Fix: MSWindows+Cygwin/MingGW compilation issues.
  • Use pkgconfig for editline (from Vincent Untz)
  • Fix: "I want it back."
  • Additional titular prefixes, suffixes: Maj. Ph.D.
  • Add code to detect dictionary version number.
  • Fix: MS VC missing strdup() function.
  • Add AutoIt3 bindings from JRowe

Version 4.6.5 (3 November 2009)

  • Fix: Superlatives without preceding determiners ("... likes you best")
  • Fix: Take more care in distinguishing mass and count nouns.
  • Fix: Old bug w/relative clauses: Rw+ is optional, not mandatory.
  • Provide tags identifying relative, superlative adjectives.
  • Remove BioLG NUMBER-AND-UNIT handling, its been superceded.
  • Fix handling of parenthetical phrases/clauses.
  • Fix: handling of capitalized first words ending in letter "s".
  • Fix: support "filler-it" SF link for "It was reasoned that..."
  • Fix: certain WH-word constructions: "I did not know why until recently"
  • Fix: go: "there goes the greatest guy ever"
  • Fix: opening coordinating conjunctions: "And you can also ..."
  • Configurable Hunspell spell-checker dictionary location.
  • Fix: Misc ordinal usage.
  • Add support for aspell spell-checker.

Version 4.6.4 (11 October 2009)

  • Restore nouns starting w/letters x-z, elided in version 4.5.9 ff.
  • Add support for single-word interjections/exclamations!
  • Fix: sometimes command line client fails to show all valid linkages.
  • Misc fixes: such_that, upon, acted.v
  • Fix: impersonal "be" linking to passive participle.
  • Fix: handling of capitalized first words.
  • Fix: duplication of certain parses involving transitive verbs.

Version 4.6.3 (4 October 2009)

  • Fix compilation bug on FreeBSD.
  • Fix: allow MX link to post-nominal ", to be ..., "
  • Fix: add idiom "time and again"
  • Fix: another BioLG regression in handling of possessives.
  • Fix: handling of period at end of number at end of sentence.
  • Fix: Capitalized words ending in s at start of sentence.
  • Use corpus-statistics-based ranking by default, if available.
  • Fix difficulties in build of corpus statistics module.

Version 4.6.2 (21 September 2009)

  • Fix: "come across as authoritative".
  • Improve Java location guessing in FreeBSD
  • Fix for assert triggered by long sentences.
  • Fix: long sequence of periods treated as unknown word.
  • Add informational print showing dictionary location on startup.
  • Remove duplicated {@MV+} in tend.v
  • Automatically resize the display size to fit the current window size.
  • Fix handling of punctuation at the end of a capitalized word.
  • Fix misc verbs acting as adjectival modifiers: e.g. "given", "allied"
  • Fix bug in BioLG code regarding the handling of possessives.
  • Fix a (rare) crash in sentences with many conjunctions.
  • Fix a crash involving long sequences of UTF8 punctuation marks.

Version 4.6.1 (31 August 2009)

  • Stop printing annoying warning when !vars are used.
  • Fix missing dict file units.2 problem
  • Fix compilation bug on FreeBSD.

Version 4.6.0 (29 August 2009)

  • Avoid used of bzero, add missing include directives (MacOSX problem)
  • Reclassify a number of "medical" prepositions as adverbs.
  • Add approx 100 adverbs & 300 adjectives.
  • Add approx 250 verbs.
  • Add approx 300 nouns.
  • Add misc units.
  • Add misc European connector words/patronymics.
  • Reclassify 100's of transitive verbs as optionally-transitive.
  • Add distinct tokenization step ("sentence_split") to public API. This last change forces the minor-version-number bump.

Version 4.5.10 (25 August 2009)

  • Be sure to link with -lm

Version 4.5.9 (25 August 2009)

  • Modify error messages to indicate that they are from link-grammar.
  • Add missing Java files that were forgotten last time around.
  • Add greeting to command-line client startup.
  • Print disjunct cost also, when requesting disjunct printing.
  • Add missing color names as mass nouns.
  • Fix: Reclassify musical instruments: "He plays piano"
  • Add experimental word-clustering system.
  • Add CMake build file
  • Fix: "It takes longer than that."
  • Fix: "He has done very well."
  • Fix: a dozen optionally transitive verbs (swim, kill, etc.)
  • Fix: "He's out running."
  • Fix: "suddenly" is a "manner adverb", not a clausal adverb.
  • Fix: Use Pg links to gerunds: "He feared hitting the wall."
  • Fix: assorted numerical-range bugs.
  • Fix: prep modifiers with distances: "It is a few miles out"
  • Fix: Spelled-out dates: "It started in nineteen twelve"
  • Fix: Misc date, time expression parsing e.g "Zero hour is here."
  • Fix: Misc words, "ordered list", "screened out"
  • Fix: Post-fixed numbers can act as determiners.
  • Fix: "We bought the last 50 ft. of cable."
  • Fix: opening directives to imperatives: "Finally, move it back."
  • Fix: Improved simple equation parsing support.
  • Fix: Add misc fixes from BioLG that were previously overlooked.
  • Fix: "favorite" can take determiner "a" ("a favorite place")
  • Fix: assorted clausal complements: "The emperor ordered it done."
  • Fix: ordinals: "First on our list is ..."
  • Fix: verb modifier "some of the time", "most places"
  • Fix: Sit, stand take modifiers: "he stood still"

Version 4.5.8 (2 July 2009) includes the following changes:

  • Fix: 'than anticipated', 'than was anticipated', etc.
  • Fix: 'saw the wood'
  • Fix: sometimes commas are used as if they were semicolons.
  • Fix: 'We have quite enough work already, thank you!'
  • Fix: allow 'and' as conjunction in entity names.
  • Fix: 'I stared him down', 'They shouted him down', 'booed off'
  • Fix: 'sound him out', 'look him over'
  • Fix: 'Somewhere in the distance'
  • Stub out list of names given to both men and women, to avoid duplication.
  • Fix: 'I think so, too'
  • Fixes for compilation under Cygwin.
  • From Boris Iordanov: fixes to JSON java code.
  • From Boris Iordanov: new java remote client code.
  • Fix: Biological texts commonly have adj-noun-adj-noun chains

Version 4.5.7 (4 June 2009) includes the following changes:

  • Fix 'make install' for windows (abi bug 12049)
  • Fix multi-threaded bug when TRACK_SPACE_USAGE is defined.
  • Add './configure --enable-mudflap' just for fun...
  • Fix: "Walk tall", "Think quick"
  • Fix: "... part no. 1234-56A"
  • Fix regression from BioLG merge: "It cost $14 million."
  • Fix come/came: "The dog came running..."
  • Fix year abbreviations: "He drove a souped-up '57 Chevy"
  • Fix sit, stand: "The dog stood still"
  • Fix act up, act out: "He is acting out." "The motor is acting up."
  • Fix notoriously, poorly: "The store was poorly stocked".
  • Fix: "strong" can be adverb
  • Add support for recognizing basic time zones during parsing.
  • Fix: verbs acting as adjectival modifiers: "a very politicized deal."
  • Fix: ".. nearly so well", "...almost so well".
  • Fix financial ranges: "It will cost $10 million to $20 million to build."
  • Expand handling of capitalized words that appear in entity names.
  • Expand the list of characters that are recognized as quotes.
  • Support usage of yes, no as sentence openers.
  • Better support for directives, commands.
  • Fix: "Ash Wednesday", "Fat Tuesday", etc.
  • Fix: post-verbal adj: "she wiped the table dry"
  • Fix: wish: "she wished me a happy birthday"

Version 4.5.6 (24 May 2009) includes the following changes:

  • Bugfix: fix non-thread-safe usage.
  • Changes to enable MinGW/Windows to compile.
  • Update of MSVC6 build files
  • Fix: pizza, fries, chopsticks.
  • Export word-sense database to Java apps.
  • Fix: "Was the man drunk or crazy or both?"

Version 4.5.5 (10 May 2009) includes the following changes:

  • Bugfix: crash for zero-length sentences.

Version 4.5.4 (9 May 2009) includes the following changes:

  • Fix: "sleep in": "A bed is something you sleep in."
  • Fix: "drinking": "Let's go drinking."
  • Fix go+bare infinitive: "Let's go shop", "Let's go swim"
  • Fix: "Let's go for a swim." "Let's go for a smoke".
  • Fix: "Let's not" "Let's not go" "Let's not cry"
  • Fix: ... is : "All he ever does is complain."
  • Fix: "You will die young/happy/unhappy"
  • Fix: "You should exercise to stay fit."
  • Fix: "We danced 'til dawn."
  • Fix: "tell off": "She had told him off."
  • Bugfix: sometimes spell checker would run even if turned off.

Version 4.5.3 (14 April 2009) includes the following changes:

  • Haste makes waste! Revert a recent 'fix'.

Version 4.5.2 (14 April 2009) includes the following changes:

  • Use re-entrant version of mbtowc in all code.
  • Fix run-time breakage on Mac OSX and FreeBSD.

Version 4.5.1 (13 April 2009) includes the following changes:

  • Fix Assertion failed: negative constituent length!
  • Fix build break for Mac OSX.
  • Force use of UTF-8 locale in the command-line tool.

Version 4.5.0 (10 April 2009) includes the following changes:

  • Hack around missing SQLite3 pkgconfig on MacOS
  • Fix adverbs: 'The motor ran hot', 'the door swung wide open', etc.
  • Fix: 'at risk of breakdown', 'under threat of fire'
  • Add regular-expression-based word guessing, from BioLG project. This provides support for many scientific/biomedical terms.
  • Add spell-guessing for unknown words.
  • Fix UTF8 support to be correctly thread-safe.
  • BioLG: fix post-numbering: 'it started on day one'
  • BioLG: add number ranges: 'it takes 2 to 3 times the effort'
  • BioLG: assorted adverb fixes, typical of scientific prose.
  • BioLG: initiate, attach, localize etc are optionally transitive.
  • BioLG: allow fork, branch, splice, export, etc to take particles.
  • BioLG: extended use of Greek letters in biomedical text.
  • BioLG: support parsing of Roman numerals.
  • BioLG: support Greek-letter-number combinations.
  • Fix: 'she was singing', etc.
  • Enable WordNet word-sense identification based on syntactical usage.

Version 4.4.3 (11 March 2009) includes the following changes:

  • Look at JAVA_HOME to guess location of jni.h
  • Add dictionary maintenance utility script.
  • Add prototype German dictionary.
  • Fix sentences containing measurements, lengths, speeds. etc.
  • Add 1400 new adjectives, many of scientific origin.
  • Add 250 international currencies, including special utf8 symbols.
  • Performance optimization in sentence tokenization.
  • Add 1100 male, female given names (from Bruce Wilcox).
  • Cleanup parsing of "in-" date expressions, e.g. "in the 1940's".
  • Cleanup of assorted compiler warnings.
  • Fix handling of special utf8 characters in dictionary.
  • Fix hang when parsing certain utf8 input strings.
  • Make sure that most strings in the API are declared const char *
  • Add option to command-line client to print disjunct strings.

Version 4.4.2 (15 January 2009) includes the following changes:

  • Fix: 'The keys are gone', 'It is a gone desire'
  • Fix: 'He poured me a glass', 'He wired me the money', etc.
  • Initial rough-in of corpus statistics based parse ranking.
  • Initial rough-in of corpus-guided word-sense identification.
  • Fix for googlecode bugs 38, 39 (Java, Ruby failure to load)
  • Add new API routine to return disjunct string.

Version 4.4.1 (15 December 2008) includes the following changes:

  • Balance the dictionary tree; this speeds word-lookup slightly.
  • New MSVC6 build files from Evgenii Philippov.
  • Fix java server classes to pass along the link-grammar version number.

Version 4.4.0 (7 December 2008) includes the following changes:

  • fix: recognize curly-single-quote ’ where straight quote can be used.
  • recognize and explicitly ignore emoticon types.
  • Include MSVC6 build files.
  • Apply patch needed for Ruby bindings.
  • fix: "Where did they come from?", per Viswanath IIIT
  • fix: "Where did they go to?"
  • fix: "It gives me peace of mind."
  • fix: many, many incorrectly identified mass nouns.
  • fix: ladle.v "molten hot" "piping hot"
  • fix: "It's a shame that...", "The crux of the plan is that..."
  • Performance improvements (about 11%) to prunce.c from Bruce Wilcox
  • fix: "He eats with me nightly."
  • Add new public api function: linkgrammar_get_version()
  • MSVC9 build files from Borislav Iordanov
  • Java network-efficient client-server classes from Borislav Iordanov

Version 4.3.9 (8 October 2008) includes the following changes:

  • Issue 13: "John is altogether amazingly quick."
  • Nonstandard spelling "unequivocably"
  • Dictionary fixes for 'marginally', etc. "That one is marginally better"
  • Issue 7: Dictionary fixes for 'done': "I am done working"
  • dictionary entries for walk-up drivethru car-wash
  • dictionary: "I am through being mad", "It was a through flight", etc.
  • Issue 11: "You are doing well"
  • Issue 3: "I asked Jim a question", "I told Jim a story"
  • Passive subjects with objects: "I was told that crap, too" "...was asked..."
  • Fixes for Apple Mac OSX (crash on non-executable stack)
  • Early version of Filip Maric's boolean SAT solver
  • fix: "He talked quietly of revolt."
  • fix: "It consists mostly of sand.", "He talks, mostly of revolution."
  • fix: "He talked mostly to Ann.", "He talks a lot."
  • fix: than_usual: "He is taking longer than usual."
  • fix: a batch of new verbs from Roman Khlupin
  • fix: Fix crash on Apple Macintosh by correctly identifying the platform.
  • fix: "San Gabriel" "Block Island" "Great Southern Bank" "de la Rente"
  • fix: "I biked Johnson Creek."

Version 4.3.8 (14 August 2008) includes the following changes:

  • Build fixes for Windows (missing bzero and thread-safe rand)
  • Fix several mem leaks, one in the Java jni code.
  • Fix crash of java6 jvm, due to blown stack limits.

Version 4.3.7 (8 August 2008) includes the following changes:

  • Expanded list of given names to include those from 2005 US Census
  • Export a new java jni function, to get the inflected word.
  • Add a TO+ connector so that "I tend to agree" parses correctly.
  • Makefile fixes for OS/X Leopard

Version 4.3.6 (21 July 2008) includes the following changes:

  • Fixes for Windows MS Visual-C builds.
  • Fix parsing of "He walked the dog.", "He sailed the boat."
  • Add support for right-apostrophe (’) which is a non-ASCII UTF8 char.
  • Add support for other non-ASCII UTF8 punctuation.
  • Fix crash on printing constituent tree of certain long sentences.
  • Avoid recursive error reporting for UTF8 dictionary errors.
  • Clarify error logging and error printing.
  • Add java getVersion() to return link-grammar version string.
  • Add more numbers to dict (e.g. twenty-seven, bazillion, half-dozen, etc.)
  • Foodstuffs: bagels, lox, tacos, guacamole, roe, neufchatel, mayo, etc.
  • Weights and measures: megabytes, °C, km² etc.
  • Performance improvements in printing of link-tree.
  • Convert assert into warning when no canonical linkages can be found.
  • Convert assert into warning when constituent andlist overflows.
  • Provide additional checks for constituent overflows.
  • Convert most error printfs into a formal error reporting system.
  • Remove all globals, library is now thread-safe.
  • Fix crash when sentence has square bracket, and doing constituents.

Version 4.3.5 (29 April 2008) includes the following changes:

  • Added ant build file to create the link-grammar jar file.
  • Fix regression in command-line client of multiple-parse display.
  • Use MB_LEN_MAX, not MB_CUR_MAX for UTF8 support.
  • Fix a WIN32 compiler regression (no in-line support in Windows).
  • Fix error in handling of UTF8 dictionaries.
  • Fix strncat() misuse in error.c
  • Fix capitalization errors in country names.
  • Fix parsing of "he angled left, he dodged left, he turned left".
  • Don't build the JNI library if Java isn't found. Fixes build on Windows.
  • Fix install bug for NetBSD systems.
  • Pre-detected entities cannot participate in G links.
  • There is no UTF8 support in windows, so stub it out.
  • Fix crash in constituent output, bug #22 in googlecode bugtacker.
  • Some small steps taken to eventually make library thread-safe.
  • There are three constituent string styles, enable all three.
  • Make the command-line flag errors less cryptic.
  • Add readline (BSD editline) support.
  • Rename "grammar-parse" to the more logical "link-parser".
  • Small man page updates.
  • Export and cost, link cost via public API.

Version 4.3.4 (16 March 2008) includes the following changes:

  • Fix regression of handling of capitalization at the start of sentences.
  • Fix dictionary search path so that it respects command-line input.
  • Fix rare but nasty crash when parsing long sentences in panic mode.
  • Add a method to set the dictionary path.
  • Fix all remaining compiler warnings.
  • Make parser capable of handling UTF8 strings and dictionaries.
  • Ongoing minor expansion of the Lithuanian (lt) dictionary.

Version 4.3.3 (27 February 2008) includes the following changes:

  • Missing java is a warning, not an error.
  • man page for grammar-parse.
  • Removed cruft from the dictionary open routines.
  • configure tries to guess some non-standard jni.h locations.
  • Split up java library exports, should help cygwin builds.
  • Fix java library pre-linking bug.
  • Minor English dictionary additions.
  • Prototype Lithuanian (lt) dictionary.

Version 4.3.2 (2 February 2008) includes the following changes:

  • Fix dictionary errors involving given names; e.g. any sentence with the name "John" in it.
  • Minor Windows build fixes.

Version 4.3.1 (31 Jaunary 2008) includes the following changes:

  • Merger of extensive dictionary additions from Peter Szolovits. This adds 15K new words, bringing the dictionary to 70K words total.

Version 4.3.0 (30 January 2008) includes the following changes:

  • New link types (Ct, Cta, Rn, Rw) for comparatives, so as to link relative clauses: "John is bigger than Dave is", "John wants more cookies than Dave wants". The Rw link is used to link question words to the relative clauses that follow them.
  • Dictionary Fixes for "Espresso is a coffee drink", "Teach me fetch", "I am pooped" as synonym for "I am tired", "Mother likes her", "Mommy loves me" and related. Also, directives involving "go": "Go play ball", "Go take a walk", "You and Rover go play with the ball."
  • Dictionary support for external entity markup. This includes the recognition of personID0..personID60, dateID0..dateID60, organizationID0..organizationID60 and locationID0..locationID60 as appropriate words.
  • Fixes of numerous compile-time warnings.
  • Simple Java (JNI) bindings.

Version 4.2.5 (11 November 2007) includes the following changes:

  • Fix for a security problem, involving a buffer overflow: CVE-2007-5395.