NLP++ Language Extension
Many of you have been asking for tutorial videos on NLP++ and here is the first set. More coming soon...
Only Computer Language Dedicated to Natural Language Processing
NLP++ is the only computer language in the world exclusively dedicated to natural language processing. It allows for creating digital human readers that use linguistic and world knowledge to parse and understand text. Whether it be for extracting or marking up "messy" text or full-blown NLP understanding, this language allows for "anything thinkable".
NLP++ and VisualText Now Unicode
The NLP-Engine for NLP++ now works with Unicode (UTF8 via the ICU C++ Package) including the ability to work with emojis.
Quick Video Guides
Find quick video guides on how to install and use VisualText including a "hello world" video as well as an in-depth tour of VisualText.
This is a VSCode Language Extension for NLP++ that recreates the functionality of VisualText which has run on Microsoft Windows for the last two decades. NLP++ is a open source computer language specifically dedicated to creating text analyzers that mimic human readers and includes the NLP++ language and knowledge based system called the "conceptual grammar". NLP++ is used for any type of text processing from simple tagging or extraction, to full language parsing. There is a full english parser that is free an available for use (see information below).
The language extension and the required NLP-ENGINE run on Linux, Windows, and MacOS.
The VSCode NLP++ Language Extension allows for the fast development of NLP++ analyzers allowing users to:
NLP++ Example Analyzers
Example analyzers can be found in the "analyzers" folder in the NLP-ENGINE folder.
In order to use the VSCode NLP++ Language Extension, the NLP-ENGINE which is in the form of an executable and directory need to be present. Version one now includes this as part of the NLP language extension.
NLP Engine Overview
The NLP-ENGINE now comes with the NLP++ Language extension but is available separately from the VisualText github repository. The engine can run as a standalone executable (nlp.exe) that runs on Linux, Windows, and MacOS or it can be embedded into c++ code.
Types of Analyzers Commonly Written Using NLP++
There are many types of analyzers that are written by NLP++ programmers including:
This file will automatically get generated when a new analyzer is created in VisualText VSCode. It is located in the .vscode directory under the folder for an individual analyzer.
There are many details in the windows version of VisualText that are yet to be implemented in the VSCode version.
Added generate main.kb
Added help for special variables with dollar signs
Added analyzer processing queue for running multiple folders
Added "Split directory" to textview to split large directories into smaller subdirectories
Added keybindings for reformat rule, final tree, and pass tree for .nlp and unfold all for .tree
Added descriptive tooltip to tokenizer path in the analyzer sequence. Fixed renaming.
Add properties to files and folders. Sped up file operations, fixed log deletion bugs, added cancel file operations, added file operation counts display.
Added struniquechars to NLP++ and snippets
Added refresh treeviews array to fileops
Added .dict files to be text files for dictionaries where each meaning for each word occupies one line
Added webitekb function
Added a KB View List
Unicode fixes for colorizing
Major fixes to the textview conext menu
Check for older pat files more obvious and clean, misc fixes.
Added cross reference context menu items in nlp files, tree files, and txxt files
Added duplicate analyzer in the same folder. Fixed delete messages, generate exact rule, deleting analyzer logs, and comments that were removing $ variables.
Added fold and unfold recursively in trees and kbb displays
Added chartok tokenize which breaks apart all characters
Added windows Help.chm file in context menu for windows only
Added cancel analyzer during processing
Don't open text file when analyzing. Also @MULTI fixes and other fixes.
Fixed numerous problems with fileOps
Added "Generate @PATH" from tree
Added icons to output view
Fixed version number bug in linux and mac
Added unicode character offsets to trees
Mac and Linux fixes
Now uses the unicode version of the nlp-engine
Now handles utf8 files and characters (unicode)
Various minor issues fixed
Added stable file and directory deletes and copies
Added copy single analyzer or all analyzers to chosen folder
Added analyzer operation queue
Added sort & unique to text files
Major overhaul on updater
Reveals sequence number for an nlp file. This will eventually select the sequence item once Microsoft fully implements the reveal function for tree items.
Added delete logs to textview. Initialization messages now to LOGGING tab.
Simple search for function. This is a workable hack.
Added online browser lookup help where links work. This is to fix the fact that links don't work in the WebView inside VSCode.
Add clearing log folders for all analyzers for archiving analyzers
Copies settings.json file to the current work file for special NLP++ colorization
Added comment / uncomment lines
VSCode Language Extension
Follow these instructions to install the development code for VSCode extension:
At which point you can edit the sources and launch debug sessions via F5 and included launch configurations.
The NLP Engine which is written in C++ is also open source and can be downloaded in development mode. You can find how to download and develop the NLP Engine code at: https://github.com/VisualText/nlp-engine.