Automatic Domain Terminology Extraction System
Welcome to "Gensen Web"

You can extract valuable domain-specific terms from Web pages or from text you enter. The extracted terms are sorted and displayed in descending order of importance; in other words, they are well-selected terms. Hence the name of this system, "Gensen," which means "well selected."
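As a rough illustration of what "sorted in descending order of importance" means for the output, here is a minimal Python sketch. The function name `rank_terms`, the sample candidate list, and the scoring rule are all assumptions made for this example: the score is a plain occurrence count, used only as a stand-in, and is not the importance measure that Gensen Web or TermExtract actually computes.

```python
from collections import Counter


def rank_terms(candidates):
    """Return (term, score) pairs sorted by descending score.

    The score is a simple occurrence count, chosen only for
    illustration; it is NOT the scoring used by Gensen Web.
    """
    counts = Counter(candidates)
    # Highest score first; ties are broken alphabetically.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical candidate terms, as if already extracted from a document.
candidates = [
    "term extraction", "web page", "term extraction",
    "POS tagger", "web page", "term extraction",
]

ranked = rank_terms(candidates)
for term, score in ranked:
    print(f"{score}\t{term}")
```

The real system presents its results in the same shape, a list of terms with the most important first, but derives the scores with its own method.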

The "Gensen Web" system is a Web version of the original term extraction system "TermExtract," which is written in Perl. Its functionality is somewhat limited compared with the original stand-alone version.

Usage

Enter the URL of a Web page (HTML or PDF) from which you want to extract terms. Alternatively, enter the document text directly (for example, by copying and pasting), or select a file on your local PC (text file or PDF only).
Choose the POS tagger version: high quality but slow, or high speed but slightly lower quality.
Click the "start" button.
Wait a moment; the extracted terms are then displayed in sorted order.

Introduction to the stand-alone system "termex" (in Japanese)

Introduction to the text mining tool "termmi" (in Japanese)

Documentation of the Perl module "TermExtract" (in Japanese)

Documentation of the Python3 module "termextract" (in Japanese)

Top Page

Comments welcome to
