Automatic Domain Terminology Extraction System
Welcome to "Gensen Web"
You can extract valued domain specific terms from
Web pages or text you input. The extracted terms are sorted and displayed
in descending order of their importance in other words, the extracted terms
are well selected ones: thus the name of this system is
"Gensen" which means "well selected."
"Gensen Web" system is a Web version
of the original term extraction system "TermExtract" written in Perl. The
function is a little bit limited compared to the original stand-alone
version.
Usage
-
Input URL of Web page written in HTML or PDF
from which you want to extract terms. Or input,
probably copy and paste document. Or select your local PC file (text file or PDF only).
-
Choose POS tagger version: highquality but slow or
high speed version: but a little bit less quality
-
Click the "start" button.
-
Wait a while, then the extracted and sorted terms are displayed.
Comments welcome to