A novel information extraction strategy for Chinese free-text electronic medical records (EMRs) is proposed.
Both a rule-based method and a sequential labeling method (CRF) are explored.
In total, 12 important data elements related to hepatic carcinomas are extracted.
Two boundary matching strategies (exact and overlapping) are introduced for evaluation.
This work provides some insights for Chinese natural language processing.
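
The two boundary matching strategies used for evaluation can be illustrated with a minimal sketch. It assumes each extracted element is represented as a (start, end, label) character span with an exclusive end offset; the function names, the span representation, and the labels `tumor_size` and `tumor_location` are hypothetical and chosen only for illustration, not taken from the work itself. Under exact matching a predicted span counts as correct only if its boundaries and label equal those of a gold span, while under overlapping matching it counts as correct if it shares the label and at least one character with a gold span.

```python
from typing import Callable, List, Tuple

# Hypothetical span representation: (start, end_exclusive, label)
Span = Tuple[int, int, str]


def exact_match(pred: Span, gold: Span) -> bool:
    """Correct only if both boundaries and the label are identical."""
    return pred == gold


def overlap_match(pred: Span, gold: Span) -> bool:
    """Correct if the labels agree and the spans share at least one character."""
    return pred[2] == gold[2] and pred[0] < gold[1] and gold[0] < pred[1]


def evaluate(
    preds: List[Span],
    golds: List[Span],
    match: Callable[[Span, Span], bool],
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) under the given matching strategy."""
    # Predicted spans that match some gold span
    matched_preds = sum(any(match(p, g) for g in golds) for p in preds)
    # Gold spans that are matched by some predicted span
    matched_golds = sum(any(match(p, g) for p in preds) for g in golds)
    precision = matched_preds / len(preds) if preds else 0.0
    recall = matched_golds / len(golds) if golds else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Illustrative data only: two gold spans, three predicted spans
golds = [(0, 4, "tumor_size"), (10, 15, "tumor_location")]
preds = [(0, 4, "tumor_size"), (11, 15, "tumor_location"), (20, 22, "tumor_size")]

exact_scores = evaluate(preds, golds, exact_match)
overlap_scores = evaluate(preds, golds, overlap_match)
print("exact:  ", exact_scores)
print("overlap:", overlap_scores)
```

In this toy example the second prediction misses the gold start offset by one character, so it is rejected under exact matching but accepted under overlapping matching. Overlapping scores are therefore never lower than exact scores on the same data.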