Hepatocellular carcinoma is one of the most lethal cancers worldwide. More accurate stratification of patients at risk is necessary to improve its clinical management. As epithelial-mesenchymal transition is critical for the invasiveness and metastasis of human cancers, we investigated expression profiles of 12 genes related to epithelial-mesenchymal transition through a real-time polymerase chain reaction. From a univariate Cox analysis for a training cohort of 128 hepatocellular carcinoma patients, four candidate genes (E-cadherin [CDH1], inhibitor of DNA binding 2 [ID2], matrix metalloproteinase 9 [MMP9], and transcription factor 3 [TCF3]) with significant prognostic values were selected to develop a risk score of patient survival. Patients with high risk scores calculated from the four-gene signature showed significantly shorter overall survival times. Moreover, the multivariate Cox analysis revealed that four-gene signature (P = 0.0026) and tumor stage (P = 0.0023) were independent prognostic factors for overall survival. Subsequently, the four-gene signature was validated in an independent cohort of 231 patients from three institutions, in which high risk score was significantly correlated with shorter overall survival (P = 0.00011) and disease-free survival (P = 0.00038). When the risk score was entered in a multivariate Cox analysis with tumor stage only, both the risk score (P = 0.0046) and tumor stage (P = 2.6 x 10(-9)) emerged as independent prognostic factors. In conclusion, we suggest that the proposed gene signature may improve the prediction accuracy for survival of hepatocellular carcinoma patients, and complement prognostic assessment based on important clinicopathologic parameters such as tumor stage.