Anatomy of a DKPro TC experiment
Subsequently, we introduce how to configure a DKPro TC experiement by discussing a minimal setup.
Results of an experiment
The results are written to the folder provided as DKPRO_HOME
directory. The subfolder contain all output written by an experiment, and not just the final results. The folder with the results is the Evaluation-*
folder. The other folders are probably not of importance for using DKPRo TC, but we explain their content yet briefly. For a train-test experiment, the following folders are created:
- InitTask-Train-ExperimentName-*
- InitTask-Test-ExperimentName-*
- OutcomeCollectionTask-ExperimentName-*
- MetaInfoTask-ExperimentName-*
- ExtractFeaturesTask-Train-ExperimentName-*
- ExtractFeaturesTask-Test-ExperimentName-*
- DKProTcShallowTestTask-ExperimentName-*
- <MachineLearningAdapter>-ExperimentName-*
- Evaluation-ExperimentName-*
The InitTask
folders contain the provided training and testing data converted into an internal data format. OutcomeCollectionTask
collects all occurring labels in the training and testing data (or nothing if its regression). MetaInfoTask
prepares the usage of features that use a frequency cut-off, i.e. the word-ngram feature that is used in the experimental setup. ExtractFeatureTask
contain the extracted features in the data format the respective classifier expects. DKProTcShallowTestTask
and <MachineLearningAdapter>
execute the actual classifier with the feature data extracted before. The results per instance and some more low-level information can be found in the <MachineLearningAdapter>
folder.