Friday, February 12, 2016

Export Text Mining

If you export the training data set out of the Text Topic node by running code in a SAS Code node that follows the Text Topic node:

/***********************************************************************************
SAS INSTITUTE INC. IS PROVIDING YOU WITH THE COMPUTER SOFTWARE CODE INCLUDED WITH THIS AGREEMENT ("CODE") ON AN "AS IS" BASIS, AND AUTHORIZES YOU TO USE THE CODE SUBJECT TO THE TERMS HEREOF.  BY USING THE CODE, YOU AGREE TO THESE TERMS.  YOUR USE OF THE CODE IS AT YOUR OWN RISK.  SAS INSTITUTE INC. MAKES NO REPRESENTATION OR WARRANTY, EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, NONINFRINGEMENT AND TITLE, WITH RESPECT TO THE CODE.
*************************************************************************************/

libname mylib 'c:\data';

data mylib.texttopic_output;
 set &em_import_data;
run;


The following variables are outputted as part of the exported training data set:

Original variables
TextTopic_raw<n>
TextTopic_<n>
_Document_

If you export the training data set out of the Text Cluster node by running code in a SAS Code node that follows the Text Cluster node:

/***********************************************************************************
SAS INSTITUTE INC. IS PROVIDING YOU WITH THE COMPUTER SOFTWARE CODE INCLUDED WITH THIS AGREEMENT ("CODE") ON AN "AS IS" BASIS, AND AUTHORIZES YOU TO USE THE CODE SUBJECT TO THE TERMS HEREOF.  BY USING THE CODE, YOU AGREE TO THESE TERMS.  YOUR USE OF THE CODE IS AT YOUR OWN RISK.  SAS INSTITUTE INC. MAKES NO REPRESENTATION OR WARRANTY, EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, NONINFRINGEMENT AND TITLE, WITH RESPECT TO THE CODE.
*************************************************************************************/

libname mylib 'c:\data';

data mylib.textcluster_output;
 set &em_import_data;
run;

The following variables are outputted as part of the exported training data set:

Original variables
TextCluster_cluster
TextCluster_SVD<n>
TextCluster_prob<n>
_Document_