Biml Language Reference
TermExtraction Element
The Term Extraction transformation extracts terms from input text columns and directs the terms to output text columns.
Attributes
  Attribute API Type Default Description
CaseSensitiveTermExtraction Boolean
This value specifies whether the Term Extraction process is case sensitive. The default value is False.

[.NET API Property: CaseSensitiveTermExtraction]

ConnectionName AstOleDbConnectionNode
This value specifies the name of the OLE DB connection that is used to connect to the exclusion term table.

[.NET API Property: ExclusionTableConnection]

LocaleId Language
This value specifies which locale is used by the dataflow task.

[.NET API Property: LocaleId]

Name String
Specifies the name of the object. This name can be used to reference this object from anywhere else in the program.

[.NET API Property: Name]

ScoreOutputColumn String
This value specifies the name of the dataflow column to use for the score output. The default value is "Score".

[.NET API Property: ScoreOutputColumn]

ScoreType TermExtractionTransformScoreType
This value specifies what kind of score is associated with an extracted term. The default value is that the frequency of a term is used as the score.

[.NET API Property: ScoreType]

TermExtractionColumn String
This value specifies the name of the term extraction column.

[.NET API Property: TermExtractionColumn]

TermFrequencyThreshold Int32
Specifies the frequency threshold above which a term is extracted.

[.NET API Property: TermFrequencyThreshold]

TermMaxLength Int32
This value specifies the maximum number of words allowed in a term. The default value is 12.

[.NET API Property: TermMaxLength]

TermOutputColumn String
This value specifies the name of the dataflow column to use for the term output. The default value is "Term".

[.NET API Property: TermOutputColumn]

TermType TermExtractionTransformTermType
This value specifies the type of terms to extract. The default value is that noun and noun phrases are extracted.

[.NET API Property: TermType]

ValidateExternalMetadata Boolean
This value specifies whether the data flow transformation is validated against columns that originated in external data sources. When server assets such as tables and stored procedures are created during processing, ValidateExternalMetadata is normally set to False, which prevents validation from completing at compile time.

[.NET API Property: ValidateExternalMetadata]

Singleton Children
  Child API Type Description
<ErrorHandling /> AstComponentErrorHandlingNode
This value specifies how errors are handled by default in columns processed by the component. This can be overriden at the component or column level for specific cases.

[.NET API Property: ErrorHandling]

<ExclusionTermTableColumnInput /> AstTableColumnResourceNode
This values specifies which column contains terms that are skipped during the processing of extracting terms from the data source.

[.NET API Property: ExclusionTermColumn]

<ExclusionTermExternalTableColumnInput /> AstUnmanagedTableColumnResourceNode
<InputPath /> AstDataflowInputPathNode
This specifies the input path that will be used by this dataflow component. If an input path is not specified, the dataflow component will attempt to automatically discover an appropriate input path based on the surrounding dataflow.

[.NET API Property: InputPath]

Collection Children
  Child API Type Description
<Annotations>
    <Annotation />
</Annotations>
AstAnnotationNode
This is a collection of annotation items that can be used to specify documentation, tags, or other information. Annotations are particularly useful for storing information about nodes that can be used by BimlScript code.

[.NET API Property: Annotations]

<DataflowOverrides>
    Multiple Choices...
</DataflowOverrides>
AstDataflowOverrideNode
Provides a collection of objects to override properties of the component, its input paths, its output paths, and its consituent dataflow columns.

[.NET API Property: DataflowOverrides]