Biml Language Reference
FuzzyGrouping Element
The Fuzzy Grouping transformation groups data set rows that contain similar values.
Attributes
  Attribute API Type Default Description
ConnectionName AstOleDbConnectionNode
This value specifies the connection to an instance of SQL Server to use when creating temporary SQL Server tables required by the Fuzzy Grouping transformation algorithm.

[.NET API Property: Connection]

Delimiters String
This value specifies which white-space and punctuation characters are used to separate strings into multiple words.

[.NET API Property: Delimiters]

Exhaustive Boolean
This value specifies whether every input record is directly compared against every other input record. The default value is False. If set to True, performance will be extremely slow unless the number of input records is very small. This option is primarily intended for debugging purposes and should be used with care.

[.NET API Property: Exhaustive]

InputKeyColumnName String
This is the name of the dataflow output column that will contain the input key value.

[.NET API Property: InputKeyColumnName]

LocaleId Language
This value specifies which locale is used by the dataflow task.

[.NET API Property: LocaleId]

MaxMemoryUsage Int32
This value specifies which white-space and punctuation characters are used to separate strings into multiple words.

[.NET API Property: MaxMemoryUsage]

MinSimilarity Int32
This value specifies the minimum similarity threshold, expressed as a value between 0 and 100. The default value is 80.

[.NET API Property: MinSimilarity]

Name String
Specifies the name of the object. This name can be used to reference this object from anywhere else in the program.

[.NET API Property: Name]

OutputKeyColumnName String
This is the name of the dataflow output column that will contain the output key value.

[.NET API Property: OutputKeyColumnName]

SimilarityScoreColumnName String
This is the name of the dataflow output column that will contain the similarity score value.

[.NET API Property: SimilarityScoreColumnName]

ValidateExternalMetadata Boolean
This value specifies whether the data flow transformation is validated against columns that originated in external data sources. When server assets such as tables and stored procedures are created during processing, ValidateExternalMetadata is normally set to False, which prevents validation from completing at compile time.

[.NET API Property: ValidateExternalMetadata]

Singleton Children
  Child API Type Description
<ErrorHandling /> AstComponentErrorHandlingNode
This value specifies how errors are handled by default in columns processed by the component. This can be overriden at the component or column level for specific cases.

[.NET API Property: ErrorHandling]

<InputPath /> AstDataflowInputPathNode
This specifies the input path that will be used by this dataflow component. If an input path is not specified, the dataflow component will attempt to automatically discover an appropriate input path based on the surrounding dataflow.

[.NET API Property: InputPath]

Collection Children
  Child API Type Description
<Annotations>
    <Annotation />
</Annotations>
AstAnnotationNode
This is a collection of annotation items that can be used to specify documentation, tags, or other information. Annotations are particularly useful for storing information about nodes that can be used by BimlScript code.

[.NET API Property: Annotations]

<Columns>
    <Column />
</Columns>
AstFuzzyGroupingColumnMappingNode
This is a collection of mapping definitions with configuration for each column.

[.NET API Property: Columns]

<DataflowOverrides>
    Multiple Choices...
</DataflowOverrides>
AstDataflowOverrideNode
Provides a collection of objects to override properties of the component, its input paths, its output paths, and its consituent dataflow columns.

[.NET API Property: DataflowOverrides]