Biml Language Reference
FuzzyLookup Element
The Fuzzy Lookup transformation looks up values in a reference data set by using fuzzy matching. That is, matches can be close rather than exact.
Attributes
  Attribute API Type Default Description
AutoPassThroughInputColumns Boolean
Specifies whether or not input columns to the Fuzzy Lookup node that lack an input mapping should be automatically set to passthrough.

[.NET API Property: AutoPassThroughInputColumns]

ConnectionName AstOleDbConnectionNode
This values specifies the OLE DB connection and database in which the fuzzy lookup reference table is located and where the fuzzy match index table is stored and accessed.

[.NET API Property: Connection]

CopyReferenceTable Boolean
This value specifies whether a copy of the table specified in the ReferenceTable should be made for the fuzzy lookup.

[.NET API Property: CopyReferenceTable]

Delimiters String
This value specifies which set of white-space and punctuation characters is used to separate fuzzy match fields into the words used for matching and scoring a fuzzy lookup.

[.NET API Property: Delimiters]

DropExistingIndex Boolean
This value specifies whether any existing indexes having the name specified in the MatchIndexName property should removed. The default is True.

[.NET API Property: DropExistingIndex]

Exhaustive Boolean
This value specifies whether each input record is directly compared against every record in the reference table. This option is primarily intended for debugging purposes and should be used with care. This method is extremely slow unless the number of records in the reference table is very small. Note also that the entire reference table is loaded into main memory if this option is enabled.

[.NET API Property: Exhaustive]

LocaleId Language
This value specifies which locale is used by the dataflow task.

[.NET API Property: LocaleId]

MatchIndexName String
This value specifies the name of the fuzzy-match index table to be reused or the table name in which a newly generated fuzzy-match index table will be written.

[.NET API Property: MatchIndexName]

MatchIndexOptions FuzzyLookupTransformMatchIndexOption
This value indicates whether an existing fuzzy-match index table is reused or a new one is built. The new index table can be temporary, persisted, or persisted and maintained.

[.NET API Property: MatchIndexOptions]

MaxMemoryUsage Int32
This value specifies the maximum amount of memory (MB) that the current instance of the Fuzzy Lookup transformation can use. If 0 is given, then the transform dynamically adjusts memory usage based upon its requirements and available system memory.

[.NET API Property: MaxMemoryUsage]

MaxOutputMatchesPerInput Int32
This value specifies which set of white-space and punctuation characters is used to separate fuzzy match fields into the words used for matching and scoring a fuzzy lookup.

[.NET API Property: MaxOutputMatches]

MinSimilarity Int32
This value specifies the minimum similarity score that is required in order for the transformation to return a match.

[.NET API Property: MinSimilarity]

Name String
Specifies the name of the object. This name can be used to reference this object from anywhere else in the program.

[.NET API Property: Name]

ValidateExternalMetadata Boolean
This value specifies whether the data flow transformation is validated against columns that originated in external data sources. When server assets such as tables and stored procedures are created during processing, ValidateExternalMetadata is normally set to False, which prevents validation from completing at compile time.

[.NET API Property: ValidateExternalMetadata]

WarmCaches Boolean
This value specifies whether the error-tolerant index and the reference table are partially loaded into main memory before the Fuzzy Lookup transformation begins. This can greatly enhance performance if many input rows are to be processed.

[.NET API Property: WarmCaches]

Singleton Children
  Child API Type Description
<ErrorHandling /> AstComponentErrorHandlingNode
This value specifies how errors are handled by default in columns processed by the component. This can be overriden at the component or column level for specific cases.

[.NET API Property: ErrorHandling]

<InputPath /> AstDataflowInputPathNode
This specifies the input path that will be used by this dataflow component. If an input path is not specified, the dataflow component will attempt to automatically discover an appropriate input path based on the surrounding dataflow.

[.NET API Property: InputPath]

<ReferenceTableInput /> AstTableResourceNode
This value specifies the name of the table or view against which incoming records are matched.

[.NET API Property: ReferenceTable]

<ExternalReferenceTableInput /> AstUnmanagedTableResourceNode
Collection Children
  Child API Type Description
<Annotations>
    <Annotation />
</Annotations>
AstAnnotationNode
This is a collection of annotation items that can be used to specify documentation, tags, or other information. Annotations are particularly useful for storing information about nodes that can be used by BimlScript code.

[.NET API Property: Annotations]

<Inputs>
    <Column />
</Inputs>
AstFuzzyLookupColumnMappingNode
This is a collection of input column mapping definitions that determine the behavior of specific columns and their comparisons.

[.NET API Property: Inputs]

<Outputs>
    <Column />
</Outputs>
AstDataflowColumnMappingNode
This is the collection of lookup output column mapping definitions.

[.NET API Property: Outputs]

<DataflowOverrides>
    Multiple Choices...
</DataflowOverrides>
AstDataflowOverrideNode
Provides a collection of objects to override properties of the component, its input paths, its output paths, and its consituent dataflow columns.

[.NET API Property: DataflowOverrides]