Type Parameters:
RecordType - the type that represents a record

public class DataFusionEngine<RecordType extends Matchable & Fusible<SchemaElementType>,SchemaElementType extends Matchable> extends Object

DataFusionStrategy.

| Constructor and Description |
|---|
| DataFusionEngine(DataFusionStrategy<RecordType,SchemaElementType> strategy) Creates a new instance that uses the specified data fusion strategy. |

| Modifier and Type | Method and Description |
|---|---|
| Map<String,Double> | getAttributeConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences) Calculates the consistencies of the attributes of the records in the given correspondence set according to the data fusion strategy. |
| Set<Pair<RecordGroup<RecordType,SchemaElementType>,Double>> | getRecordGroupConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences) Calculates the consistencies of the record groups in the given correspondence set according to the data fusion strategy. |
| DataFusionStrategy<RecordType,SchemaElementType> | getStrategy() |
| void | printClusterConsistencyReport(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences) Calculates the consistencies of the attributes of the records in the given correspondence set according to the data fusion strategy and prints the results to the console. |
| FusibleDataSet<RecordType,SchemaElementType> | run(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences) Runs the data fusion process on the provided set of correspondences and returns a fused data set. |
| void | writeRecordGroupsByConsistency(File path, CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences) |

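The summary above describes an engine that is constructed with a strategy, fuses groups of corresponding records, and reports per-attribute consistencies. The following is a minimal, self-contained sketch of that pattern and not the library's implementation. Everything in it is a hypothetical stand-in: `FusionEngineSketch`, its nested `Strategy` interface, `firstValueWins`, and `demoGroups` do not exist in the documented API, records are reduced to plain `Map<String,String>` values that are already grouped, and the consistency definition (share of groups whose values for an attribute all agree) is an assumption that this page does not confirm.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

/** Hypothetical stand-in for an engine that delegates fusion to a strategy. */
public class FusionEngineSketch {

    /** Hypothetical strategy: decides how one group of matching records is fused. */
    public interface Strategy {
        Map<String, String> fuse(List<Map<String, String>> group);
    }

    private final Strategy strategy;

    public FusionEngineSketch(Strategy strategy) {
        this.strategy = strategy;
    }

    public Strategy getStrategy() {
        return strategy;
    }

    /** Fuses every group into a single record by delegating to the strategy. */
    public List<Map<String, String>> run(List<List<Map<String, String>>> groups) {
        List<Map<String, String>> fused = new ArrayList<>();
        for (List<Map<String, String>> group : groups) {
            fused.add(strategy.fuse(group));
        }
        return fused;
    }

    /**
     * Assumed definition: for each attribute, the share of groups (among those
     * holding at least one value for it) in which all values are identical.
     */
    public Map<String, Double> getAttributeConsistencies(List<List<Map<String, String>>> groups) {
        Map<String, int[]> counts = new TreeMap<>(); // attribute -> {consistent groups, groups with a value}
        for (List<Map<String, String>> group : groups) {
            // Collect the distinct values each attribute takes within this group.
            Map<String, Set<String>> distinct = new TreeMap<>();
            for (Map<String, String> rec : group) {
                for (Map.Entry<String, String> e : rec.entrySet()) {
                    distinct.computeIfAbsent(e.getKey(), k -> new HashSet<>()).add(e.getValue());
                }
            }
            for (Map.Entry<String, Set<String>> e : distinct.entrySet()) {
                int[] c = counts.computeIfAbsent(e.getKey(), k -> new int[2]);
                if (e.getValue().size() == 1) {
                    c[0]++; // exactly one distinct value: the group is consistent
                }
                c[1]++;
            }
        }
        Map<String, Double> result = new TreeMap<>();
        for (Map.Entry<String, int[]> e : counts.entrySet()) {
            result.put(e.getKey(), (double) e.getValue()[0] / e.getValue()[1]);
        }
        return result;
    }

    /** Example strategy: the first value seen for each attribute wins. */
    public static Map<String, String> firstValueWins(List<Map<String, String>> group) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map<String, String> rec : group) {
            for (Map.Entry<String, String> e : rec.entrySet()) {
                out.putIfAbsent(e.getKey(), e.getValue());
            }
        }
        return out;
    }

    /** Two small groups of invented data, for illustration only. */
    public static List<List<Map<String, String>>> demoGroups() {
        return List.of(
            List.of(Map.of("title", "Alien", "year", "1979"),
                    Map.of("title", "Alien", "year", "1980")),
            List.of(Map.of("title", "Heat", "year", "1995"),
                    Map.of("title", "Heat")));
    }

    public static void main(String[] args) {
        FusionEngineSketch engine = new FusionEngineSketch(FusionEngineSketch::firstValueWins);
        List<List<Map<String, String>>> groups = demoGroups();
        System.out.println(engine.run(groups));
        System.out.println(engine.getAttributeConsistencies(groups));
    }
}
```

With the invented demo data, `title` agrees in both groups and `year` agrees in only one of the two, so under the assumed definition the consistencies are 1.0 for `title` and 0.5 for `year`. Passing the strategy to the constructor keeps the engine independent of any particular conflict-resolution rule, which mirrors the constructor signature documented here.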
public DataFusionEngine(DataFusionStrategy<RecordType,SchemaElementType> strategy)
Parameters:
strategy - the fusion strategy

public DataFusionStrategy<RecordType,SchemaElementType> getStrategy()

public FusibleDataSet<RecordType,SchemaElementType> run(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Parameters:
correspondences - correspondences between the records
schemaCorrespondences - optional schema correspondences between the records
Returns:
FusibleDataSet based on the RecordType of the CorrespondenceSet

public Map<String,Double> getAttributeConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Parameters:
correspondences - correspondences between the records
schemaCorrespondences - correspondences between the schema elements

public Set<Pair<RecordGroup<RecordType,SchemaElementType>,Double>> getRecordGroupConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Parameters:
correspondences - correspondences between the records
schemaCorrespondences - correspondences between the schema elements

public void printClusterConsistencyReport(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Parameters:
correspondences - correspondences between the records
schemaCorrespondences - correspondences between the schema elements

public void writeRecordGroupsByConsistency(File path, CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences) throws IOException
Throws:
IOException

Copyright © 2018. All rights reserved.