RecordType - the type that represents a recordpublic class DataFusionEngine<RecordType extends Matchable & Fusible<SchemaElementType>,SchemaElementType extends Matchable> extends Object
DataFusionStrategy.| Constructor and Description | 
|---|
DataFusionEngine(DataFusionStrategy<RecordType,SchemaElementType> strategy)
Creates a new instance that uses the specified data fusion strategy. 
 | 
| Modifier and Type | Method and Description | 
|---|---|
Map<String,Double> | 
getAttributeConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences,
                         Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Calculates the consistencies of the attributes of the records in the
 given correspondence set according to the data fusion strategy 
 | 
Set<Pair<RecordGroup<RecordType,SchemaElementType>,Double>> | 
getRecordGroupConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences,
                           Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Calculates the consistencies of the record groups in the
 given correspondence set according to the data fusion strategy 
 | 
DataFusionStrategy<RecordType,SchemaElementType> | 
getStrategy()  | 
void | 
printClusterConsistencyReport(CorrespondenceSet<RecordType,SchemaElementType> correspondences,
                             Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Calculates the consistencies of the attributes of the records in the
 given correspondence set according to the data fusion strategy and prints
 the results to the console 
 | 
FusibleDataSet<RecordType,SchemaElementType> | 
run(CorrespondenceSet<RecordType,SchemaElementType> correspondences,
   Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
Runs the data fusion process on the provided set of correspondences and returns a fused data set 
 | 
void | 
writeRecordGroupsByConsistency(File path,
                              CorrespondenceSet<RecordType,SchemaElementType> correspondences,
                              Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)  | 
public DataFusionEngine(DataFusionStrategy<RecordType,SchemaElementType> strategy)
strategy - the fusion strategypublic DataFusionStrategy<RecordType,SchemaElementType> getStrategy()
public FusibleDataSet<RecordType,SchemaElementType> run(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
correspondences - correspondences between the recordsschemaCorrespondences - optional schema correspondences between the recordsFusibleDataSet based on the RecordType of the
         CorrespondenceSetpublic Map<String,Double> getAttributeConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
correspondences - correspondences between the recordsschemaCorrespondences - correspondences between the schema elementspublic Set<Pair<RecordGroup<RecordType,SchemaElementType>,Double>> getRecordGroupConsistencies(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
correspondences - correspondences between the recordsschemaCorrespondences - correspondences between the schema elementspublic void printClusterConsistencyReport(CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences)
correspondences - correspondences between the recordsschemaCorrespondences - correspondences between the schema elementspublic void writeRecordGroupsByConsistency(File path, CorrespondenceSet<RecordType,SchemaElementType> correspondences, Processable<Correspondence<SchemaElementType,Matchable>> schemaCorrespondences) throws IOException
IOExceptionCopyright © 2018. All rights reserved.