Using MaxValue and MinValue strategy
These built-in strategies handle duplicated rows in which the value of one column (a number, string, date, etc.) determines which row takes priority. A typical example: when two or more similar devices are found, keep only the one that was created most recently and ignore the others.
These strategies can only be used if a column name is provided, which is done via the value parameter:
{
  "id": 11,
  "name": "Deduplicate table Duplicates, take random",
  "type": "deduplicate",
  "source": "Duplicates",
  "target": "Normalized_Duplicates_Random",
  "by": [ "cn", "dn" ],
  "strategy": {
    "name": "MaxValue",
    "value": "LastSeenDate"
  }
}
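To illustrate what the step above is expected to do, the following Python sketch mimics the selection logic outside of the ETL tool. It is not the product's implementation. It assumes that MaxValue keeps the row with the highest value in the given column and MinValue the row with the lowest, and that rows are grouped by the columns listed in "by". The function name deduplicate, the sample rows and the tie handling (the first row encountered wins) are hypothetical and chosen only for this example.

```python
from datetime import date


def deduplicate(rows, by, value, strategy="MaxValue"):
    """Keep one row per distinct combination of the `by` columns.

    Hypothetical helper: approximates the assumed MaxValue/MinValue
    semantics, it is not the ETL engine's own code.
    """
    if strategy not in ("MaxValue", "MinValue"):
        raise ValueError(f"unsupported strategy: {strategy}")
    pick = max if strategy == "MaxValue" else min

    # Group rows that share the same values in all `by` columns.
    groups = {}
    for row in rows:
        key = tuple(row[col] for col in by)
        groups.setdefault(key, []).append(row)

    # From each group keep the row with the highest/lowest `value` column.
    return [pick(group, key=lambda r: r[value]) for group in groups.values()]


# Sample data (made up): PC01 appears twice with different LastSeenDate values.
rows = [
    {"cn": "PC01", "dn": "corp.local", "LastSeenDate": date(2023, 1, 5)},
    {"cn": "PC01", "dn": "corp.local", "LastSeenDate": date(2023, 6, 20)},
    {"cn": "PC02", "dn": "corp.local", "LastSeenDate": date(2023, 3, 1)},
]

latest = deduplicate(rows, by=["cn", "dn"], value="LastSeenDate", strategy="MaxValue")
oldest = deduplicate(rows, by=["cn", "dn"], value="LastSeenDate", strategy="MinValue")
```

With this sample data, latest keeps the PC01 row seen on 2023-06-20, while oldest keeps the one seen on 2023-01-05. PC02 has no duplicate and is passed through unchanged in both cases. How the actual step treats ties or empty values is not covered by this sketch.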