15. Deduplication

OpenCRVS employs a robust matching algorithm, leveraging the capabilities of ElasticSearch. By automatically detecting potential duplicates, OpenCRVS prompts the Registrar to carefully review these declarations before proceeding with the registration process. This proactive approach ensures that duplicate records are promptly addressed and minimises the chances of duplicate civil registration records from existing. To learn more about how deduplication works please refer to OpenCRVS Specifications 1.4: Deduplication

Pre-condition

Declaration status is in-progress

Triggers

  • User with scope:record.declaration-send-for-review sends complete declaration for review

  • User with scope:record.declaration-send-for-approval starts and sends declaration for approval

  • User with scope:record.register starts and then registers a declaration

Standard flow

  1. Each trigger action listed above prompts a deduplication check

  2. If deemed a potential duplicate the declaration will show in the Ready for review workqueue for a user with scope:record.review-duplicates

  3. User assigns themselves to the potential duplicate declaration

  4. User clicks “Review”

  5. User is prompted to check the potential duplicate declaration against matched records

  6. User clicks on the tab of a flagged record

  7. User compares the potential duplicate declaration against the matched record side by side

  8. User reviews any other matched records

  9. User returns to the potential duplicate declaration view after making an assessment

  10. User clicks “Not a duplicate”

  11. User confirms their decision

Post conditions

  • Declaration status remains ‘In review’ and in Ready for Review workqueue.

  • Record audit is updated to show that the declaration was marked not a duplicate

Variations/Exceptions

Marking as a duplicate

  • User clicks “Mark as a duplicate”

  • User prompted to select the record it is a duplicate of

  • User prompted to add a reason for their decision

  • User confirms their decision to mark it as a duplicate

Post conditions

  • Declaration status changed to ‘Archived’. Declaration removed from all workqueues

  • Record audit is updated to show that the declaration was marked as a duplicate and was archived

Last updated