Duplicates | Kirix Strata Blog

Kirix Strata Blog

Archive for the ‘duplicates' Category

Using Relationships to Compare Lists of Emails

Thursday, July 31st, 2008

In a previous article, we were concerned with identifying and removing duplicate emails within a single table. But let's say that email addresses are located in multiple tables — how do we find which email addresses are duplicated in both lists?

To answer this question, we have to break out one of Strata's more powerful features called relationships.

In Strata, relationships allow you to match records in one table with records in another table based on a common value. In this example, the common value is a specific email address found in both tables. For instance, let's suppose we have two lists of email addresses stored in two tables, email_list1 and email_list2.

In email_list1, we have the following values:

email_list1
--------------------
second@email.com
third@email.com
fourth@email.com
fifth@email.com
sixth@email.com
seventh@email.com

(more…)

Removing or Consolidating Duplicate Email Addresses from Website Form Data

Thursday, June 19th, 2008

If you have text entry forms on your website, you probably have data with duplicate email addresses. These duplicates can either be from data entered twice into the same form or are duplicates from merging data from multiple applications/forms.

So, two questions arise:

  1. How do you identify these duplicates?
  2. How do you either remove them or group them together to track the related information?

Using Kirix Strata's grouping functionality, it's actually pretty easy. You can quickly identify duplicates from your website data and then either remove the duplicates or consolidate the different records into groups of related records. Let's look at the problem more closely.

Suppose you have a web page that asks your visitors for the following feedback information:

Dupformdata1

Visitors will enter their information, including their email address, which allows you to respond to them. However, if the visitor stops by again in the future, you'll have multiple records from the same person and therefore duplicated email addresses:

Screenshot - Dupform2

(more…)