The OraTrainer: How to find duplicate records in a table

Saturday, October 26, 2013

How to find duplicate records in a table

A usual situation we come across in development: a table missed a unique key and allowed duplicate rows to be entered unchecked. Now we want to find and delete those duplicates.

In this article, we’ll see how to achieve this.

Consider the table dup_emp with these values:

SQL> select *

  2  from dup_emp

  3  order by empno;

     EMPNO ENAME                 SAL

---------- -------------- ----------

         1 Adam                  400

         2 Sandy                 300

         2 Ted                   450

         3 Mark                  450

         4 Alan                  450

         4 Carol                 200

         4 Peter                 250

         5 David                 350

8 rows selected.

As you can see, there are two entries for empno 2 and three entries for empno 4. We want each empno to correspond to a single employee only.

The query below finds all duplicate records i.e. all empnos with more than one entry in the table:

SQL> select *

  2  from

  3  (select d.*

  4        , count(*) over

  5          (partition by empno) cnt

  6   from dup_emp d

  7  )

  8  where cnt > 1;

     EMPNO ENAME   SAL        CNT

---------- ------ ---- ----------

         2 Sandy   300          2

         2 Ted     450          2

         4 Alan    450          3

         4 Carol   200          3

         4 Peter   250          3

Deleting the duplicate records

The usual need is to delete all but one such row. The first thing is to know the deciding factor -out of the duplicates, which one is to be retained?

The next SQL retains the row with the minimum rowid:

SQL> delete from dup_emp a

  2  where rowid >

  3    (select min(rowid)

  4     from dup_emp b

  5     where a.empno = b.empno);

3 rows deleted.

SQL> select *

  2  from dup_emp;

     EMPNO ENAME   SAL

---------- ------ ----

         1 Adam    400

         2 Sandy   300

         3 Mark    450

         4 Alan    450

         5 David   350

Change the WHERE condition according to your needs.

2 comments:

UnknownTuesday, April 22, 2014 5:20:00 PM
This comment has been removed by the author.
ReplyDelete
Replies
AnonymousTuesday, April 22, 2014 5:22:00 PM
Thanks for the post :-) This Qs was asked to me in interview
ReplyDelete
Replies

Add comment

The Oracle Trainer!

Welcome to The Oracle Trainer!

Ram (seriously... not just my screen name), born and raised in the Erode area, has been forced (not really)

to live in Chennai for the past 5 years. A seasoned writer and editor,

I am glad I (somehow,) made it here! I have not been successful tech blogger.

I've been anxious to write lately, but my muse has declined since then and I find it harder and harder to express myself. Here's to hoping that I can improve my blogging skills. I'm glad you found your way into the mists, and I hope we're able to provide you with the necessary Things on oracle forms and reports related stuffs and quality to help you make the improvements your desire.

And if you have absolutely any questions or concerns or ideas or whatever related to Oracle Sql, Plsql, forms And reports, eBS you can contact me (or any of the other staff members because there's always at least one of us floating around the boards).

I look forward to meeting you in the Blog section!

Anyway, enjoy your Reading.

The OraTrainer

Saturday, October 26, 2013

How to find duplicate records in a table

Deleting the duplicate records

2 comments:

Total Pageviews

Linked In

Search This Blog

Wikipedia