Usuwanie duplikatów z 1,6mln rekordów

Usuwanie duplikatów z 1,6mln rekordów

1q2w3e4r Zobacz profil	13.03.2012, 17:57:54 Post #1
Grupa: Zarejestrowani Postów: 238 Pomógł: 0 Dołączył: 6.05.2011 Ostrzeżenie: (10%)	Witam, Muszę usunąć duplikaty z 1,6mln rekordów.. Zastanawiam się jak to zrobić najszybciej. Duplikaty mają się usuwać wierszami, czyli duplikaty jeśli jeden wiersz nie może się równać innemu. Macie jakieś propozycje jak to zrobić? Proszę o szybka odpowiedź.

Odpowiedzi

cojack Zobacz profil	14.03.2012, 16:02:43 Post #2
Grupa: Zarejestrowani Postów: 898 Pomógł: 80 Dołączył: 31.05.2008 Ostrzeżenie: (20%)	Z http://dev.mysql.com/doc/refman/5.0/en/delete.html lekko przerobione: Cytat If you are deleting many rows from a large table, you may exceed the lock table size for an InnoDB table. To avoid this problem, or simply to minimize the time that the table remains locked, the following strategy (which does not use DELETE at all) might be helpful: Select the rows not to be deleted into an empty table that has the same structure as the original table: Kod INSERT INTO t_copy SELECT DISTINCT * FROM t; Use RENAME TABLE to atomically move the original table out of the way and rename the copy to the original name: Kod RENAME TABLE t TO t_old, t_copy TO t; Drop the original table: Kod DROP TABLE t_old;

Posty w temacie

1q2w3e4r Usuwanie duplikatów z 1,6mln rekordów 13.03.2012, 17:57:54

toniq pobierasz wszystkie posortowane po tym co chcesz s... 13.03.2012, 18:35:55

1q2w3e4r Tak, ale to jest 1 600 000 rekordów, każdy przez k... 14.03.2012, 14:46:02

alegorn duplikaty? w sensie na caly rekord? czy na konkret... 14.03.2012, 15:05:59

cojack Z http://dev.mysql.com/doc/refman/5.0/en/delete.ht... 14.03.2012, 16:02:43

alegorn @cojak w zasadzie to tosamo co i ja zaproponowalem... 14.03.2012, 17:09:31

toniq wersja z php ma pewna zaletę ze możesz dowolnie fi... 15.03.2012, 08:00:12

alegorn toniq:: kilka sekund?? dyskusyjne. no ale nawet gd... 15.03.2012, 10:56:26

cudny Cytat(toniq @ 15.03.2012, 08:00:12 ) ... 15.03.2012, 21:48:11

2 Użytkowników czyta ten temat (2 Gości i 0 Anonimowych użytkowników)

0 Zarejestrowanych:

Tryb wyświetlania: Przełącz na: Standardowy · Przełącz na: Linearny+ · Drzewo

Aktualny czas: 17.10.2025 - 23:03

Hosting zapewnia