Archives of the TeradataForum

Message Posted: Tue, 25 Feb 2003 @ 14:02:00 GMT


	<Prev	Next>		<<First	<Prev	Next>	Last>>

Subj:		Re: Couting duplicate rows thru multi load

From:		Jose Lora

You could use Fastload + INMOD (that will add a record id). This will give you the fastest loading time. Once in the database, use your SQL skills to locate / remove / etc. your duplicated records.

Now, there are some ways to avoid the INMOD, one is using an ETL tool that will allow you to read data from a file (or any other source), do transformations on the fly (calculate the new record id) and send the data directly to a Teradata Utility (streaming data to Fastload). The idea of course is avoid intermediate staging files outside Teradata and use parallelism to locate the duplicated rows.

Jose Lora
Meredith Corporation.


	<Prev	Next>		<<First	<Prev	Next>	Last>>

Archives

2016		2007
2015		2006
2014		2005
2013		2004
2012		2003
2011		2002
2010		2001
2009		2000
2008		1999

2003 Indexes

Jan		Jul
Feb		Aug
Mar		Sep
Apr		Oct
May		Nov
Jun		Dec

Last Modified: 15 Jun 2023