|
Archives of the TeradataForumMessage Posted: Mon, 10 Jan 2011 @ 14:26:10 GMT
Anshuman.Singh wrote:
??? First you write it should be fast, that's why it's discarding dups. Then you write duplicate row checks will slow down performance? And duplicate rows will never be on different AMPs, because they must have the same PI-values. FastLoad discards duplicate rows, because it doesn't have/store any information about the input record sequence like MultiLoad's Match Tag (ApplySeq+DMLSeq+ImportSeq+SMTSeq+SourceSeq), thus it simply doesn't know, if a row was duplicate within the data or was sent twice because of a restarted FastLoad (in Application Phase). If FastLoad would be able to load MultiSet like MLoad, there would be more overhead regarding perm space. Currently the intermediate size of the target table is (almost) the same as the final size and this is one of the big advantages of FastLoad over MLoad. I think FastLoad is older than MultiSet tables and there's no reason to add that feature as long as there's Mload. Dieter
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright 2016 - All Rights Reserved | |||||||||||||||||||||||||||||||||||||||||||||||||||
Last Modified: 15 Jun 2023 | |||||||||||||||||||||||||||||||||||||||||||||||||||