Archives of the TeradataForum

Message Posted: Sat, 14 Jan 2012 @ 22:41:21 GMT



Subj:		Solutions to Data Skewing

From:		Anomy Anom

<-- Anonymously Posted: Saturday, January 14, 2012 17:21 -->

Folks,

I wondered what solutions might exist for 'real' data that is skewed.

Consider this data distribution:

     Value       COUNT(*)
     2323232     12 million
     3545        120K
     34349       450K
     3434        45K
     etc.

If the skew were caused by NULLs and 'default data' values, we could perhaps try a random-number solution on the join column to even out the distribution.
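For readers unfamiliar with the random-number idea, here is a rough Python simulation of it, not Teradata code. It assumes 8 AMPs, uses CRC32 as a stand-in for the real row hash, and assumes genuine keys are always positive so that a negative surrogate can never match a real row in the join. The names `amp_for`, `salt_nulls`, and `rows_per_amp` are made up for this sketch.

```python
import random
import zlib
from collections import Counter

NUM_AMPS = 8  # hypothetical AMP count, chosen only for illustration


def amp_for(key):
    # Stand-in for row hashing; this is NOT Teradata's actual hash function.
    return zlib.crc32(repr(key).encode()) % NUM_AMPS


def salt_nulls(keys, rng):
    # Replace each NULL (None) with a random negative surrogate value.
    # Assumption: real keys are positive, so surrogates never match a real key.
    return [k if k is not None else -rng.randint(1, 1_000_000) for k in keys]


def rows_per_amp(keys):
    # Count how many rows land on each AMP under the stand-in hash.
    counts = Counter(amp_for(k) for k in keys)
    return [counts.get(amp, 0) for amp in range(NUM_AMPS)]


rng = random.Random(42)
# Made-up sample: 8000 NULL join keys plus 2000 distinct real keys.
keys = [None] * 8000 + list(range(1, 2001))

before = rows_per_amp(keys)
after = rows_per_amp(salt_nulls(keys, rng))

print("max rows on one AMP before salting:", max(before))
print("max rows on one AMP after salting: ", max(after))
```

Before salting, all NULL rows hash to a single AMP; afterwards they spread out. This only works because the NULL rows are not expected to match anything in the join.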

What about 'real' data (the same column is both joined on and selected) whose 'lumpy' distribution makes the row hashes run amok when the data gets hashed, skewing CPU, increasing Impact values, and degrading performance?
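To put a number on the problem, the following Python sketch hashes the four values listed above onto a hypothetical 100-AMP system and compares the busiest AMP with the average. The AMP count and the CRC32 stand-in hash are assumptions, and the "etc" rows are left out because their counts are not given.

```python
import zlib

NUM_AMPS = 100  # hypothetical AMP count, not taken from the post

# Only the four value/count pairs shown in the post; the "etc" rows are unknown.
value_counts = {
    2323232: 12_000_000,
    3545: 120_000,
    34349: 450_000,
    3434: 45_000,
}


def amp_for(value):
    # Stand-in for row hashing; this is NOT Teradata's actual hash function.
    return zlib.crc32(str(value).encode()) % NUM_AMPS


# Every row with the same value hashes to the same AMP.
rows = [0] * NUM_AMPS
for value, count in value_counts.items():
    rows[amp_for(value)] += count

total = sum(rows)
average = total / NUM_AMPS
skew_ratio = max(rows) / average

print("rows on busiest AMP:", max(rows))
print("average rows per AMP:", average)
print("busiest / average:", round(skew_ratio, 1))
```

Whatever hash is used, the 12 million rows for value 2323232 all land on one AMP, so that AMP does most of the work for the join while the others sit nearly idle.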

Thank You



Last Modified: 15 Jun 2023