Archives of the TeradataForum
Message Posted: Wed, 20 Aug 2002 @ 17:33:45 GMT
Hi all, I have a question on PI and statistics.
From a small table 'orig_tbl' (ca. 700,000 rows)
with 3 columns col1, col2 and col3,
I created two new tables with different PIs as follows:
create table tblA as (select * from orig_tbl) with data
primary index (col1);
collect statistics on tblA column col3;
create table tblB as (select * from orig_tbl) with data
primary index (col2);
collect statistics on tblB column col3;
Then I did:
explain select * from tblA where col3 = valueX;
My Teradata V2R4.1 system said:
[snip]
3) We do an all-AMPs RETRIEVE step from tblA by way
of an all-rows scan with a condition of (tblA.col3 = valueX)
into Spool 1, which is built locally on the AMPs.
The size of Spool 1 is estimated with high confidence to be 1 row.
[snip]
But when I ran the same EXPLAIN on tblB, it claimed that the size of Spool 1 is estimated to be 4,855 rows.
The difference may come from the fact that col2 has 700,000 distinct values (almost unique) while col1 has only 150 distinct values, but I can't work out what is actually happening.
Could someone please explain why?
Regards.
Hiroshi Takenaka
Copyright 2016 - All Rights Reserved
Last Modified: 15 Jun 2023