The format only says something about how the data is to be displayed, not how it is stored. In this case the formats are the defaults for the different storage types: FEDRFNDX is stored as an int
, while FEDTAXX is stored as a long
. You can find out more about the differences by typing in Stata help data_types
.
My guess would be that
either both can safely be stored as
int
without loss of informationor FEDRFNDX only has integer values less than 32,740, which means it does not use the full 8 digits that the codebook reserved for it, while FEDTAXX uses integer numbers larger than 32,740. 32,740 is the largest number that can be stored in a (2 byte)
int
, while 2,147,483,620 is the limit for a (4 byte)long
.
A safe way to check which of these is true is to type compress
after loading your dataset. This will change the storage type of each variable to the lowest form possible without loss of information. So, if my first guess is true, it will change the storage type of FEDTAXX to int
, while if my second guess is true it will leave the storage type unchanged.
After that it is always a good idea to just type tab FEDTAXX
and look at the values. I like the user-written command fre
for that, as it displays both the values and the value labels. You can get that by typing in Stata ssc install fre
.