- posted and e-mailed.
> The data cluster at the lower bound of the scale (zero) and decay to
> the right. I tried Box-Cox; the zeros still cluster at the far left,
> while the other points become kinda symmetric. The transformed
> distribution looks like this:
> | *
> | * * *
> | * * *** *
> | * ***** ****
> | * * * * * ** ** * * **
> My question is: is there a way to normalize it? How? Thanks.
The appearance of the zeros at the left, followed by
a symmetrical curve, suggests to me that the X scores
fall into two parts.
You can try to add information to get a new X. That
assumes that the X you have is proving to be an
inadequate surrogate for whatever it is intended to measure.
You can split the sample into two parts for analyses; that
is easier to justify if there is natural logic in making two parts.
That also seems to be the way you are going to provide a
concise summary of what you see, above, with X and Y.
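
For what it is worth, here is a rough sketch of the two-part summary in
Python. It is only an illustration under assumptions of mine: the scores
are made up, the name two_part_summary is hypothetical, and the log
(which is Box-Cox with lambda = 0) stands in for whatever Box-Cox
exponent you actually fitted to the non-zero part.

```python
import math
import statistics

def two_part_summary(x):
    """Summarize scores as two parts: the zeros, and the positive values."""
    zeros = [v for v in x if v == 0]
    positives = [v for v in x if v > 0]
    # Log transform = Box-Cox with lambda = 0; an assumed stand-in here.
    logs = [math.log(v) for v in positives]
    return {
        "n": len(x),
        "prop_zero": len(zeros) / len(x),       # part 1: how many zeros
        "log_mean": statistics.mean(logs),      # part 2: center of the rest
        "log_sd": statistics.stdev(logs),       # part 2: spread of the rest
    }

# Made-up scores: a cluster at zero plus a right-skewed set of positives.
x = [0, 0, 0, 0, 1, 2, 2, 3, 4, 8, 16]
summary = two_part_summary(x)
print(summary)
```

The proportion of zeros describes the first part, and the mean and SD of
the transformed positives describe the second; no single transformation
is asked to handle both at once.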
If you have further analyses in mind, you can keep these
X and Y scores as they are, and go on from there. You
don't have the sort of far-outliers that really foul up the
analysis.
What are you trying to do with the X and Y?
"Taxes are the price we pay for civilization." Justice Holmes.