I mean distance or cosine similarity between vectors. I want to use k-means for clustering.
Sara
"Nasser M. Abbasi" <nma@12000.org> wrote in message <k84aaj$ikb$1@speranza.aioe.org>...
> On 11/15/2012 8:47 PM, sara wrote:
> > Hi,
> >
> > I converted the text to frequency vectors. The thing is all the vectors have different sizes.
> >Could you help me how I can calculate the similarity between two vector with different sizes.
> >I need them to cluster my data.
> > Thanks
> >
>
> not clear what you mean by `similarity'. In which sense?
>
> norm? correlation? What is the measure of similarity you want?
>
> --Nasser
Sara
"Nasser M. Abbasi" <nma@12000.org> wrote in message <k84aaj$ikb$1@speranza.aioe.org>...
> On 11/15/2012 8:47 PM, sara wrote:
> > Hi,
> >
> > I converted the text to frequency vectors. The thing is all the vectors have different sizes.
> >Could you help me how I can calculate the similarity between two vector with different sizes.
> >I need them to cluster my data.
> > Thanks
> >
>
> not clear what you mean by `similarity'. In which sense?
>
> norm? correlation? What is the measure of similarity you want?
>
> --Nasser