- How to prove this integral is less than or equal to zero
- Find the population mean from sampling distributions
- Derivation in 2.50 Jaynes Probability
- Number of bijections encoding same cardinality of sets in ZFC models of size aleph-1
- Decoding Equivalence Relations
- Find the probability density function for the function $E = d^2$
- Countable set of truth assignments satisfying set of well formed formulas
- What is the relationship between the vector fields defined by $\dot{x}=Ax$ and $\dot{x}=kAx$
- In the group $\mathbb {Z}_n $ prove that $ < [k]_n> = < [d]_n>$ where d is the gcd (k,n)
- Is $f$ a constant on $[0,1]$?
- What will be the radius?
- Determine 4 solutions of a complex polynomial
- Find probability adjacent components are not both of same type
- Sum that is not direct
- Time derivative of Local time
- A question about a sequence is convergent or divergent
- Finding the cumulative distribution function in the graph
- determine the number of r-combinations of the multiset
- Equality constraints represented in Semidefinite Programming (LMI)
- Subtract Two Negative Binary Numbers

# Notion of cluster centers and cluster comparison in Density Based Algorithms

I have done some research on clustering algorithms since for my goal is to cluster noisy data and identify outliers or small clusters as anomalies. I consider my data noisy because of my main feautures can have quite varying values. Therefore, my focus has been on density based algorithms with quite some success.

However, I am unable to grasp the idea of cluster comparison in such algorithms since the notion of cluster centers cannot be properly defined.

My dataset constists of network flows and I split the dataset in subsets based on an identifier. After applying clustering on each subset I want to be able to compare the clusters that are created on each subset so that I can compare the subsets themselves in some context.

Would appreciate some help from data scientist gurus on how to approach the concept of cluster comparison or cluster center in such algorithms.

Thanks all!