- vim line count of current paragraph
- Reducing the impact dealt on hardware - hypothetical
- Change material properties of elements in named selection in Ansys workbench using APDL commands
- Difference between Maximum Output and Potential Output
- Models of Financial Frictions
- Partial derivative help for sigma and pi notation in Lagrange maximization
- Timeline of mathematical foundation?
- Can not set keyboard macro name using M-x that is tied to smex
- Extract phone numbers from multiple org files
- Spacemacs: Layout specific python processes?
- Use sudo while editing over SSH?
- Could more Earth-like planets exist in our Sun's “goldilocks” zone?
- Maximum habitability of a planet with no indigenous life
- What are ways I could design a viable split jaw?
- A mystifying grid
- Which one is the last tile?
- Random Forest Optimization
- Parameterization regression of rotation angle
- Joomla 3.8 // New Router & Nested View
- Are coral reefs a Biochemical Sedimentary rock?
Notion of cluster centers and cluster comparison in Density Based Algorithms
I have done some research on clustering algorithms since for my goal is to cluster noisy data and identify outliers or small clusters as anomalies. I consider my data noisy because of my main feautures can have quite varying values. Therefore, my focus has been on density based algorithms with quite some success.
However, I am unable to grasp the idea of cluster comparison in such algorithms since the notion of cluster centers cannot be properly defined.
My dataset constists of network flows and I split the dataset in subsets based on an identifier. After applying clustering on each subset I want to be able to compare the clusters that are created on each subset so that I can compare the subsets themselves in some context.
Would appreciate some help from data scientist gurus on how to approach the concept of cluster comparison or cluster center in such algorithms.