VOC Stats
Bases: BaseStats
Concrete analyzer for datasets in Pascal VOC (XML) format.
This class implements the processing logic for XML annotation files. It coordinates data reading, geometric feature extraction, and pixel-level image analysis to build a comprehensive feature matrix.
Source code in tools/stats/voc_stats.py
14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 | |
get_umap_features(df)
staticmethod
Selects relevant numeric columns for dimensionality reduction (UMAP).
Filters out metadata (paths, timestamps), categorical data, and outlier flags to ensure UMAP focuses on geometric and content-based manifold analysis.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
df
|
DataFrame
|
The complete feature matrix. |
required |
Returns:
| Type | Description |
|---|---|
List[str]
|
List[str]: A list of numeric column names suitable for projection. |
Source code in tools/stats/voc_stats.py
24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 | |