Image Dataset Reporter
Bases: BaseDatasetReporter
Implementation of BaseDatasetReporter for Computer Vision datasets.
This class orchestrates the complete reporting pipeline. It aggregates geometric and pixel-level features to generate structured console logs, spatial heatmaps, correlation matrices, and UMAP manifold projections.
Source code in tools/stats/dataset_reporter/image_reporter.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 | |
generate_visual_report(df, features, destination)
Orchestrates the visual analytics pipeline to create technical plots.
The pipeline includes
- Class distribution bar charts.
- Geometric analysis (Area boxplots and Aspect Ratio violin plots).
- Dataset Bias Matrix (Correlation between classes and features).
- Per-class Spatial Density heatmaps (3x3 grid).
- Per-class internal feature correlation matrices.
- Global UMAP manifold projection.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
df
|
DataFrame
|
The extracted feature matrix. |
required |
features
|
List[str]
|
List of numeric columns for correlation and manifold analysis. |
required |
destination
|
Union[Path, PdfPages]
|
Output target (directory or PDF document). |
required |
Source code in tools/stats/dataset_reporter/image_reporter.py
20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 | |
show_console_report(df, target_format)
Aggregates dataset statistics and prints a structured technical summary.
Provides insights into object density, image quality metrics, and detailed per-class geometry and spatial bias.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
df
|
DataFrame
|
The extracted feature matrix. |
required |
target_format
|
str
|
The annotation format (e.g., 'yolo', 'voc'). |
required |
Source code in tools/stats/dataset_reporter/image_reporter.py
90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 | |