doc_spread

doc_spread(bucket_ids, doc_ids)

Per-document chunk distribution analysis.

Documents whose chunks are spread across many buckets (low concentration, high spread) act as content-level bridges: by their structure, they span multiple semantic regions.

Parameters

| Name | Type | Description | Default |
|------|------|-------------|---------|
| bucket_ids | | Array-like of bucket assignments per point. | required |
| doc_ids | | Array-like of document identifiers per point (same length). | required |

Returns

| Name | Type | Description |
|------|------|-------------|
| | dict | Dict mapping doc_id to DocSpread. |
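
The idea behind the per-document analysis can be illustrated with a small standalone sketch. This is not the library's implementation: `doc_spread_sketch` and `SpreadSketch` are hypothetical names, and the fields shown (chunk count, distinct bucket count, and concentration taken as the share of chunks in the most common bucket) are assumptions about what a DocSpread-like result might carry. The actual DocSpread fields and the concentration measure may differ.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass
class SpreadSketch:
    """Hypothetical stand-in for DocSpread; the real fields may differ."""
    n_chunks: int         # number of points (chunks) belonging to the document
    n_buckets: int        # number of distinct buckets those chunks fall into
    concentration: float  # assumed measure: share of chunks in the most common bucket


def doc_spread_sketch(bucket_ids, doc_ids):
    """Group bucket assignments by document and summarize each distribution."""
    if len(bucket_ids) != len(doc_ids):
        raise ValueError("bucket_ids and doc_ids must have the same length")

    # Count how many chunks of each document land in each bucket.
    per_doc = defaultdict(Counter)
    for bucket, doc in zip(bucket_ids, doc_ids):
        per_doc[doc][bucket] += 1

    result = {}
    for doc, counts in per_doc.items():
        total = sum(counts.values())
        result[doc] = SpreadSketch(
            n_chunks=total,
            n_buckets=len(counts),
            concentration=max(counts.values()) / total,
        )
    return result


# Document "a" sits entirely in one bucket; document "b" is spread over three.
spread = doc_spread_sketch(
    bucket_ids=[0, 0, 0, 1, 2, 3],
    doc_ids=["a", "a", "a", "b", "b", "b"],
)
for doc, info in spread.items():
    print(doc, info)
```

Under this assumed measure, a document such as "b" has low concentration and would be the kind of content-level bridge described above, while "a" is fully concentrated in a single bucket.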