StreamingSampling.jl

StreamingSampling is a Julia-based proof-of-concept implementation of streamed variants of maximum-entropy sampling (UPmaxentropy) and weighted sampling. It is designed to process large datasets stored on disk while keeping RAM usage low. The method first computes first-order inclusion probabilities using a heuristic based on determinantal point processes (DPPs), then feeds these probabilities into classical sampling algorithms to produce diverse samples.
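To illustrate the second stage only (turning inclusion probabilities into a sample), here is a minimal sketch in Python. It is not this package's API and not the UPmaxentropy algorithm: it uses Madow's systematic sampling, a different classical unequal-probability method, chosen because it needs a single pass over the probabilities and so fits the streaming setting. The function name `systematic_sample` and the toy probabilities are hypothetical, and the DPP-based computation of the probabilities is assumed to have happened already.

```python
import random
from typing import Iterable, List


def systematic_sample(probs: Iterable[float], rng: random.Random) -> List[int]:
    """Madow's systematic sampling over a stream of inclusion probabilities.

    Assumes every probability lies in [0, 1]. Unit i is selected with
    probability probs[i]; when the probabilities sum to an integer n,
    the sample contains n units.
    """
    threshold = rng.random()  # single uniform start; later thresholds are +1, +2, ...
    cumulative = 0.0
    selected = []
    for i, p in enumerate(probs):
        cumulative += p
        # Because p <= 1, at most one threshold can fall inside this unit's interval.
        if cumulative > threshold:
            selected.append(i)
            threshold += 1.0
    return selected


# Hypothetical input: 8 units, each with inclusion probability 0.5 (sum = 4).
# A generator is used so the probabilities are never held in memory at once.
probs = (0.5 for _ in range(8))
sample = systematic_sample(probs, random.Random(0))
print(sample)
```

Because the function consumes any iterable, the probabilities could equally be read chunk by chunk from disk; only the selected indices are kept in memory.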

Development documentation MIT license Ask us anything