Arctic Embed training code, an open-source training loop for Arctic-Embed-style text embedding models. Includes custom data loading, loss functions, and an in-depth example.
Lodis: Local Dictionary Server, a Python-native dictionary server for cross-process communication backed by shared memory from the operating system.
InvestorsExchange.jl (GitHub), a tool to parse trade-level stock data from the IEX exchange in Julia. There was quite a reaction on HackerNews because automatic recapitalization made the post title comically misinterpretable.
RollingTimeWindows.jl (GitHub), a tool for indexing time-indexed data in fixed-duration periods, even when the number of rows per period is not fixed.
External Links
If you're interested in chatting about ML, quantitative trading, or explainable AI, reach out!