I specialize in exploratory data, ML and AI projects. Past clients include Google, NASA, and the UK, HK governments on topics such as infrastructure, cybersecurity, robotics, and decentralized finance. If you're facing high-stakes challenges or need insights from messy data, let’s connect.
I specialize in exploratory data/ML/AI projects, working with clients like Google, NASA, and the UK/HK governments across infrastructure, cybersecurity, robotics, and decentralized finance. If you're facing high-stakes challenges or need insights from messy data, let’s connect.
OFFERING: Algorithms, crypto, finance, hardware, and AI. Past recent projects on forecasting for infrastructure projects, on-chain zero-knowledge proofs, LLMs for NL2SQL, and AI for mechatronics. Just fire me an email at michele.dallachiesa@sigforge.com
Hi! OP here, addressing one more question I received somewhere else:
3) “Can it also track the model's state during training if, e.g., there is an early stop, and then I want to continue the training process?”
With MLtraq, You can dump and load arbitrary objects, including model weights and other state parameters. Let's consider the example https://mltraq.com/howto/02-artifacts-storage/. MLtraq dumps and reloads from the filesystem the binary blobs referenced in the tracked metadata. Similarly, you can store artifacts in third-party services and data stores.
Author here, happy to answer any questions! Accompanying description for the video:
Together with Oxford Global Projects, we have built a family of forecasting models for S-curves using data from a total of 2,700 years of combined construction activity, with an aggregate cash flow of USD 60bn.
The S-curve in project management is a graphical representation that illustrates the cumulative progress of a project over time. It is called an "S-curve" because its characteristic shape resembles the letter "S": It starts slowly, accelerates, and then levels off.
Project delays and budget overruns are often linked with anomalies within the expenditure profile, like a sluggish burn rate or unexpectedly high spending towards anticipated project completion. Timely identification of these anomalies empowers proactive intervention to realign projects on the path to success.
This short video shows how we're modelling expenditure curves to enable many use cases, including spending projections, cost overruns and underruns, outlier analysis, and more.
OFFERING: Data Products & AI Consulting (Freelance). I have a strong track record in building reliable data pipelines, dashboards and end-to-end AI solutions. CV and references are available upon request.
RECENT PROJECTS: Empowering Researchers with Personalized Recommendations, Accelerating Public Consultations with Large Language Models (LLMs), and Guarding High-Risk Large Infrastructure Projects with an Early Warning System.
Author here, happy to answer any questions! Accompanying description for the video:
Which repositories show the fastest growth? Are there any notable patterns worth highlighting? Let’s find out! The analysis considers 158 public GitHub code repositories at OpenAI, Anthropic and Cohere, created since August 2014, with an aggregate of 12k commits.
The S-curve in project management is a graphical representation that illustrates the cumulative progress of a project over time. It is called an "S-curve" because its characteristic shape resembles the letter "S": It starts slowly, accelerates, and then levels off.
The cumulative number of code commits over time can be used as raw data to model development progress “cost” with S-curves. Similar results can be obtained with the count of distinct authors (harder to control) and the count of modified files.
The animation illustrates the progression of commits over time, with normalisation applied to both axes. Each frame captures a snapshot of the repositories at a specific moment in time. A combination of the fastest and slowest repositories is highlighted with colors and labels. Quiet projects cluster in the top-left corner, and accelerating projects are found in the bottom-right area.
Over time, patterns tend to stabilise. Projects with synchronised acceleration can be attributed to coordinated commits from private repositories. The Python APIs for OpenAI, Anthropic, and Cohere stand out as some of the most active repositories, with OpenAI taking a prominent role in the Node.js / Typescript API development. 5 out of 8 of the most active repositories belong to OpenAI.
S-curves in software development are well-equipped to run simulations, comparative performance analyses, identify project delays and anomalies, and optimise resources in large teams with multiple projects.
OFFERING: Data Products & AI Consulting (Freelance). I have a strong track record in building reliable data pipelines, dashboards and end-to-end AI solutions. CV and references are available upon request.
RECENT PROJECTS: Empowering Researchers with Personalized Recommendations, Accelerating Public Consultations with Large Language Models (LLMs), and Guarding High-Risk Large Infrastructure Projects with an Early Warning System.
Data Products & AI Consulting (Freelance). I work with clients in US and EU on projects lasting 1-12 months. CV and references available upon request.
PAST PROJECTS: Predicting demand for contact center services; Determining the effectiveness of marketing campaigns; Outdoor advertising; Natural language processing; Making predictions and categorizing data using statistical models; Improving traffic flow in urban areas.
I specialize in exploratory data, ML and AI projects. Past clients include Google, NASA, and the UK, HK governments on topics such as infrastructure, cybersecurity, robotics, and decentralized finance. If you're facing high-stakes challenges or need insights from messy data, let’s connect.
LINKEDIN: https://www.linkedin.com/in/dallachiesa/
EMAIL: michele.dallachiesa@sigforge.com