Venture Capital Risk Scoring

Predict startup investment risk scores using panel data with 20 startups tracked over 18 time periods.

Dataset Exploration

The Venture Capital dataset contains investment data and startup performance metrics over time. Features include burn rate, revenue growth, customer churn, founder experience, and industry risk level.

Panel data structure: 20 startups tracked over 18 time periods (360 total rows, 9 columns).

Fit Data (first 5 rows)

TimeIndividualBurnrateRevenue GrowthCustomer ChurnBurnrate×Rev.GrowthFounder Exp.Industry RiskRisk Score
113522.059771.757124.42
213927.12131057.687124.63
314125.14171030.747132.27
414412.5119550.447136.07
514523.09211039.057131.46

Forecast Data (first 3 rows)

The forecast file covers future periods without the Risk Score target column.

TimeIndividualBurnrateRevenue GrowthCustomer ChurnBurnrate×Rev.GrowthFounder Exp.Industry Risk
1912721.1413570.78103
2012810.8814304.64103
2113214.6718469.44103

Download sample datasets from GitHub

Code Walkthrough

Step 1: Initialize

Python
import pandas as pd
from datfid import DATFIDClient

client = DATFIDClient(token="your_DATFID_token")

Step 2: Fit the Model

Python
url_fit = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Venture_Capital.xlsx"
df = pd.read_excel(url_fit)

result = client.fit_model(
    df=df,
    id_col="Individual",
    time_col="Time",
    y="Risk Score",
    current_features="all",
    filter_by_significance=True
)

Step 3: Forecast

Python
url_forecast = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Venture_Capital_forecast.xlsx"
df_forecast = pd.read_excel(url_forecast)

forecast = client.forecast_model(df_forecast=df_forecast)

Analysis Results (Model Fit)

Formula

Risk Score ~ α1*Intercept + α2*Industry Risk Level + α3*Founder Experience + β1*Burnrate + β2*Revenue Growth + β3*Customer Churn

Alpha Estimates (Time-Invariant)

VariableEstimateT-statInterpretation
Intercept12.3915.54Baseline risk score
Founder Experience-0.2422.83More experienced founders reduce risk score by ~0.24 per year
Industry Risk Level+0.1810.99Not statistically significant (p=0.32)

Beta Estimates (Time-Varying)

VariableEstimateT-statInterpretation
Revenue Growth-0.35214.63Higher revenue growth reduces risk score (strongest dynamic predictor)
Burnrate+0.4615.71Higher burn rate increases risk score by ~0.46 per unit
Customer Churn+0.4225.22Higher churn increases risk by ~0.42 per unit

Model Performance

0.847
R² Overall
0.944
R² Between
0.833
R² Within
3.89
MAE
22.93
MSE

Try it yourself: Select "Venture Capital" in the Free Playground.