LauNaMu's RetroPGF3 voting methodology

The review process of RetroPGF3 was a titanic task, not only because of the sheer number of projects but also because of the format and tooling provided at the onset of the process.

My process entailed several steps, collaborating with different people in the ecosystem, from fellow Badgeholders (Badgies) to contributors to the Optimism Collective, and drawing on diverse information sources through a custom-made GPT agent I trained: GPT-Sunny.

TL;DR

LauNaMu’s steps to evaluation design:

  1. Recategorize and subcategorize all projects

  2. Select evaluation criteria per category

  3. Bounce ideas off GPT-Sunny to determine level definitions

  4. Determine weights for each criterion with support from GPT-Sunny, fine-tune based on personal criteria

  5. Bucket creation based on points received from criteria

  6. Reward specific behavior - yes/no fields

  7. Set up rails: 2 fundamental constraints

LauNaMu’s steps to evaluation:

  1. Decide if the project has generated impact for the Collective: Yes/No

  2. Decide if I will vote on this project: Yes/No

  3. Evaluate based on Category-criteria: Low, Mid, High

  4. Evaluate for specific attributes: Yes/No

  5. Add comments per project

Getting started

To make reviewing the volume of projects easier and reduce the cognitive load of shifting from one type of contribution to another, I used the re-categorization of the projects generated for the List-making workshops, available here. Big shoutout to @Billy191 from RetroPGFHub and to the Retrolist.app team for incorporating this categorization into their websites. These tools made exploring projects much easier.

I primarily used the Retrolist.app site for project discovery and evaluation. Having non-overlapping sub-categories enabled me to review smaller chunks of similar contributions. There was room for improvement in the sub-categories, but this split made it much easier for me to get closer to comparing apples to apples.

Projects within a subcategory in Retrolist.app

The Categories

I created one evaluation framework per category based on the Impact Evaluation Framework I generated, which was available for all Badgeholders to use in the Badgeholder Hub. I used the “relevant terms” as evaluation criteria. Each of these criteria had three possible levels: Low, Mid, and High.

For example, for Collective Governance I had:

  1. Impact: Low, Mid, High

  2. Accessibility: Low, Mid, High

  3. Participation: Low, Mid, High

  4. Understandability: Low, Mid, High

  5. Accountability: Low, Mid, High

  6. Capture Resistance: Low, Mid, High
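As a minimal sketch (not my actual tooling), the per-category frameworks can be thought of as a simple mapping from category to its criteria, with each criterion rated Low, Mid, or High; the Collective Governance criteria below are the ones listed above, while other categories would carry their own "relevant terms":

```python
# Minimal sketch of one evaluation framework per category. The Collective
# Governance criteria are the ones listed above; other categories would use
# their own "relevant terms" from the Impact Evaluation Framework.
LEVELS = ("Low", "Mid", "High")

FRAMEWORKS = {
    "Collective Governance": [
        "Impact",
        "Accessibility",
        "Participation",
        "Understandability",
        "Accountability",
        "Capture Resistance",
    ],
    # "OP Stack": [...], "Developer Ecosystem": [...], etc.
}

def blank_scorecard(category: str) -> dict:
    """Empty Low/Mid/High scorecard for every criterion in a category."""
    return {criterion: None for criterion in FRAMEWORKS[category]}
```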

GPT-Sunny

To enhance my workflow and effectively integrate the Impact Evaluation Framework and the Metrics Garden, I trained a specialized GPT for Impact Evaluation in the Optimism Ecosystem: GPT-Sunny. We worked together to:

  1. Create the segmentation of what would be Low, Mid, or High within each of the evaluation criteria per subcategory.

    Example of segmentation in OP Stack:

    0 for no identifiable impact; Low for minor enhancements or indirect benefits; Mid for notable improvements or integrations; High for significant, direct contributions to the OP Stack.

  2. Create the weight each criterion would have in my evaluation, based on the relative importance I chose and on information from the Impact Evaluation Framework about Ecosystem context and how each item relates to the definition of Impact in the category (see the illustrative sketch below).

    Example of weighting in Collective Governance:

    • Impact: 27.7% (impact was included in all categories)

    • Accessibility: 25%

    • Participation: 20.45%

    • Understandability: 13.64%

    • Accountability: 6.82%

    • Capture Resistance: 6.82%

GPT-Sunny
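To illustrate how percentage weights like these can be produced, one simple approach is to assign each criterion a raw relative-importance score and normalize the scores so they sum to 100%. The raw numbers below are hypothetical, not the ones I settled on with GPT-Sunny:

```python
# Hypothetical raw relative-importance scores; normalizing them yields
# percentage weights of the kind shown in the Collective Governance example.
relative_importance = {
    "Impact": 12,
    "Accessibility": 11,
    "Participation": 9,
    "Understandability": 6,
    "Accountability": 3,
    "Capture Resistance": 3,
}

total = sum(relative_importance.values())
weights = {c: round(100 * v / total, 2) for c, v in relative_importance.items()}
# -> {'Impact': 27.27, 'Accessibility': 25.0, 'Participation': 20.45, ...}
```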

Buckets

After determining the weights, I simplified the process using a point-based system aligned with the weighted criteria. I used the points to place evaluated projects into buckets. Projects falling within the same bucket, defined by a specific range of points, would be allocated an equal amount of tokens.

I calculated the number of points each level within each criterion would be awarded so that if a project were to score High in all criteria it would receive 100 points.

Example for Collective Governance:

  • A project with a "Low" Impact rating receives a score of 4.17 points.

  • A project with a "Mid" Impact rating receives a score of 12.50 points (4.17 for Low + 8.33 for Mid).

  • A project with a "High" Impact rating receives a score of 25 points (4.17 for Low + 8.33 for Mid + 12.50 for High).

Turning levels to buckets
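A compact sketch of how ratings turn into points and buckets follows. The criterion maxima and bucket boundaries here are illustrative, not my exact figures; they sum to 100 points for an all-High project, and the Impact example above follows the same cumulative 1/6, 3/6, 6/6 split per level (assuming a 25-point maximum for Impact):

```python
# Sketch: convert Low/Mid/High ratings into points, then map points to buckets.
# Criterion maxima are illustrative and sum to 100, so an all-High project
# scores 100 points, matching the scheme described above.
CRITERION_MAX = {
    "Impact": 25.0,
    "Accessibility": 25.0,
    "Participation": 20.0,
    "Understandability": 14.0,
    "Accountability": 8.0,
    "Capture Resistance": 8.0,
}

# Cumulative level split: Low = 1/6, Mid = 1/6 + 2/6, High = 1/6 + 2/6 + 3/6.
LEVEL_FRACTION = {"Low": 1 / 6, "Mid": 3 / 6, "High": 1.0}

# Illustrative bucket lower bounds, in points.
BUCKETS = [(75, "Bucket 4"), (50, "Bucket 3"), (25, "Bucket 2"), (0, "Bucket 1")]

def project_points(ratings: dict) -> float:
    """Total points for a project given a Low/Mid/High rating per criterion."""
    return sum(CRITERION_MAX[c] * LEVEL_FRACTION[r] for c, r in ratings.items())

def project_bucket(points: float) -> str:
    """Place a point total into the highest bucket whose lower bound it reaches."""
    return next(name for lower, name in BUCKETS if points >= lower)

# Example: High on Impact, Mid on everything else -> 62.5 points, Bucket 3.
example = {"Impact": "High", "Accessibility": "Mid", "Participation": "Mid",
           "Understandability": "Mid", "Accountability": "Mid",
           "Capture Resistance": "Mid"}
print(project_points(example), project_bucket(project_points(example)))
```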

Incentivize specific behaviours

Across all categories, there were specific behaviors/attributes that I rewarded because I believe they contribute to a more resilient Ecosystem. These were binary criteria (yes/no).

Each criterion was assigned a fixed amount of OP tokens that the project would be allocated on top of its point-based allocation. This means projects that did not fulfill a criterion weren't affected in their baseline allocation, and those that did were better off.

  1. Does the contribution promote a Superchain future/promotion of multiple OP Chains?

  2. Is this an innovative use case built on an OP Chain?

  3. Are they OG Optimism builders? (Did they receive RetroPGF previously, or have they been building in the Optimism Ecosystem for more than one year?)

  4. Did they use Optimism-specific impact metrics in their application?

  5. Can I personally attest to their impact on the Optimism Ecosystem?

  6. Do they have a geographical regional focus?

  7. Are they women-led projects?

  8. Is this a contribution by an individual?
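A rough sketch of how these yes/no attributes can stack fixed bonuses on top of the bucket-based baseline follows; the bonus amounts below are hypothetical placeholders, not the figures I used:

```python
# Hypothetical fixed OP bonuses per binary attribute. Attributes a project
# fulfills are added on top of its bucket-based baseline allocation; unmet
# attributes simply add nothing.
ATTRIBUTE_BONUS_OP = {
    "superchain_future": 2_000,
    "innovative_op_chain_use_case": 2_000,
    "og_optimism_builder": 1_500,
    "optimism_specific_impact_metrics": 1_000,
    "personally_attested_impact": 1_000,
    "regional_focus": 1_000,
    "women_led": 1_000,
    "individual_contribution": 500,
}

def total_allocation(baseline_op: float, attributes: dict) -> float:
    """Baseline (bucket-based) allocation plus the fixed bonus for each attribute met."""
    bonus = sum(ATTRIBUTE_BONUS_OP[a] for a, met in attributes.items() if met)
    return baseline_op + bonus
```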

Token allocation

I assigned a pre-set amount of tokens per Category based on what I considered contributed most to the sustainability of RetroPGF, and knowing there was an inclination towards technical contributions from other Badgeholders given their backgrounds:

OP Stack: 6 million OP

Collective Governance: 4 million OP

Developer Ecosystem: 8 million OP

User Experience + Adoption: 7 million OP

These amounts don’t add up to 30 million as I was allocating additional funds based on the specific behaviors mentioned above.

These amounts were then split between the subcategories, and after the evaluation of projects, token allocations were assigned based on the number of projects per bucket in each subcategory. Projects with a higher score, and therefore in a higher-level bucket, were allocated higher OP token amounts.
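A simplified sketch of that split: a category budget is divided across subcategories, and each subcategory's share is then divided across buckets according to how many projects landed in each one, with higher buckets receiving a larger per-project amount. The subcategory names, shares, bucket counts, and multipliers below are illustrative, not my actual numbers:

```python
# Illustrative sketch: split a category budget across subcategories, then give
# each bucket a per-project amount that grows with the bucket level.
def allocate_category(budget_op, subcategory_share, projects_per_bucket,
                      bucket_multiplier):
    """Return the OP amount per project for each (subcategory, bucket)."""
    per_project = {}
    for sub, share in subcategory_share.items():
        sub_budget = budget_op * share
        counts = projects_per_bucket[sub]
        # One "unit" is the per-project amount in the lowest bucket; higher
        # buckets receive a multiple of that unit.
        units = sum(counts[b] * bucket_multiplier[b] for b in counts)
        unit = sub_budget / units if units else 0.0
        per_project[sub] = {b: unit * bucket_multiplier[b] for b in counts}
    return per_project

# Hypothetical example: 4M OP for a category with two subcategories.
allocation = allocate_category(
    4_000_000,
    {"Subcategory A": 0.5, "Subcategory B": 0.5},
    {"Subcategory A": {"B1": 10, "B2": 6, "B3": 3, "B4": 1},
     "Subcategory B": {"B1": 8, "B2": 5, "B3": 4, "B4": 2}},
    {"B1": 1, "B2": 2, "B3": 4, "B4": 8},
)
```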

Wrapping up!

I revisited the final amounts and made tweaks and corrections where I felt they were needed, and then subtracted previous funding received by projects where I considered it necessary.

For example: some projects have received funding from the Governance Fund, but that funding is locked for a year, so I only discounted a percentage of the allocation to account for the fact that these funds cannot yet be accessed by the projects. Reported revenue was discounted in full.
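As a small sketch of that adjustment (the 50% discount rate for locked grants below is a hypothetical placeholder, not the percentage I actually applied):

```python
# Sketch of the final adjustment: subtract prior funding from an allocation.
# Locked Governance Fund grants are only partially counted against the
# allocation (the 50% rate is hypothetical); reported revenue counts in full.
LOCKED_GRANT_DISCOUNT_RATE = 0.5  # hypothetical placeholder

def adjusted_allocation(allocation_op, locked_governance_grant_op=0.0,
                        reported_revenue_op=0.0):
    """Reduce an allocation by partially counted locked grants plus full revenue."""
    deduction = (locked_governance_grant_op * LOCKED_GRANT_DISCOUNT_RATE
                 + reported_revenue_op)
    return max(allocation_op - deduction, 0.0)
```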

Allocations for projects that were included in Lists created by Badgeholders whose expertise I trusted, and that included a clear methodology, were also reviewed against my own allocations.

This process was incredibly interesting and provided a great hands-on learning ground for Impact Evaluation. Having specific definitions of impact, and knowing the perceived main barriers in each category as informed by the Impact Evaluation Framework, contributed vastly to reducing bias. It opened up opportunities for me to discover numerous new projects and contributions within the Optimism Ecosystem which I might not have encountered otherwise.

I'm excited to apply the insights gained in the next iteration, employing a revised methodology that more systematically incorporates new data points. This approach will be informed by improved Impact Metrics provided by contributors.

There are more learnings and thoughts on improvements to come from my end.

🔴✨I’m Optimistic that we are headed to a more data-driven and systematic reporting and evaluation of the Impact generated by contributors 🔴✨
