Summary · 20260512T184755Z

Overall

This benchmark evaluated the `so_extraction` agent with 10 models in a zero-shot configuration. Every combination completed without errors, but the quality of the extracted data varied widely. The overall field match rate was 81%, with an average of 4.5 mismatched fields per run, which leaves room for improvement. The high standard deviation of the mismatch count (5.13) points to inconsistent quality from run to run. The `downloaded` dataset stands out: its field match rate was 37.8%, compared with roughly 90% on `acme_foods` and `nova_exports`. The `opus` and `sonnet` models generally extracted more accurately than the `gemini` and `openai` models, although `openai:4.1` (83.1% field match) scored slightly above `sonnet-4-5` (82.5%).

What the numbers mean

Success Rate: The percentage of runs in which the agent and model completed without an error. In this brief, all combos achieved a 100% success rate (shown as 1.0000 in the Success column).
Avg runtime: The average time in seconds the agent took to complete a run (the "Avg elapsed (s)" column).
Avg mismatch/expected run: For runs with a 'golden' expected output, the average number of fields whose extracted values did not match (the "Avg mismatch/expected" column). Lower is better.
Field match rate: The share of all fields that matched the expected output across all runs for that combination, shown in the tables as a fraction (0.8101 = 81.01%). Higher is better.
Mismatch stdev: The standard deviation of the mismatch count across all individual runs. A high value suggests inconsistent quality, with some runs having few errors and others having many.
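
The definitions above can be sketched in a few lines of Python. This is only an illustration: the `RunResult` record and the `summarize` function are hypothetical names, not the harness's actual schema, and the report does not say whether the harness uses the population or the sample standard deviation (the sketch assumes the population form).

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass
class RunResult:
    """One scored run. Hypothetical shape, not the harness's real record."""
    succeeded: bool
    elapsed_s: float
    total_fields: int       # fields in the golden expected output
    mismatched_fields: int  # fields whose extracted value differed


def summarize(runs):
    """Compute the five headline metrics for a list of runs."""
    mismatches = [r.mismatched_fields for r in runs]
    total_fields = sum(r.total_fields for r in runs)
    return {
        "success_rate": mean(1.0 if r.succeeded else 0.0 for r in runs),
        "avg_elapsed_s": mean(r.elapsed_s for r in runs),
        "avg_mismatch": mean(mismatches),
        # Pooled over fields, not averaged per run.
        "field_match_rate": (total_fields - sum(mismatches)) / total_fields,
        # Assumption: population standard deviation.
        "mismatch_stdev": pstdev(mismatches),
    }


# Made-up runs, for illustration only.
runs = [
    RunResult(True, 2.0, 20, 0),
    RunResult(True, 4.0, 20, 2),
    RunResult(True, 6.0, 10, 4),
]
summary = summarize(runs)
print(summary)
```

With these three made-up runs the pooled field match rate is 0.88, while the mean of the per-run rates (1.0, 0.9, 0.6) would be about 0.83. The two definitions diverge whenever runs have different field counts, so they should not be mixed when comparing rows.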

Leaderboard highlights

All runs were zero-shot, and the `so_extraction` agent's performance varied significantly with the model used. The `sonnet` and `opus` model families led on accuracy: `sonnet-4-6` achieved the highest field match rate (87.6%), closely followed by `opus-4-6` (87.4%), which had the lowest average mismatch count (2.92). The Gemini models were the weakest performers; `gemini:gemini-2.5-flash` had the lowest field match rate (75.3%) and the highest average mismatch count (6.08). On speed, `openai:4.1` was the fastest at an average of 1.8 seconds per run (the next fastest, `openai:5.4`, took 2.4 seconds), while `openai:5-mini` was the slowest at nearly 21 seconds.
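
The highlights above can be re-derived from the leaderboard rows. The sketch below copies the per-model averages from the Model + Strategy Leaderboard table; the `Row` type and the `highlights` dictionary are illustrative names, not part of the harness.

```python
from typing import NamedTuple


class Row(NamedTuple):
    model: str
    avg_elapsed_s: float
    avg_mismatch: float
    field_match: float


# Values copied from the Model + Strategy Leaderboard table.
LEADERBOARD = [
    Row("gemini:gemini-2.5-flash", 13.2774, 6.0784, 0.7526),
    Row("gemini:gemini-2.5-pro", 19.6445, 5.9216, 0.7590),
    Row("openai:4.1", 1.8013, 4.1569, 0.8308),
    Row("openai:5-mini", 20.9493, 5.6275, 0.7667),
    Row("openai:5.2", 3.2435, 5.5294, 0.7749),
    Row("openai:5.4", 2.4232, 5.3529, 0.7798),
    Row("opus-4-5", 5.0071, 2.9412, 0.8708),
    Row("opus-4-6", 5.0857, 2.9216, 0.8742),
    Row("sonnet-4-5", 4.6297, 3.5882, 0.8250),
    Row("sonnet-4-6", 4.4832, 3.0392, 0.8763),
]

highlights = {
    "best_field_match": max(LEADERBOARD, key=lambda r: r.field_match).model,
    "worst_field_match": min(LEADERBOARD, key=lambda r: r.field_match).model,
    "lowest_mismatch": min(LEADERBOARD, key=lambda r: r.avg_mismatch).model,
    "highest_mismatch": max(LEADERBOARD, key=lambda r: r.avg_mismatch).model,
    "fastest": min(LEADERBOARD, key=lambda r: r.avg_elapsed_s).model,
    "slowest": max(LEADERBOARD, key=lambda r: r.avg_elapsed_s).model,
}

for name, model in highlights.items():
    print(f"{name}: {model}")
```

Ranking by field match and ranking by average mismatch do not always agree (for example, `openai:4.1` beats `sonnet-4-5` on field match but not on mismatch count), so it helps to state which column a ranking uses.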

What looks healthy

What needs attention

Agent Harness Run

Run ID: 20260512T184755Z · Generated UTC: 2026-05-12T18:50:55.602978+00:00

Configuration

{
  "agent": "so_extraction",
  "pipeline": null,
  "models": [
    "sonnet-4-6",
    "sonnet-4-5",
    "opus-4-5",
    "opus-4-6",
    "openai:4.1",
    "openai:5.2",
    "openai:5-mini",
    "openai:5.4",
    "gemini:gemini-2.5-pro",
    "gemini:gemini-2.5-flash"
  ],
  "datasets": [
    "downloaded",
    "acme_foods",
    "nova_exports"
  ],
  "chat": null,
  "chats_glob": null,
  "bulk": false,
  "runs_per_chat": 1,
  "max_workers": 25,
  "few_shot_explicit": [],
  "few_shot_sweep": [],
  "few_shot_pool_argv": [
    "/Users/tripathipranav/Documents/code/harness_agents/raw_data/chats/multiple_product_multiple_shipment_medium.json",
    "/Users/tripathipranav/Documents/code/harness_agents/raw_data/chats/single_product_multiple_shipment_medium.json",
    "/Users/tripathipranav/Documents/code/harness_agents/raw_data/chats/single_product_single_shipment_medium.json",
    "/Users/tripathipranav/Documents/code/harness_agents/raw_data/chats/updates/update_change_quantity.json",
    "/Users/tripathipranav/Documents/code/harness_agents/raw_data/downloaded_chats/03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json"
  ],
  "few_shot_seed": 42,
  "db_few_shot_limit": 0,
  "skip_without_expected": true,
  "results_dir": "/Users/tripathipranav/Documents/code/harness_agents/results/20260512T184755Z",
  "config_file": "configs/agents.json",
  "few_shot_pool_size": 5,
  "few_shot_default_pool_size": 68,
  "few_shot_pool_override": [
    "raw_data/chats/multiple_product_multiple_shipment_medium.json",
    "raw_data/chats/single_product_multiple_shipment_medium.json",
    "raw_data/chats/single_product_single_shipment_medium.json",
    "raw_data/chats/updates/update_change_quantity.json",
    "raw_data/downloaded_chats/03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json"
  ],
  "few_shot_variants": [
    {
      "label": "none",
      "count": 0,
      "paths": []
    }
  ],
  "allow_self_fewshot": false
}
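
As a rough cross-check of the configuration against the run counts reported below, the following sketch enumerates the model and few-shot combinations. It is an illustration only: the `config` dictionary is a trimmed copy of the JSON above, and the per-dataset chat counts are an inference (each dataset's run count in the Per-dataset Breakdown divided by the 10 models), not values stored in the config.

```python
from itertools import product

# Trimmed copy of the configuration above (only the keys used here).
config = {
    "models": [
        "sonnet-4-6", "sonnet-4-5", "opus-4-5", "opus-4-6",
        "openai:4.1", "openai:5.2", "openai:5-mini", "openai:5.4",
        "gemini:gemini-2.5-pro", "gemini:gemini-2.5-flash",
    ],
    "few_shot_variants": [{"label": "none", "count": 0, "paths": []}],
    "runs_per_chat": 1,
}

# Assumption: chats per dataset, inferred from the reported run counts
# (210, 90, and 210 runs divided by 10 models).
chats_per_dataset = {"acme_foods": 21, "downloaded": 9, "nova_exports": 21}

# One combination per (model, few-shot variant) pair.
combos = list(product(
    config["models"],
    [variant["label"] for variant in config["few_shot_variants"]],
))

runs_per_combo = sum(chats_per_dataset.values()) * config["runs_per_chat"]
total_runs = len(combos) * runs_per_combo

print(len(combos), runs_per_combo, total_runs)
```

Under these assumptions the sketch yields 10 combinations of 51 runs each, 510 runs in total, which agrees with the Runs columns in the tables that follow.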

Per-agent Totals

| Agent | Runs | Success | Avg attempts | Avg elapsed (s) | Avg mismatch/expected | Field match |
| --- | --- | --- | --- | --- | --- | --- |
| so_extraction | 510 | 1.0000 | 1.0000 | 8.0545 | 4.5157 | 0.8101 |

Model + Strategy Leaderboard

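| Agent | Model | FS count | Runs | Success | Avg attempts | Avg elapsed (s) | Avg mismatch/expected | Field match |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| so_extraction | gemini:gemini-2.5-flash | 0 | 51 | 1.0000 | 1.0000 | 13.2774 | 6.0784 | 0.7526 |
| so_extraction | gemini:gemini-2.5-pro | 0 | 51 | 1.0000 | 1.0000 | 19.6445 | 5.9216 | 0.7590 |
| so_extraction | openai:4.1 | 0 | 51 | 1.0000 | 1.0000 | 1.8013 | 4.1569 | 0.8308 |
| so_extraction | openai:5-mini | 0 | 51 | 1.0000 | 1.0000 | 20.9493 | 5.6275 | 0.7667 |
| so_extraction | openai:5.2 | 0 | 51 | 1.0000 | 1.0000 | 3.2435 | 5.5294 | 0.7749 |
| so_extraction | openai:5.4 | 0 | 51 | 1.0000 | 1.0000 | 2.4232 | 5.3529 | 0.7798 |
| so_extraction | opus-4-5 | 0 | 51 | 1.0000 | 1.0000 | 5.0071 | 2.9412 | 0.8708 |
| so_extraction | opus-4-6 | 0 | 51 | 1.0000 | 1.0000 | 5.0857 | 2.9216 | 0.8742 |
| so_extraction | sonnet-4-5 | 0 | 51 | 1.0000 | 1.0000 | 4.6297 | 3.5882 | 0.8250 |
| so_extraction | sonnet-4-6 | 0 | 51 | 1.0000 | 1.0000 | 4.4832 | 3.0392 | 0.8763 |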

Few-shot Count Rollup

| Agent | FS count | Runs | Success | Avg mismatch/expected | Field match |
| --- | --- | --- | --- | --- | --- |
| so_extraction | 0 | 510 | 1.0000 | 4.5157 | 0.8101 |

Per-dataset Breakdown

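| Agent | Dataset | Runs | Success | Avg elapsed (s) | Avg mismatch/expected | Field match |
| --- | --- | --- | --- | --- | --- | --- |
| so_extraction | acme_foods | 210 | 1.0000 | 8.1793 | 2.4571 | 0.9014 |
| so_extraction | downloaded | 90 | 1.0000 | 9.0965 | 14.4444 | 0.3783 |
| so_extraction | nova_exports | 210 | 1.0000 | 7.4831 | 2.3190 | 0.8986 |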
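
The per-dataset rows can be rolled up to the overall totals with a run-weighted average. The sketch below is a consistency check on the reported numbers, not the harness's own aggregation code; the variable names are illustrative, and the inputs are the rounded values from the table, so the results agree with the Per-agent Totals only to about three decimal places.

```python
# (runs, avg elapsed s, avg mismatch, field match) copied from the table above.
datasets = {
    "acme_foods": (210, 8.1793, 2.4571, 0.9014),
    "downloaded": (90, 9.0965, 14.4444, 0.3783),
    "nova_exports": (210, 7.4831, 2.3190, 0.8986),
}

total_runs = sum(runs for runs, _, _, _ in datasets.values())


def run_weighted(column):
    """Average a per-dataset column, weighting each dataset by its run count."""
    return sum(row[0] * row[column] for row in datasets.values()) / total_runs


avg_elapsed = run_weighted(1)
avg_mismatch = run_weighted(2)
field_match_by_runs = run_weighted(3)

print(total_runs, round(avg_elapsed, 3), round(avg_mismatch, 3),
      round(field_match_by_runs, 3))
```

The run-weighted elapsed time and mismatch count land on the reported overall values (8.0545 s and 4.5157) within rounding. The run-weighted field match comes out near 0.808 rather than the reported 0.8101, which is consistent with field match being pooled over fields rather than averaged over runs. The roll-up also shows how heavily the 90 `downloaded` runs, at 14.4 mismatches each, pull up the overall average.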

Per-chat Breakdown

AgentChatModelFS countRunsSuccessAvg elapsed (s)Avg mismatch/expected
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsongemini:gemini-2.5-flash011.000019.609115.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsongemini:gemini-2.5-pro011.000020.250215.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:4.1011.00003.768317.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:5-mini011.000031.641715.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:5.2011.00005.273115.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:5.4011.00004.633216.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopus-4-5011.00006.785515.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopus-4-6011.00006.611715.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonsonnet-4-5011.00007.382215.0000
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonsonnet-4-6011.00006.096415.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsongemini:gemini-2.5-flash011.000013.484615.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsongemini:gemini-2.5-pro011.000014.009715.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:4.1011.00007.514714.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:5-mini011.000018.188413.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:5.2011.00007.724514.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:5.4011.00005.057314.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopus-4-5011.00006.581714.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopus-4-6011.00006.939514.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonsonnet-4-5011.00006.240814.0000
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonsonnet-4-6011.00006.521515.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsongemini:gemini-2.5-flash011.000020.110920.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsongemini:gemini-2.5-pro011.000032.282720.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:4.1011.00004.466617.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:5-mini011.000023.426618.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:5.2011.00004.092016.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:5.4011.00005.555917.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopus-4-5011.00008.939416.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopus-4-6011.00008.725116.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonsonnet-4-5011.000010.880117.0000
so_extraction03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonsonnet-4-6011.00007.912916.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsongemini:gemini-2.5-flash011.000014.614117.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsongemini:gemini-2.5-pro011.000011.788016.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonopenai:4.1011.00001.763515.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonopenai:5-mini011.000018.036917.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonopenai:5.2011.00003.585416.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonopenai:5.4011.00003.982816.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonopus-4-5011.00004.923015.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonopus-4-6011.00005.025415.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonsonnet-4-5011.00004.973316.0000
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.jsonsonnet-4-6011.00004.355316.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsongemini:gemini-2.5-flash011.000010.331615.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsongemini:gemini-2.5-pro011.000017.963015.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:4.1011.00001.815013.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:5-mini011.000013.447313.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:5.2011.00002.378214.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopenai:5.4011.00002.511514.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopus-4-5011.00005.496014.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonopus-4-6011.00005.026113.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonsonnet-4-5011.00005.059713.0000
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.jsonsonnet-4-6011.00004.335114.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsongemini:gemini-2.5-flash011.00008.389714.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsongemini:gemini-2.5-pro011.000017.829214.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:4.1011.00001.634816.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:5-mini011.000017.902315.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:5.2011.00002.420914.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopenai:5.4011.00002.862915.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopus-4-5011.00005.625714.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonopus-4-6011.00005.362414.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonsonnet-4-5011.00004.536514.0000
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.jsonsonnet-4-6011.00004.713113.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsongemini:gemini-2.5-flash011.000010.996417.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsongemini:gemini-2.5-pro011.000016.912217.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:4.1011.00001.118017.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:5-mini011.000011.088717.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:5.2011.00002.114017.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopenai:5.4011.00002.789717.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopus-4-5011.00003.08601.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonopus-4-6011.00002.93061.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonsonnet-4-5011.00002.86441.0000
so_extraction07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.jsonsonnet-4-6011.00003.883417.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsongemini:gemini-2.5-flash011.000027.189116.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsongemini:gemini-2.5-pro011.000022.998513.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:4.1011.00001.781514.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:5-mini011.000026.468612.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:5.2011.00002.800514.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:5.4011.00003.519115.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopus-4-5011.00004.921714.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopus-4-6011.00005.727012.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonsonnet-4-5011.00005.373319.0000
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonsonnet-4-6011.00004.454318.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsongemini:gemini-2.5-flash011.000032.319213.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsongemini:gemini-2.5-pro011.000023.548811.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:4.1011.00001.709610.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:5-mini011.000027.68299.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:5.2011.00002.486413.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopenai:5.4011.00002.968314.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopus-4-5011.00005.524110.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonopus-4-6011.00006.151810.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonsonnet-4-5011.00005.508817.0000
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.jsonsonnet-4-6011.00004.371416.0000
so_extractionfs_acme_simple.jsongemini:gemini-2.5-flash011.00007.92665.0000
so_extractionfs_acme_simple.jsongemini:gemini-2.5-pro011.000016.01021.0000
so_extractionfs_acme_simple.jsonopenai:4.1011.00001.90052.0000
so_extractionfs_acme_simple.jsonopenai:5-mini011.000022.09838.0000
so_extractionfs_acme_simple.jsonopenai:5.2011.00002.56272.0000
so_extractionfs_acme_simple.jsonopenai:5.4011.00002.52843.0000
so_extractionfs_acme_simple.jsonopus-4-5011.00005.30272.0000
so_extractionfs_acme_simple.jsonopus-4-6011.00005.07202.0000
so_extractionfs_acme_simple.jsonsonnet-4-5011.00005.04571.0000
so_extractionfs_acme_simple.jsonsonnet-4-6011.00004.32570.0000
so_extractionfs_nova_simple.jsongemini:gemini-2.5-flash011.00002.34951.0000
so_extractionfs_nova_simple.jsongemini:gemini-2.5-pro011.00009.41511.0000
so_extractionfs_nova_simple.jsonopenai:4.1011.00001.17211.0000
so_extractionfs_nova_simple.jsonopenai:5-mini011.00006.34581.0000
so_extractionfs_nova_simple.jsonopenai:5.2011.00001.83421.0000
so_extractionfs_nova_simple.jsonopenai:5.4011.00001.63881.0000
so_extractionfs_nova_simple.jsonopus-4-5011.00003.20211.0000
so_extractionfs_nova_simple.jsonopus-4-6011.00002.94811.0000
so_extractionfs_nova_simple.jsonsonnet-4-5011.00002.21021.0000
so_extractionfs_nova_simple.jsonsonnet-4-6011.00002.25711.0000
so_extractiongenerated_acme_foods_001.jsongemini:gemini-2.5-flash011.00007.37996.0000
so_extractiongenerated_acme_foods_001.jsongemini:gemini-2.5-pro011.000017.41385.0000
so_extractiongenerated_acme_foods_001.jsonopenai:4.1011.00001.67542.0000
so_extractiongenerated_acme_foods_001.jsonopenai:5-mini011.000022.80335.0000
so_extractiongenerated_acme_foods_001.jsonopenai:5.2011.00001.98284.0000
so_extractiongenerated_acme_foods_001.jsonopenai:5.4011.00002.198210.0000
so_extractiongenerated_acme_foods_001.jsonopus-4-5011.00004.63992.0000
so_extractiongenerated_acme_foods_001.jsonopus-4-6011.00004.94752.0000
so_extractiongenerated_acme_foods_001.jsonsonnet-4-5011.00004.15433.0000
so_extractiongenerated_acme_foods_001.jsonsonnet-4-6011.00004.48290.0000
so_extractiongenerated_acme_foods_002.jsongemini:gemini-2.5-flash011.000016.61146.0000
so_extractiongenerated_acme_foods_002.jsongemini:gemini-2.5-pro011.000021.61724.0000
so_extractiongenerated_acme_foods_002.jsonopenai:4.1011.00001.66091.0000
so_extractiongenerated_acme_foods_002.jsonopenai:5-mini011.000023.25643.0000
so_extractiongenerated_acme_foods_002.jsonopenai:5.2011.00002.86994.0000
so_extractiongenerated_acme_foods_002.jsonopenai:5.4011.00003.01592.0000
so_extractiongenerated_acme_foods_002.jsonopus-4-5011.00006.90930.0000
so_extractiongenerated_acme_foods_002.jsonopus-4-6011.00006.85960.0000
so_extractiongenerated_acme_foods_002.jsonsonnet-4-5011.00005.83891.0000
so_extractiongenerated_acme_foods_002.jsonsonnet-4-6011.00005.09530.0000
so_extractiongenerated_acme_foods_003.jsongemini:gemini-2.5-flash011.000014.95827.0000
so_extractiongenerated_acme_foods_003.jsongemini:gemini-2.5-pro011.000030.57564.0000
so_extractiongenerated_acme_foods_003.jsonopenai:4.1011.00001.14133.0000
so_extractiongenerated_acme_foods_003.jsonopenai:5-mini011.000020.31552.0000
so_extractiongenerated_acme_foods_003.jsonopenai:5.2011.00002.40393.0000
so_extractiongenerated_acme_foods_003.jsonopenai:5.4011.00001.90783.0000
so_extractiongenerated_acme_foods_003.jsonopus-4-5011.00004.83881.0000
so_extractiongenerated_acme_foods_003.jsonopus-4-6011.00004.71531.0000
so_extractiongenerated_acme_foods_003.jsonsonnet-4-5011.00004.49471.0000
so_extractiongenerated_acme_foods_003.jsonsonnet-4-6011.00004.33610.0000
so_extractiongenerated_acme_foods_004.jsongemini:gemini-2.5-flash011.000012.26845.0000
so_extractiongenerated_acme_foods_004.jsongemini:gemini-2.5-pro011.000026.39294.0000
so_extractiongenerated_acme_foods_004.jsonopenai:4.1011.00001.48631.0000
so_extractiongenerated_acme_foods_004.jsonopenai:5-mini011.000018.58434.0000
so_extractiongenerated_acme_foods_004.jsonopenai:5.2011.00001.93596.0000
so_extractiongenerated_acme_foods_004.jsonopenai:5.4011.00002.33605.0000
so_extractiongenerated_acme_foods_004.jsonopus-4-5011.00004.78821.0000
so_extractiongenerated_acme_foods_004.jsonopus-4-6011.00005.33752.0000
so_extractiongenerated_acme_foods_004.jsonsonnet-4-5011.00004.53084.0000
so_extractiongenerated_acme_foods_004.jsonsonnet-4-6011.00004.15690.0000
so_extractiongenerated_acme_foods_005.jsongemini:gemini-2.5-flash011.000013.10003.0000
so_extractiongenerated_acme_foods_005.jsongemini:gemini-2.5-pro011.000018.92182.0000
so_extractiongenerated_acme_foods_005.jsonopenai:4.1011.00001.81840.0000
so_extractiongenerated_acme_foods_005.jsonopenai:5-mini011.000017.00292.0000
so_extractiongenerated_acme_foods_005.jsonopenai:5.2011.00002.44234.0000
so_extractiongenerated_acme_foods_005.jsonopenai:5.4011.00001.90702.0000
so_extractiongenerated_acme_foods_005.jsonopus-4-5011.00004.47220.0000
so_extractiongenerated_acme_foods_005.jsonopus-4-6011.00004.66450.0000
so_extractiongenerated_acme_foods_005.jsonsonnet-4-5011.00005.05870.0000
so_extractiongenerated_acme_foods_005.jsonsonnet-4-6011.00004.84540.0000
so_extractiongenerated_acme_foods_006.jsongemini:gemini-2.5-flash011.000011.44136.0000
so_extractiongenerated_acme_foods_006.jsongemini:gemini-2.5-pro011.000024.13925.0000
so_extractiongenerated_acme_foods_006.jsonopenai:4.1011.00001.98350.0000
so_extractiongenerated_acme_foods_006.jsonopenai:5-mini011.000018.69072.0000
so_extractiongenerated_acme_foods_006.jsonopenai:5.2011.00003.20162.0000
so_extractiongenerated_acme_foods_006.jsonopenai:5.4011.00002.95712.0000
so_extractiongenerated_acme_foods_006.jsonopus-4-5011.00004.48930.0000
so_extractiongenerated_acme_foods_006.jsonopus-4-6011.00004.28690.0000
so_extractiongenerated_acme_foods_006.jsonsonnet-4-5011.00004.43020.0000
so_extractiongenerated_acme_foods_006.jsonsonnet-4-6011.00004.74800.0000
so_extractiongenerated_acme_foods_007.jsongemini:gemini-2.5-flash011.00008.47282.0000
so_extractiongenerated_acme_foods_007.jsongemini:gemini-2.5-pro011.000019.30673.0000
so_extractiongenerated_acme_foods_007.jsonopenai:4.1011.00001.38790.0000
so_extractiongenerated_acme_foods_007.jsonopenai:5-mini011.000017.11484.0000
so_extractiongenerated_acme_foods_007.jsonopenai:5.2011.00003.00803.0000
so_extractiongenerated_acme_foods_007.jsonopenai:5.4011.00002.18612.0000
so_extractiongenerated_acme_foods_007.jsonopus-4-5011.00004.70550.0000
so_extractiongenerated_acme_foods_007.jsonopus-4-6011.00004.72340.0000
so_extractiongenerated_acme_foods_007.jsonsonnet-4-5011.00004.98690.0000
so_extractiongenerated_acme_foods_007.jsonsonnet-4-6011.00005.46600.0000
so_extractiongenerated_acme_foods_008.jsongemini:gemini-2.5-flash011.000013.09311.0000
so_extractiongenerated_acme_foods_008.jsongemini:gemini-2.5-pro011.000016.68972.0000
so_extractiongenerated_acme_foods_008.jsonopenai:4.1011.00001.23310.0000
so_extractiongenerated_acme_foods_008.jsonopenai:5-mini011.000019.33674.0000
so_extractiongenerated_acme_foods_008.jsonopenai:5.2011.00002.16752.0000
so_extractiongenerated_acme_foods_008.jsonopenai:5.4011.00002.27452.0000
so_extractiongenerated_acme_foods_008.jsonopus-4-5011.00004.75700.0000
so_extractiongenerated_acme_foods_008.jsonopus-4-6011.00004.67740.0000
so_extractiongenerated_acme_foods_008.jsonsonnet-4-5011.00004.75460.0000
so_extractiongenerated_acme_foods_008.jsonsonnet-4-6011.00004.88620.0000
so_extractiongenerated_acme_foods_009.jsongemini:gemini-2.5-flash011.000012.98224.0000
so_extractiongenerated_acme_foods_009.jsongemini:gemini-2.5-pro011.000018.19834.0000
so_extractiongenerated_acme_foods_009.jsonopenai:4.1011.00001.59700.0000
so_extractiongenerated_acme_foods_009.jsonopenai:5-mini011.000015.43962.0000
so_extractiongenerated_acme_foods_009.jsonopenai:5.2011.00002.22703.0000
so_extractiongenerated_acme_foods_009.jsonopenai:5.4011.00001.91632.0000
so_extractiongenerated_acme_foods_009.jsonopus-4-5011.00004.68760.0000
so_extractiongenerated_acme_foods_009.jsonopus-4-6011.00005.27350.0000
so_extractiongenerated_acme_foods_009.jsonsonnet-4-5011.00004.08600.0000
so_extractiongenerated_acme_foods_009.jsonsonnet-4-6011.00004.07130.0000
so_extractiongenerated_acme_foods_010.jsongemini:gemini-2.5-flash011.000018.74055.0000
so_extractiongenerated_acme_foods_010.jsongemini:gemini-2.5-pro011.000021.44341.0000
so_extractiongenerated_acme_foods_010.jsonopenai:4.1011.00001.49402.0000
so_extractiongenerated_acme_foods_010.jsonopenai:5-mini011.000014.42494.0000
so_extractiongenerated_acme_foods_010.jsonopenai:5.2011.00002.23286.0000
so_extractiongenerated_acme_foods_010.jsonopenai:5.4011.00002.65852.0000
so_extractiongenerated_acme_foods_010.jsonopus-4-5011.00005.87350.0000
so_extractiongenerated_acme_foods_010.jsonopus-4-6011.00008.40320.0000
so_extractiongenerated_acme_foods_010.jsonsonnet-4-5011.00006.22741.0000
so_extractiongenerated_acme_foods_010.jsonsonnet-4-6011.00004.75430.0000
so_extractiongenerated_nova_exports_001.jsongemini:gemini-2.5-flash011.000010.17056.0000
so_extractiongenerated_nova_exports_001.jsongemini:gemini-2.5-pro011.000015.99544.0000
so_extractiongenerated_nova_exports_001.jsonopenai:4.1011.00001.73412.0000
so_extractiongenerated_nova_exports_001.jsonopenai:5-mini011.000023.04553.0000
so_extractiongenerated_nova_exports_001.jsonopenai:5.2011.00002.21212.0000
so_extractiongenerated_nova_exports_001.jsonopenai:5.4011.00002.12482.0000
so_extractiongenerated_nova_exports_001.jsonopus-4-5011.00004.63492.0000
so_extractiongenerated_nova_exports_001.jsonopus-4-6011.00004.94752.0000
so_extractiongenerated_nova_exports_001.jsonsonnet-4-5011.00005.44873.0000
so_extractiongenerated_nova_exports_001.jsonsonnet-4-6011.00004.39032.0000
so_extractiongenerated_nova_exports_002.jsongemini:gemini-2.5-flash011.000013.34233.0000
so_extractiongenerated_nova_exports_002.jsongemini:gemini-2.5-pro011.000019.29921.0000
so_extractiongenerated_nova_exports_002.jsonopenai:4.1011.00001.42862.0000
so_extractiongenerated_nova_exports_002.jsonopenai:5-mini011.000020.27006.0000
so_extractiongenerated_nova_exports_002.jsonopenai:5.2011.00002.21974.0000
so_extractiongenerated_nova_exports_002.jsonopenai:5.4011.00002.54632.0000
so_extractiongenerated_nova_exports_002.jsonopus-4-5011.00005.90520.0000
so_extractiongenerated_nova_exports_002.jsonopus-4-6011.00005.88520.0000
so_extractiongenerated_nova_exports_002.jsonsonnet-4-5011.00005.07151.0000
so_extractiongenerated_nova_exports_002.jsonsonnet-4-6011.00005.11040.0000
so_extractiongenerated_nova_exports_003.jsongemini:gemini-2.5-flash011.000012.22607.0000
so_extractiongenerated_nova_exports_003.jsongemini:gemini-2.5-pro011.000017.40649.0000
so_extractiongenerated_nova_exports_003.jsonopenai:4.1011.00001.20301.0000
so_extractiongenerated_nova_exports_003.jsonopenai:5-mini011.000011.27593.0000
so_extractiongenerated_nova_exports_003.jsonopenai:5.2011.00002.12003.0000
so_extractiongenerated_nova_exports_003.jsonopenai:5.4011.00001.98675.0000
so_extractiongenerated_nova_exports_003.jsonopus-4-5011.00004.57891.0000
so_extractiongenerated_nova_exports_003.jsonopus-4-6011.00004.92071.0000
so_extractiongenerated_nova_exports_003.jsonsonnet-4-5011.00005.44312.0000
so_extractiongenerated_nova_exports_003.jsonsonnet-4-6011.00004.32510.0000
so_extractiongenerated_nova_exports_004.jsongemini:gemini-2.5-flash011.000012.92196.0000
so_extractiongenerated_nova_exports_004.jsongemini:gemini-2.5-pro011.000017.74395.0000
so_extractiongenerated_nova_exports_004.jsonopenai:4.1011.00001.82321.0000
so_extractiongenerated_nova_exports_004.jsonopenai:5-mini011.000018.38996.0000
so_extractiongenerated_nova_exports_004.jsonopenai:5.2011.00001.72275.0000
so_extractiongenerated_nova_exports_004.jsonopenai:5.4011.00001.83035.0000
so_extractiongenerated_nova_exports_004.jsonopus-4-5011.00004.92831.0000
so_extractiongenerated_nova_exports_004.jsonopus-4-6011.00004.88433.0000
so_extractiongenerated_nova_exports_004.jsonsonnet-4-5011.00004.82134.0000
so_extractiongenerated_nova_exports_004.jsonsonnet-4-6011.00004.07400.0000
so_extractiongenerated_nova_exports_005.jsongemini:gemini-2.5-flash011.000010.11771.0000
so_extractiongenerated_nova_exports_005.jsongemini:gemini-2.5-pro011.000015.88333.0000
so_extractiongenerated_nova_exports_005.jsonopenai:4.1011.00001.15210.0000
so_extractiongenerated_nova_exports_005.jsonopenai:5-mini011.000017.94562.0000
so_extractiongenerated_nova_exports_005.jsonopenai:5.2011.00003.69802.0000
so_extractiongenerated_nova_exports_005.jsonopenai:5.4011.00001.85472.0000
so_extractiongenerated_nova_exports_005.jsonopus-4-5011.00004.37290.0000
so_extractiongenerated_nova_exports_005.jsonopus-4-6011.00005.41550.0000
so_extractiongenerated_nova_exports_005.jsonsonnet-4-5011.00005.24171.0000
so_extractiongenerated_nova_exports_005.jsonsonnet-4-6011.00004.22920.0000
so_extractiongenerated_nova_exports_006.jsongemini:gemini-2.5-flash011.00005.08963.0000
so_extractiongenerated_nova_exports_006.jsongemini:gemini-2.5-pro011.000021.47113.0000
so_extractiongenerated_nova_exports_006.jsonopenai:4.1011.00001.40111.0000
so_extractiongenerated_nova_exports_006.jsonopenai:5-mini011.000016.73223.0000
so_extractiongenerated_nova_exports_006.jsonopenai:5.2011.00002.00742.0000
so_extractiongenerated_nova_exports_006.jsonopenai:5.4011.00002.23822.0000
so_extractiongenerated_nova_exports_006.jsonopus-4-5011.00004.62570.0000
so_extractiongenerated_nova_exports_006.jsonopus-4-6011.00005.11280.0000
so_extractiongenerated_nova_exports_006.jsonsonnet-4-5011.00004.41101.0000
so_extractiongenerated_nova_exports_006.jsonsonnet-4-6011.00004.21170.0000
so_extractiongenerated_nova_exports_007.jsongemini:gemini-2.5-flash011.000011.02781.0000
so_extractiongenerated_nova_exports_007.jsongemini:gemini-2.5-pro011.000012.70480.0000
so_extractiongenerated_nova_exports_007.jsonopenai:4.1011.00001.34881.0000
so_extractiongenerated_nova_exports_007.jsonopenai:5-mini011.000029.34633.0000
so_extractiongenerated_nova_exports_007.jsonopenai:5.2011.00001.66473.0000
so_extractiongenerated_nova_exports_007.jsonopenai:5.4011.00002.18603.0000
so_extractiongenerated_nova_exports_007.jsonopus-4-5011.00004.85050.0000
so_extractiongenerated_nova_exports_007.jsonopus-4-6011.00005.08500.0000
so_extractiongenerated_nova_exports_007.jsonsonnet-4-5011.00004.07170.0000
so_extractiongenerated_nova_exports_007.jsonsonnet-4-6011.00004.22950.0000
so_extraction | generated_nova_exports_008.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 19.6377 | 2.0000
so_extraction | generated_nova_exports_008.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 18.3525 | 4.0000
so_extraction | generated_nova_exports_008.json | openai:4.1 | 0 | 1 | 1.0000 | 1.7157 | 0.0000
so_extraction | generated_nova_exports_008.json | openai:5-mini | 0 | 1 | 1.0000 | 26.3901 | 2.0000
so_extraction | generated_nova_exports_008.json | openai:5.2 | 0 | 1 | 1.0000 | 3.5669 | 2.0000
so_extraction | generated_nova_exports_008.json | openai:5.4 | 0 | 1 | 1.0000 | 1.8317 | 2.0000
so_extraction | generated_nova_exports_008.json | opus-4-5 | 0 | 1 | 1.0000 | 4.4424 | 0.0000
so_extraction | generated_nova_exports_008.json | opus-4-6 | 0 | 1 | 1.0000 | 4.8895 | 0.0000
so_extraction | generated_nova_exports_008.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.4788 | 1.0000
so_extraction | generated_nova_exports_008.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.1210 | 0.0000
so_extraction | generated_nova_exports_009.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 9.6482 | 2.0000
so_extraction | generated_nova_exports_009.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 19.4427 | 2.0000
so_extraction | generated_nova_exports_009.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3685 | 0.0000
so_extraction | generated_nova_exports_009.json | openai:5-mini | 0 | 1 | 1.0000 | 20.1629 | 2.0000
so_extraction | generated_nova_exports_009.json | openai:5.2 | 0 | 1 | 1.0000 | 2.0380 | 2.0000
so_extraction | generated_nova_exports_009.json | openai:5.4 | 0 | 1 | 1.0000 | 2.1798 | 2.0000
so_extraction | generated_nova_exports_009.json | opus-4-5 | 0 | 1 | 1.0000 | 4.6709 | 0.0000
so_extraction | generated_nova_exports_009.json | opus-4-6 | 0 | 1 | 1.0000 | 4.8325 | 0.0000
so_extraction | generated_nova_exports_009.json | sonnet-4-5 | 0 | 1 | 1.0000 | 3.9743 | 1.0000
so_extraction | generated_nova_exports_009.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.5180 | 0.0000
so_extraction | generated_nova_exports_010.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 8.3614 | 4.0000
so_extraction | generated_nova_exports_010.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 14.8309 | 3.0000
so_extraction | generated_nova_exports_010.json | openai:4.1 | 0 | 1 | 1.0000 | 1.8049 | 4.0000
so_extraction | generated_nova_exports_010.json | openai:5-mini | 0 | 1 | 1.0000 | 15.7686 | 4.0000
so_extraction | generated_nova_exports_010.json | openai:5.2 | 0 | 1 | 1.0000 | 2.4636 | 4.0000
so_extraction | generated_nova_exports_010.json | openai:5.4 | 0 | 1 | 1.0000 | 2.2039 | 2.0000
so_extraction | generated_nova_exports_010.json | opus-4-5 | 0 | 1 | 1.0000 | 6.2910 | 0.0000
so_extraction | generated_nova_exports_010.json | opus-4-6 | 0 | 1 | 1.0000 | 5.9550 | 0.0000
so_extraction | generated_nova_exports_010.json | sonnet-4-5 | 0 | 1 | 1.0000 | 5.0502 | 1.0000
so_extraction | generated_nova_exports_010.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.9001 | 0.0000
so_extraction | realistic_acme_foods_001.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 6.1146 | 7.0000
so_extraction | realistic_acme_foods_001.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 18.4731 | 6.0000
so_extraction | realistic_acme_foods_001.json | openai:4.1 | 0 | 1 | 1.0000 | 1.1193 | 3.0000
so_extraction | realistic_acme_foods_001.json | openai:5-mini | 0 | 1 | 1.0000 | 28.6586 | 3.0000
so_extraction | realistic_acme_foods_001.json | openai:5.2 | 0 | 1 | 1.0000 | 3.0003 | 5.0000
so_extraction | realistic_acme_foods_001.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0924 | 7.0000
so_extraction | realistic_acme_foods_001.json | opus-4-5 | 0 | 1 | 1.0000 | 4.7882 | 2.0000
so_extraction | realistic_acme_foods_001.json | opus-4-6 | 0 | 1 | 1.0000 | 4.6929 | 2.0000
so_extraction | realistic_acme_foods_001.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.8481 | 3.0000
so_extraction | realistic_acme_foods_001.json | sonnet-4-6 | 0 | 1 | 1.0000 | 3.9593 | 0.0000
so_extraction | realistic_acme_foods_002.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 10.2994 | 5.0000
so_extraction | realistic_acme_foods_002.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 17.8389 | 3.0000
so_extraction | realistic_acme_foods_002.json | openai:4.1 | 0 | 1 | 1.0000 | 1.2707 | 1.0000
so_extraction | realistic_acme_foods_002.json | openai:5-mini | 0 | 1 | 1.0000 | 26.7388 | 4.0000
so_extraction | realistic_acme_foods_002.json | openai:5.2 | 0 | 1 | 1.0000 | 7.9718 | 4.0000
so_extraction | realistic_acme_foods_002.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0150 | 4.0000
so_extraction | realistic_acme_foods_002.json | opus-4-5 | 0 | 1 | 1.0000 | 4.9555 | 0.0000
so_extraction | realistic_acme_foods_002.json | opus-4-6 | 0 | 1 | 1.0000 | 5.2631 | 0.0000
so_extraction | realistic_acme_foods_002.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.5517 | 0.0000
so_extraction | realistic_acme_foods_002.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.3146 | 0.0000
so_extraction | realistic_acme_foods_003.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 12.9914 | 3.0000
so_extraction | realistic_acme_foods_003.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 23.3319 | 3.0000
so_extraction | realistic_acme_foods_003.json | openai:4.1 | 0 | 1 | 1.0000 | 1.6483 | 4.0000
so_extraction | realistic_acme_foods_003.json | openai:5-mini | 0 | 1 | 1.0000 | 15.4838 | 3.0000
so_extraction | realistic_acme_foods_003.json | openai:5.2 | 0 | 1 | 1.0000 | 11.0126 | 3.0000
so_extraction | realistic_acme_foods_003.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0606 | 3.0000
so_extraction | realistic_acme_foods_003.json | opus-4-5 | 0 | 1 | 1.0000 | 5.0083 | 0.0000
so_extraction | realistic_acme_foods_003.json | opus-4-6 | 0 | 1 | 1.0000 | 4.4763 | 1.0000
so_extraction | realistic_acme_foods_003.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.1627 | 1.0000
so_extraction | realistic_acme_foods_003.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.0330 | 0.0000
so_extraction | realistic_acme_foods_004.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 10.1684 | 2.0000
so_extraction | realistic_acme_foods_004.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 20.4846 | 6.0000
so_extraction | realistic_acme_foods_004.json | openai:4.1 | 0 | 1 | 1.0000 | 1.2901 | 4.0000
so_extraction | realistic_acme_foods_004.json | openai:5-mini | 0 | 1 | 1.0000 | 23.3948 | 6.0000
so_extraction | realistic_acme_foods_004.json | openai:5.2 | 0 | 1 | 1.0000 | 8.7824 | 5.0000
so_extraction | realistic_acme_foods_004.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0444 | 3.0000
so_extraction | realistic_acme_foods_004.json | opus-4-5 | 0 | 1 | 1.0000 | 4.5648 | 0.0000
so_extraction | realistic_acme_foods_004.json | opus-4-6 | 0 | 1 | 1.0000 | 4.2712 | 0.0000
so_extraction | realistic_acme_foods_004.json | sonnet-4-5 | 0 | 1 | 1.0000 | 2.6392 | 1.0000
so_extraction | realistic_acme_foods_004.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.1950 | 0.0000
so_extraction | realistic_acme_foods_005.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 20.2311 | 2.0000
so_extraction | realistic_acme_foods_005.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 27.4563 | 6.0000
so_extraction | realistic_acme_foods_005.json | openai:4.1 | 0 | 1 | 1.0000 | 1.2396 | 6.0000
so_extraction | realistic_acme_foods_005.json | openai:5-mini | 0 | 1 | 1.0000 | 25.6974 | 3.0000
so_extraction | realistic_acme_foods_005.json | openai:5.2 | 0 | 1 | 1.0000 | 12.6323 | 6.0000
so_extraction | realistic_acme_foods_005.json | openai:5.4 | 0 | 1 | 1.0000 | 1.9805 | 4.0000
so_extraction | realistic_acme_foods_005.json | opus-4-5 | 0 | 1 | 1.0000 | 5.2741 | 2.0000
so_extraction | realistic_acme_foods_005.json | opus-4-6 | 0 | 1 | 1.0000 | 5.0390 | 3.0000
so_extraction | realistic_acme_foods_005.json | sonnet-4-5 | 0 | 1 | 1.0000 | 2.7638 | 1.0000
so_extraction | realistic_acme_foods_005.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.3875 | 1.0000
so_extraction | realistic_acme_foods_006.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 12.7844 | 2.0000
so_extraction | realistic_acme_foods_006.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 23.9075 | 0.0000
so_extraction | realistic_acme_foods_006.json | openai:4.1 | 0 | 1 | 1.0000 | 1.2162 | 0.0000
so_extraction | realistic_acme_foods_006.json | openai:5-mini | 0 | 1 | 1.0000 | 1.9610 | 3.0000
so_extraction | realistic_acme_foods_006.json | openai:5.2 | 0 | 1 | 1.0000 | 2.0838 | 3.0000
so_extraction | realistic_acme_foods_006.json | openai:5.4 | 0 | 1 | 1.0000 | 1.7070 | 2.0000
so_extraction | realistic_acme_foods_006.json | opus-4-5 | 0 | 1 | 1.0000 | 5.1456 | 0.0000
so_extraction | realistic_acme_foods_006.json | opus-4-6 | 0 | 1 | 1.0000 | 4.9930 | 0.0000
so_extraction | realistic_acme_foods_006.json | sonnet-4-5 | 0 | 1 | 1.0000 | 5.8872 | 0.0000
so_extraction | realistic_acme_foods_006.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.1303 | 0.0000
so_extraction | realistic_acme_foods_007.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 17.5073 | 8.0000
so_extraction | realistic_acme_foods_007.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 21.9423 | 6.0000
so_extraction | realistic_acme_foods_007.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3549 | 5.0000
so_extraction | realistic_acme_foods_007.json | openai:5-mini | 0 | 1 | 1.0000 | 28.0184 | 6.0000
so_extraction | realistic_acme_foods_007.json | openai:5.2 | 0 | 1 | 1.0000 | 3.1587 | 6.0000
so_extraction | realistic_acme_foods_007.json | openai:5.4 | 0 | 1 | 1.0000 | 1.9197 | 7.0000
so_extraction | realistic_acme_foods_007.json | opus-4-5 | 0 | 1 | 1.0000 | 5.2392 | 3.0000
so_extraction | realistic_acme_foods_007.json | opus-4-6 | 0 | 1 | 1.0000 | 5.6859 | 3.0000
so_extraction | realistic_acme_foods_007.json | sonnet-4-5 | 0 | 1 | 1.0000 | 3.9871 | 3.0000
so_extraction | realistic_acme_foods_007.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.3239 | 0.0000
so_extraction | realistic_acme_foods_008.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 20.4202 | 5.0000
so_extraction | realistic_acme_foods_008.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 19.4550 | 3.0000
so_extraction | realistic_acme_foods_008.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3968 | 4.0000
so_extraction | realistic_acme_foods_008.json | openai:5-mini | 0 | 1 | 1.0000 | 23.6305 | 5.0000
so_extraction | realistic_acme_foods_008.json | openai:5.2 | 0 | 1 | 1.0000 | 4.6057 | 6.0000
so_extraction | realistic_acme_foods_008.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0218 | 5.0000
so_extraction | realistic_acme_foods_008.json | opus-4-5 | 0 | 1 | 1.0000 | 5.4563 | 3.0000
so_extraction | realistic_acme_foods_008.json | opus-4-6 | 0 | 1 | 1.0000 | 4.6696 | 3.0000
so_extraction | realistic_acme_foods_008.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.8965 | 3.0000
so_extraction | realistic_acme_foods_008.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.2515 | 0.0000
so_extraction | realistic_acme_foods_009.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 14.9120 | 6.0000
so_extraction | realistic_acme_foods_009.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 24.9335 | 7.0000
so_extraction | realistic_acme_foods_009.json | openai:4.1 | 0 | 1 | 1.0000 | 1.5470 | 4.0000
so_extraction | realistic_acme_foods_009.json | openai:5-mini | 0 | 1 | 1.0000 | 27.8516 | 5.0000
so_extraction | realistic_acme_foods_009.json | openai:5.2 | 0 | 1 | 1.0000 | 4.3317 | 4.0000
so_extraction | realistic_acme_foods_009.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0511 | 3.0000
so_extraction | realistic_acme_foods_009.json | opus-4-5 | 0 | 1 | 1.0000 | 5.5710 | 1.0000
so_extraction | realistic_acme_foods_009.json | opus-4-6 | 0 | 1 | 1.0000 | 4.4012 | 0.0000
so_extraction | realistic_acme_foods_009.json | sonnet-4-5 | 0 | 1 | 1.0000 | 2.5007 | 1.0000
so_extraction | realistic_acme_foods_009.json | sonnet-4-6 | 0 | 1 | 1.0000 | 3.9493 | 0.0000
so_extraction | realistic_acme_foods_010.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 9.1193 | 1.0000
so_extraction | realistic_acme_foods_010.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 26.0655 | 2.0000
so_extraction | realistic_acme_foods_010.json | openai:4.1 | 0 | 1 | 1.0000 | 1.4657 | 3.0000
so_extraction | realistic_acme_foods_010.json | openai:5-mini | 0 | 1 | 1.0000 | 16.9028 | 5.0000
so_extraction | realistic_acme_foods_010.json | openai:5.2 | 0 | 1 | 1.0000 | 2.4674 | 3.0000
so_extraction | realistic_acme_foods_010.json | openai:5.4 | 0 | 1 | 1.0000 | 1.9987 | 2.0000
so_extraction | realistic_acme_foods_010.json | opus-4-5 | 0 | 1 | 1.0000 | 4.4720 | 0.0000
so_extraction | realistic_acme_foods_010.json | opus-4-6 | 0 | 1 | 1.0000 | 4.8886 | 0.0000
so_extraction | realistic_acme_foods_010.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.1570 | 0.0000
so_extraction | realistic_acme_foods_010.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.3430 | 0.0000
so_extraction | realistic_nova_exports_001.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 16.1502 | 4.0000
so_extraction | realistic_nova_exports_001.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 18.2972 | 5.0000
so_extraction | realistic_nova_exports_001.json | openai:4.1 | 0 | 1 | 1.0000 | 5.2296 | 3.0000
so_extraction | realistic_nova_exports_001.json | openai:5-mini | 0 | 1 | 1.0000 | 14.3826 | 5.0000
so_extraction | realistic_nova_exports_001.json | openai:5.2 | 0 | 1 | 1.0000 | 1.8066 | 6.0000
so_extraction | realistic_nova_exports_001.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0489 | 5.0000
so_extraction | realistic_nova_exports_001.json | opus-4-5 | 0 | 1 | 1.0000 | 5.1107 | 4.0000
so_extraction | realistic_nova_exports_001.json | opus-4-6 | 0 | 1 | 1.0000 | 4.8549 | 4.0000
so_extraction | realistic_nova_exports_001.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.9928 | 4.0000
so_extraction | realistic_nova_exports_001.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.6004 | 1.0000
so_extraction | realistic_nova_exports_002.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 13.4630 | 5.0000
so_extraction | realistic_nova_exports_002.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 26.8497 | 6.0000
so_extraction | realistic_nova_exports_002.json | openai:4.1 | 0 | 1 | 1.0000 | 1.1512 | 1.0000
so_extraction | realistic_nova_exports_002.json | openai:5-mini | 0 | 1 | 1.0000 | 29.6459 | 5.0000
so_extraction | realistic_nova_exports_002.json | openai:5.2 | 0 | 1 | 1.0000 | 1.8544 | 3.0000
so_extraction | realistic_nova_exports_002.json | openai:5.4 | 0 | 1 | 1.0000 | 1.5424 | 3.0000
so_extraction | realistic_nova_exports_002.json | opus-4-5 | 0 | 1 | 1.0000 | 5.1487 | 1.0000
so_extraction | realistic_nova_exports_002.json | opus-4-6 | 0 | 1 | 1.0000 | 2.9519 | 1.0000
so_extraction | realistic_nova_exports_002.json | sonnet-4-5 | 0 | 1 | 1.0000 | 3.4218 | 1.0000
so_extraction | realistic_nova_exports_002.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.1242 | 0.0000
so_extraction | realistic_nova_exports_003.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 15.5776 | 1.0000
so_extraction | realistic_nova_exports_003.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 22.7992 | 3.0000
so_extraction | realistic_nova_exports_003.json | openai:4.1 | 0 | 1 | 1.0000 | 1.5676 | 2.0000
so_extraction | realistic_nova_exports_003.json | openai:5-mini | 0 | 1 | 1.0000 | 41.8010 | 4.0000
so_extraction | realistic_nova_exports_003.json | openai:5.2 | 0 | 1 | 1.0000 | 2.0607 | 4.0000
so_extraction | realistic_nova_exports_003.json | openai:5.4 | 0 | 1 | 1.0000 | 2.5556 | 3.0000
so_extraction | realistic_nova_exports_003.json | opus-4-5 | 0 | 1 | 1.0000 | 4.6944 | 3.0000
so_extraction | realistic_nova_exports_003.json | opus-4-6 | 0 | 1 | 1.0000 | 4.4312 | 3.0000
so_extraction | realistic_nova_exports_003.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.1307 | 4.0000
so_extraction | realistic_nova_exports_003.json | sonnet-4-6 | 0 | 1 | 1.0000 | 3.8931 | 1.0000
so_extraction | realistic_nova_exports_004.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 8.6169 | 5.0000
so_extraction | realistic_nova_exports_004.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 13.1474 | 4.0000
so_extraction | realistic_nova_exports_004.json | openai:4.1 | 0 | 1 | 1.0000 | 1.6967 | 3.0000
so_extraction | realistic_nova_exports_004.json | openai:5-mini | 0 | 1 | 1.0000 | 30.6593 | 4.0000
so_extraction | realistic_nova_exports_004.json | openai:5.2 | 0 | 1 | 1.0000 | 2.2076 | 3.0000
so_extraction | realistic_nova_exports_004.json | openai:5.4 | 0 | 1 | 1.0000 | 2.4508 | 5.0000
so_extraction | realistic_nova_exports_004.json | opus-4-5 | 0 | 1 | 1.0000 | 4.7815 | 0.0000
so_extraction | realistic_nova_exports_004.json | opus-4-6 | 0 | 1 | 1.0000 | 4.7699 | 0.0000
so_extraction | realistic_nova_exports_004.json | sonnet-4-5 | 0 | 1 | 1.0000 | 2.9631 | 1.0000
so_extraction | realistic_nova_exports_004.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.4248 | 3.0000
so_extraction | realistic_nova_exports_005.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 9.5758 | 4.0000
so_extraction | realistic_nova_exports_005.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 22.9153 | 7.0000
so_extraction | realistic_nova_exports_005.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3192 | 5.0000
so_extraction | realistic_nova_exports_005.json | openai:5-mini | 0 | 1 | 1.0000 | 17.5695 | 4.0000
so_extraction | realistic_nova_exports_005.json | openai:5.2 | 0 | 1 | 1.0000 | 1.9053 | 3.0000
so_extraction | realistic_nova_exports_005.json | openai:5.4 | 0 | 1 | 1.0000 | 2.4769 | 3.0000
so_extraction | realistic_nova_exports_005.json | opus-4-5 | 0 | 1 | 1.0000 | 3.1442 | 1.0000
so_extraction | realistic_nova_exports_005.json | opus-4-6 | 0 | 1 | 1.0000 | 3.2239 | 1.0000
so_extraction | realistic_nova_exports_005.json | sonnet-4-5 | 0 | 1 | 1.0000 | 5.3208 | 2.0000
so_extraction | realistic_nova_exports_005.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.1954 | 0.0000
so_extraction | realistic_nova_exports_006.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 12.5712 | 5.0000
so_extraction | realistic_nova_exports_006.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 20.1784 | 4.0000
so_extraction | realistic_nova_exports_006.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3690 | 1.0000
so_extraction | realistic_nova_exports_006.json | openai:5-mini | 0 | 1 | 1.0000 | 21.7113 | 2.0000
so_extraction | realistic_nova_exports_006.json | openai:5.2 | 0 | 1 | 1.0000 | 1.8672 | 3.0000
so_extraction | realistic_nova_exports_006.json | openai:5.4 | 0 | 1 | 1.0000 | 2.1039 | 3.0000
so_extraction | realistic_nova_exports_006.json | opus-4-5 | 0 | 1 | 1.0000 | 5.7154 | 0.0000
so_extraction | realistic_nova_exports_006.json | opus-4-6 | 0 | 1 | 1.0000 | 4.8042 | 0.0000
so_extraction | realistic_nova_exports_006.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.9521 | 0.0000
so_extraction | realistic_nova_exports_006.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.0719 | 3.0000
so_extraction | realistic_nova_exports_007.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 13.5330 | 3.0000
so_extraction | realistic_nova_exports_007.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 13.9188 | 7.0000
so_extraction | realistic_nova_exports_007.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3290 | 2.0000
so_extraction | realistic_nova_exports_007.json | openai:5-mini | 0 | 1 | 1.0000 | 27.4265 | 5.0000
so_extraction | realistic_nova_exports_007.json | openai:5.2 | 0 | 1 | 1.0000 | 1.7371 | 3.0000
so_extraction | realistic_nova_exports_007.json | openai:5.4 | 0 | 1 | 1.0000 | 1.8682 | 2.0000
so_extraction | realistic_nova_exports_007.json | opus-4-5 | 0 | 1 | 1.0000 | 3.0017 | 1.0000
so_extraction | realistic_nova_exports_007.json | opus-4-6 | 0 | 1 | 1.0000 | 4.4278 | 1.0000
so_extraction | realistic_nova_exports_007.json | sonnet-4-5 | 0 | 1 | 1.0000 | 2.7395 | 1.0000
so_extraction | realistic_nova_exports_007.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.2059 | 0.0000
so_extraction | realistic_nova_exports_008.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 10.5930 | 5.0000
so_extraction | realistic_nova_exports_008.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 11.2706 | 5.0000
so_extraction | realistic_nova_exports_008.json | openai:4.1 | 0 | 1 | 1.0000 | 2.8196 | 2.0000
so_extraction | realistic_nova_exports_008.json | openai:5-mini | 0 | 1 | 1.0000 | 26.7961 | 6.0000
so_extraction | realistic_nova_exports_008.json | openai:5.2 | 0 | 1 | 1.0000 | 2.1278 | 3.0000
so_extraction | realistic_nova_exports_008.json | openai:5.4 | 0 | 1 | 1.0000 | 2.1011 | 2.0000
so_extraction | realistic_nova_exports_008.json | opus-4-5 | 0 | 1 | 1.0000 | 4.9878 | 3.0000
so_extraction | realistic_nova_exports_008.json | opus-4-6 | 0 | 1 | 1.0000 | 5.5318 | 3.0000
so_extraction | realistic_nova_exports_008.json | sonnet-4-5 | 0 | 1 | 1.0000 | 3.3150 | 1.0000
so_extraction | realistic_nova_exports_008.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.1594 | 3.0000
so_extraction | realistic_nova_exports_009.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 17.1132 | 2.0000
so_extraction | realistic_nova_exports_009.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 22.2329 | 7.0000
so_extraction | realistic_nova_exports_009.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3906 | 0.0000
so_extraction | realistic_nova_exports_009.json | openai:5-mini | 0 | 1 | 1.0000 | 13.2795 | 1.0000
so_extraction | realistic_nova_exports_009.json | openai:5.2 | 0 | 1 | 1.0000 | 2.4449 | 3.0000
so_extraction | realistic_nova_exports_009.json | openai:5.4 | 0 | 1 | 1.0000 | 2.1407 | 3.0000
so_extraction | realistic_nova_exports_009.json | opus-4-5 | 0 | 1 | 1.0000 | 4.7577 | 1.0000
so_extraction | realistic_nova_exports_009.json | opus-4-6 | 0 | 1 | 1.0000 | 4.8655 | 0.0000
so_extraction | realistic_nova_exports_009.json | sonnet-4-5 | 0 | 1 | 1.0000 | 4.0259 | 2.0000
so_extraction | realistic_nova_exports_009.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.3658 | 0.0000
so_extraction | realistic_nova_exports_010.json | gemini:gemini-2.5-flash | 0 | 1 | 1.0000 | 16.4949 | 7.0000
so_extraction | realistic_nova_exports_010.json | gemini:gemini-2.5-pro | 0 | 1 | 1.0000 | 15.5362 | 6.0000
so_extraction | realistic_nova_exports_010.json | openai:4.1 | 0 | 1 | 1.0000 | 1.3439 | 2.0000
so_extraction | realistic_nova_exports_010.json | openai:5-mini | 0 | 1 | 1.0000 | 24.1796 | 4.0000
so_extraction | realistic_nova_exports_010.json | openai:5.2 | 0 | 1 | 1.0000 | 1.9014 | 4.0000
so_extraction | realistic_nova_exports_010.json | openai:5.4 | 0 | 1 | 1.0000 | 2.0181 | 3.0000
so_extraction | realistic_nova_exports_010.json | opus-4-5 | 0 | 1 | 1.0000 | 3.6940 | 1.0000
so_extraction | realistic_nova_exports_010.json | opus-4-6 | 0 | 1 | 1.0000 | 4.7903 | 0.0000
so_extraction | realistic_nova_exports_010.json | sonnet-4-5 | 0 | 1 | 1.0000 | 3.2096 | 1.0000
so_extraction | realistic_nova_exports_010.json | sonnet-4-6 | 0 | 1 | 1.0000 | 4.5346 | 0.0000

Top Mismatches (up to 100 rows)

Agent | Chat | Model | FS count | Mismatches | Sample
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | gemini:gemini-2.5-flash | 0 | 20
[
  {
    "path": "data[0].items",
    "expected_len": 1,
    "actual_len": 3
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | gemini:gemini-2.5-pro | 0 | 20
[
  {
    "path": "data",
    "expected_len": 1,
    "actual_len": 2
  },
  {
    "path": "data[0].items",
    "expected_len": 1,
    "actual_len": 2
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  }
]
so_extraction | 08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json | sonnet-4-5 | 0 | 19
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:5-mini | 0 | 18
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/kg"
  }
]
so_extraction | 08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json | sonnet-4-6 | 0 | 18
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction | 01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json | openai:4.1 | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:4.1 | 0 | 17
[
  {
    "path": "data[0].items",
    "expected_len": 1,
    "actual_len": 3
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:5.4 | 0 | 17
[
  {
    "path": "data",
    "expected_len": 1,
    "actual_len": 3
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | sonnet-4-5 | 0 | 17
[
  {
    "path": "data",
    "expected_len": 1,
    "actual_len": 3
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json | gemini:gemini-2.5-flash | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction | 04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json | openai:5-mini | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD/KG",
    "actual": "USD/kg"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  }
]
so_extraction | 07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:4.1 | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 10500.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3100.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": ""
  }
]
so_extraction | 07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | sonnet-4-6 | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 10500.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3100.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": ""
  }
]
so_extraction | 07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:5.2 | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 10500.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3100.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": ""
  }
]
so_extraction | 07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:5.4 | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 10500.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3100.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": ""
  }
]
so_extraction | 07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:5-mini | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 10500.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3100.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": ""
  }
]
so_extraction | 07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | gemini:gemini-2.5-flash | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 10500.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3100.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": ""
  }
]
so_extraction | 07__2025-12-23__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | gemini:gemini-2.5-pro | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 10500.0,
    "actual": 10.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3100.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": ""
  }
]
so_extraction | 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json | sonnet-4-5 | 0 | 17
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction | 01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json | openai:5.4 | 0 | 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | openai:5.2 | 0 | 16
[
  {
    "path": "data[0].items",
    "expected_len": 1,
    "actual_len": 2
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 6.3
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | sonnet-4-6 | 0 | 16
[
  {
    "path": "data",
    "expected_len": 1,
    "actual_len": 2
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 6.3
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | opus-4-6 | 0 | 16
[
  {
    "path": "data",
    "expected_len": 1,
    "actual_len": 2
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 6.3
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 03__2026-01-30__120363403074656566_g_us__8f477a8f-2a60-4e0a-bf0e-8cc3cdf1dc9f.json | opus-4-5 | 0 | 16
[
  {
    "path": "data",
    "expected_len": 1,
    "actual_len": 2
  },
  {
    "path": "data[0].items[0].description",
    "expected": "BergaPur",
    "actual": "Bergapur"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 6300.0,
    "actual": 6.3
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3050.0,
    "actual": 3.05
  }
]
so_extraction | 04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json | openai:5.2 | 0 | 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json · sonnet-4-6 · 0 · 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json · sonnet-4-5 · 0 · 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json · openai:5.4 · 0 · 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  },
  {
    "path": "data[0].items[0].shipment_date",
    "expected": "",
    "actual": "2026-02-28"
  }
]
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json · gemini:gemini-2.5-pro · 0 · 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · openai:4.1 · 0 · 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · gemini:gemini-2.5-flash · 0 · 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · sonnet-4-6 · 0 · 16
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · openai:5.2 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · sonnet-4-6 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · opus-4-6 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · opus-4-5 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · sonnet-4-5 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · gemini:gemini-2.5-flash · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/kg"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · gemini:gemini-2.5-pro · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction01__2026-02-24__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · openai:5-mini · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3250.0,
    "actual": 3.25
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/kg"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · sonnet-4-6 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · gemini:gemini-2.5-flash · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · gemini:gemini-2.5-pro · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json · openai:4.1 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json · opus-4-5 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction04__2026-01-29__120363408498669191_g_us__4b9c2faa-94dd-4236-abcc-398807051f21.json · opus-4-6 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 9000.0,
    "actual": 9.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Jakarta",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Jakarta"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · gemini:gemini-2.5-flash · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · gemini:gemini-2.5-pro · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · openai:5.4 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · openai:5-mini · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/kg"
  }
]
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:5.4 · 0 · 15
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:5.4 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · sonnet-4-5 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · opus-4-5 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · opus-4-6 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:4.1 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:5.2 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · sonnet-4-6 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:5.2 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:5.4 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · opus-4-5 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · openai:5.2 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · sonnet-4-5 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · opus-4-6 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · opus-4-5 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · gemini:gemini-2.5-flash · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/kg"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · gemini:gemini-2.5-pro · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 3500.0,
    "actual": 3.5
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/KG"
  }
]
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:4.1 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:5.2 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · opus-4-5 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:5.4 · 0 · 14
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction02__2026-02-09__120363426578757754_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:5-mini · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "Soya Lecithin Powder"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 39000.0,
    "actual": 39.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF Nhava Sheva",
    "actual": "CIF"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:4.1 · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · sonnet-4-5 · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · opus-4-6 · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction05__2026-01-20__120363407382355715_g_us__12a4f3a7-d506-4d32-ae06-3f76508c6abd.json · openai:5-mini · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOIFNE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 26000.0,
    "actual": 26.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "KG",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "CIF NHAVA SHEVA",
    "actual": "CIF"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Nhava Sheva"
  }
]
so_extraction06__2026-01-06__120363421131250401_g_us__e05574ec-b110-4554-9fc3-3abb4f9011a8.json · sonnet-4-6 · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "GIIOFINE - P - S",
    "actual": "GIIOFINE-P-S"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 1800.0,
    "actual": 1.8
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "EXW"
  }
]
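Two of the mismatches in the diff above differ only in letter case or in whitespace around hyphens (`usd/mt` against `USD/MT`, `GIIOFINE - P - S` against `GIIOFINE-P-S`). A minimal Python sketch of a lenient string comparison that would treat such pairs as equal is given here. The function names are hypothetical, and the benchmark itself appears to use exact matching.

```python
import re

def normalize(value):
    """Lowercase a string and remove all whitespace."""
    return re.sub(r"\s+", "", value).lower()

def loosely_equal(expected, actual):
    """Return True when two strings differ only in case or whitespace."""
    return normalize(expected) == normalize(actual)

# Pairs taken from the diff above.
print(loosely_equal("usd/mt", "USD/MT"))
print(loosely_equal("GIIOFINE - P - S", "GIIOFINE-P-S"))
```

A comparison like this would not hide real differences: `usd/mt` against `USD`, or a transposed spelling such as `GIIOIFNE-P-S`, would still fail.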
so_extraction · 08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · gemini:gemini-2.5-pro · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:5.2 · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · gemini:gemini-2.5-flash · 0 · 13
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · opus-4-6 · 0 · 12
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 08__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:5-mini · 0 · 12
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 46000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · gemini:gemini-2.5-pro · 0 · 11
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:4.1 · 0 · 10
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · opus-4-5 · 0 · 10
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · opus-4-6 · 0 · 10
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD/MT"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF Busan"
  }
]
so_extraction · generated_acme_foods_001.json · openai:5.4 · 0 · 10
[
  {
    "path": "data[0].items[0].description",
    "expected": "Arabica coffee bags",
    "actual": ""
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 8.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": 24.0,
    "actual": null
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD",
    "actual": ""
  },
  {
    "path": "data[0].items[0].shipment_date",
    "expected": "2026-05-15",
    "actual": ""
  }
]
so_extraction · 09__2025-09-29__120363403592950429_g_us__d586d853-694c-42f9-93be-bc7ba5b2110c.json · openai:5-mini · 0 · 9
[
  {
    "path": "data[0].items[0].description",
    "expected": "TG - BP102",
    "actual": "BP102"
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 23000.0,
    "actual": 23.0
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "kg",
    "actual": "MT"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "usd/mt",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "CIF"
  }
]
so_extraction · generated_nova_exports_003.json · gemini:gemini-2.5-pro · 0 · 9
[
  {
    "path": "data[0].items[0].description",
    "expected": "Espresso blend",
    "actual": "Espresso blend bags"
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "",
    "actual": "BAGS"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD",
    "actual": "USD/BAG"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "urgent dispatch within 48 hours"
  },
  {
    "path": "data[0].items[0].shipment_date",
    "expected": "",
    "actual": "2026-05-15"
  }
]
so_extraction · fs_acme_simple.json · openai:5-mini · 0 · 8
[
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "BAGS",
    "actual": "bags"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD/BAG",
    "actual": "USD/bag"
  },
  {
    "path": "data[0].items[0].ship_term",
    "expected": "FOB",
    "actual": ""
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "FOB Singapore",
    "actual": ""
  },
  {
    "path": "data[0].items[0].shipping_address",
    "expected": "100 Finance Ave",
    "actual": ""
  }
]
so_extraction · realistic_acme_foods_007.json · gemini:gemini-2.5-flash · 0 · 8
[
  {
    "path": "data[0].items[0].description",
    "expected": "Green coffee beans",
    "actual": "Green coffee beans sacks"
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "SACKS",
    "actual": "sacks"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD/SACK",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].delivery_terms",
    "expected": "",
    "actual": "9 Harbor Plaza"
  },
  {
    "path": "data[0].items[0].shipment_date",
    "expected": "",
    "actual": "2026-05-14"
  }
]
so_extraction · generated_acme_foods_003.json · gemini:gemini-2.5-flash · 0 · 7
[
  {
    "path": "data[0].items[0].description",
    "expected": "Espresso blend",
    "actual": "Espresso blend bags"
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "",
    "actual": "bags"
  },
  {
    "path": "data[0].items[0].shipment_date",
    "expected": "",
    "actual": "2026-05-15"
  },
  {
    "path": "data[0].do_date",
    "expected": "",
    "actual": "2026-05-15"
  },
  {
    "path": "data[0].po_date",
    "expected": "",
    "actual": "2026-05-13"
  }
]
so_extraction · generated_nova_exports_003.json · gemini:gemini-2.5-flash · 0 · 7
[
  {
    "path": "data[0].items[0].description",
    "expected": "Espresso blend",
    "actual": "Espresso blend bags"
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "",
    "actual": "bags"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD",
    "actual": "USD/bags"
  },
  {
    "path": "data[0].items[0].shipment_date",
    "expected": "",
    "actual": "2026-05-15"
  },
  {
    "path": "data[0].do_date",
    "expected": "",
    "actual": "2026-05-15"
  }
]
so_extraction · realistic_acme_foods_001.json · openai:5.4 · 0 · 7
[
  {
    "path": "data[0].items[0].description",
    "expected": "Assam tea",
    "actual": "Assam tea cartons"
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "cartons",
    "actual": "CARTONS"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": null,
    "actual": 19.0
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "",
    "actual": "USD/CARTON"
  },
  {
    "path": "data[0].items[0].total",
    "expected": null,
    "actual": 285.0
  }
]
so_extraction · realistic_acme_foods_001.json · gemini:gemini-2.5-flash · 0 · 7
[
  {
    "path": "data[0].items[0].description",
    "expected": "Assam tea",
    "actual": "Assam tea cartons"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": null,
    "actual": 19.0
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "",
    "actual": "USD/carton"
  },
  {
    "path": "data[0].items[0].total",
    "expected": null,
    "actual": 285.0
  },
  {
    "path": "data[0].po_date",
    "expected": "",
    "actual": "2026-05-13"
  }
]
so_extraction · realistic_acme_foods_007.json · openai:5.4 · 0 · 7
[
  {
    "path": "data[0].items[0].description",
    "expected": "Green coffee beans",
    "actual": "Green coffee beans sacks"
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "SACKS",
    "actual": ""
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD/SACK",
    "actual": ""
  },
  {
    "path": "data[0].items[0].shipment_date",
    "expected": "",
    "actual": "2026-05-14"
  },
  {
    "path": "data[0].do_date",
    "expected": "",
    "actual": "2026-05-14"
  }
]
so_extraction · realistic_acme_foods_009.json · gemini:gemini-2.5-pro · 0 · 7
[
  {
    "path": "data[0].items[0].description",
    "expected": "Green coffee beans",
    "actual": "Green coffee beans sacks"
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "",
    "actual": "sacks"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": null,
    "actual": 16.0
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "",
    "actual": "USD"
  },
  {
    "path": "data[0].items[0].total",
    "expected": null,
    "actual": 1280.0
  }
]
so_extraction · realistic_nova_exports_005.json · gemini:gemini-2.5-pro · 0 · 7
[
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "",
    "actual": "PACKS"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": null,
    "actual": 14.0
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "",
    "actual": "USD/PACK"
  },
  {
    "path": "data[0].items[0].total",
    "expected": null,
    "actual": 350.0
  },
  {
    "path": "data[0].po_date",
    "expected": "",
    "actual": "2026-05-13"
  }
]
so_extraction · realistic_nova_exports_007.json · gemini:gemini-2.5-pro · 0 · 7
[
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "",
    "actual": "PACKS"
  },
  {
    "path": "data[0].items[0].unit_price",
    "expected": null,
    "actual": 6.0
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "",
    "actual": "USD/PACK"
  },
  {
    "path": "data[0].items[0].total",
    "expected": null,
    "actual": 60.0
  },
  {
    "path": "data[0].po_date",
    "expected": "",
    "actual": "2026-05-13"
  }
]
so_extraction · realistic_nova_exports_009.json · gemini:gemini-2.5-pro · 0 · 7
[
  {
    "path": "data[0].items",
    "expected_len": 1,
    "actual_len": 2
  },
  {
    "path": "data[0].items[0].quantity",
    "expected": 25.0,
    "actual": 12.5
  },
  {
    "path": "data[0].items[0].quantity_unit",
    "expected": "",
    "actual": "PACKS"
  },
  {
    "path": "data[0].items[0].pricing_unit",
    "expected": "USD",
    "actual": "USD/PACK"
  },
  {
    "path": "data[0].items[0].total",
    "expected": 350.0,
    "actual": 175.0
  }
]