ChainForge/chainforge/oaievals/european-date-format-challenge.cforge
ianarawjo b33397930b
TypeScript backend, HuggingFace models, JavaScript evaluators, Comment Nodes, and more
* Beginning to convert Python backend to TypeScript

* Change all fetch() calls to fetch_from_backend switcher

* wip converting query.py to query.ts

* wip started utils.js conversion. Tested that OpenAI API call works

* more progress on converting utils.py to TypeScript

* jest tests for query, utils, template.ts. Confirmed PromptPipeline works.

* wip converting queryLLM in flask_app to TS

* Tested queryLLM and StorageCache compressed saving/loading

* wip execute() in backend.ts

* Added execute() and tested w concrete func. Need to test eval()

* Added craco for optional webpack config. Config'd for TypeScript with Node.js packages browserify'd

* Execute JS code on iframe sandbox

* Tested and working JS Evaluator execution.

* wip swapping backends

* Tested TypeScript backend! :) woot

* Added fetchEnvironAPIKeys to Flask server to fetch os.environ keys when running locally

* Route Anthropic calls through Flask when running locally

* Added info button to Eval nodes. Rebuilt react

* Edits to info modal on Eval node

* Remove/error out on Python eval nodes when not running locally.

* Check browser compat and display error if not supported

* Changed all example flows to use JS. Bug fix in query.ts

* Refactored to LLMProvider to streamline model additions

* Added HuggingFace models API

* Added back Dalai call support, routing through Flask

* Remove flask app calls and socketio server that are no longer used

* Added Comment Nodes. Rebuilt react.

* Fix PaLM temp=0 build, update package vers and rebuild react
2023-06-30 15:11:20 -04:00

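The flow in this file scores each model reply with a small JavaScript evaluator, stored in the `code` field of the evaluator node. The sketch below copies that function so it can be run outside ChainForge, and adds a stricter variant for comparison. The `{ text, meta }` response shape is inferred from the evaluator's own code. `evaluateNormalized` and the sample objects in the usage note are hypothetical and are not part of the flow.

```javascript
// Evaluator as stored in the flow: passes when the reply and the ideal
// answer overlap as substrings in either direction.
function evaluate(response) {
  const txt = response.text;
  const ideal = response.meta['Ideal'];
  return ideal.includes(txt) || txt.includes(ideal);
}

// Hypothetical stricter variant (an assumption, not in the flow):
// trims whitespace, ignores case, and rejects empty replies.
function evaluateNormalized(response) {
  const txt = response.text.trim().toLowerCase();
  const ideal = response.meta['Ideal'].trim().toLowerCase();
  // Every string includes '', so an empty reply must be rejected explicitly.
  return txt.length > 0 && (ideal.includes(txt) || txt.includes(ideal));
}
```

With a made-up response such as `{ text: 'The month is July.', meta: { Ideal: 'July' } }`, both functions return `true`. An empty reply passes the original `evaluate`, because `'July'.includes('')` is `true`. A lowercase reply such as `'july'` fails the original because `includes` is case-sensitive.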

{"flow": {"nodes": [{"width": 312, "height": 311, "id": "prompt-european-date-format-challenge", "type": "prompt", "data": {"prompt": "{prompt}", "n": 1, "llms": [{"key": "aa3c0f03-22bd-416e-af4d-4bf5c4278c99", "settings": {"system_msg": "Answer the following questions as concisely as possible.", "temperature": 1, "functions": [], "function_call": "", "top_p": 1, "stop": [], "presence_penalty": 0, "frequency_penalty": 0}, "name": "GPT3.5", "emoji": "\ud83d\ude42", "model": "gpt-3.5-turbo", "base_model": "gpt-3.5-turbo", "temp": 1, "formData": {"shortname": "GPT3.5", "model": "gpt-3.5-turbo", "system_msg": "Answer the following questions as concisely as possible.", "temperature": 1, "functions": "", "function_call": "", "top_p": 1, "stop": "", "presence_penalty": 0, "frequency_penalty": 0}}]}, "position": {"x": 448, "y": 224}, "selected": false, "positionAbsolute": {"x": 448, "y": 224}, "dragging": false}, {"width": 333, "height": 182, "id": "eval-european-date-format-challenge", "type": "evaluator", "data": {"code": "function evaluate(response) {\n\tlet txt = response.text;\n\tlet ideal = response.meta['Ideal'];\n\treturn ideal.includes(txt) || txt.includes(ideal);\n}", "language": "javascript"}, "position": {"x": 820, "y": 150}, "positionAbsolute": {"x": 820, "y": 150}}, {"width": 228, "height": 196, "id": "vis-european-date-format-challenge", "type": "vis", "data": {"input": "eval-european-date-format-challenge"}, "position": {"x": 1200, "y": 250}, "positionAbsolute": {"x": 1200, "y": 250}}, {"width": 302, "height": 260, "id": "inspect-european-date-format-challenge", "type": "inspect", "data": {"input": "prompt-european-date-format-challenge"}, "position": {"x": 820, "y": 400}, "positionAbsolute": {"x": 820, "y": 400}}, {"width": 423, "height": 417, "id": "table-european-date-format-challenge", "type": "table", "data": {"rows": [{"prompt": "London (published 7/1/2027) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, 
reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 20/10/2029.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "John's appointment is on 1/12/2026 and his follow-up is on 14/10/2030. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "Paris (published 5/7/2027) - A Paris-based travel company has announced the world's first commercial time travel tourism package, set to become available on 21/2/2028. The package includes a trip back to the 1960s to witness the rise of rock and roll, a visit to ancient Egypt during the reign of Cleopatra, and a journey to the future to witness life on a distant planet.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "Jane's lecture is on 1/4/2024, and Anna's is on 21/8/2028. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "April"}, {"prompt": "A bottle of ketchup lists 4/2/2028 as the date of manufacture and 25/4/2030 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "A bottle of ketchup lists 8/3/2025 as the date of manufacture and 22/4/2030 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "Jane's lecture is on 2/8/2027, and Anna's is on 25/4/2029. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Given the list of dates:\n(1) 3/6/2030\n(2) 29/3/2024\n)(3) 15/6/2030\n\nIn which month does date #1 fall? 
Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "Amsterdam (published 5/1/2024) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 15/4/2027, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "Given the list of dates:\n(1) 14/6/2024\n(2) 12/7/2024\n)(3) 27/1/2025\n\nIn which month does date #2 fall? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "A bottle of ketchup lists 4/11/2025 as the date of manufacture and 18/9/2031 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "November"}, {"prompt": "John's appointment is on 10/4/2029 and his follow-up is on 18/10/2031. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "April"}, {"prompt": "Amsterdam (published 3/7/2027) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 20/10/2029, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "London (published 11/4/2028) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 13/7/2030.\n\nWhat month was this news blurb published in? 
Please reply just with the name of the month and nothing else.", "ideal": "April"}, {"prompt": "Rome (published 5/10/2024) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 25/4/2026. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Jane's lecture is on 3/6/2028, and Anna's is on 25/11/2028. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "Paris (published 12/6/2027) - A Paris-based travel company has announced the world's first commercial time travel tourism package, set to become available on 26/1/2030. The package includes a trip back to the 1960s to witness the rise of rock and roll, a visit to ancient Egypt during the reign of Cleopatra, and a journey to the future to witness life on a distant planet.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "Given the list of dates:\n(1) 16/2/2030\n(2) 9/10/2028\n)(3) 17/2/2029\n\nIn which month does date #2 fall? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "The launch was scheduled for 9/6/2030 but was postponed to 14/9/2031. In which month was the launch originally scheduled? Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "A bottle of ketchup lists 2/3/2024 as the date of manufacture and 28/3/2031 as the expiration date. In which month was the ketchup manufactured? 
Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "Rome (published 5/8/2028) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 25/9/2030. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Given the list of dates:\n(1) 30/10/2029\n(2) 1/9/2029\n)(3) 24/10/2030\n\nIn which month does date #2 fall? Please reply just with the name of the month and nothing else.", "ideal": "September"}, {"prompt": "John's appointment is on 10/8/2029 and his follow-up is on 14/6/2030. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Amsterdam (published 4/7/2026) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 22/10/2027, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "Given the list of dates:\n(1) 19/5/2028\n(2) 3/2/2029\n)(3) 21/10/2030\n\nIn which month does date #2 fall? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "Jane's lecture is on 11/9/2027, and Anna's is on 19/2/2024. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "September"}, {"prompt": "Jane's lecture is on 7/9/2024, and Anna's is on 16/7/2030. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "September"}, {"prompt": "A bottle of ketchup lists 5/8/2027 as the date of manufacture and 25/8/2029 as the expiration date. 
In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "London (published 6/5/2030) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 20/11/2030.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "May"}, {"prompt": "London (published 1/4/2024) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 20/5/2029.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "April"}, {"prompt": "Rome (published 1/6/2026) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 28/3/2028. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "A bottle of ketchup lists 7/12/2029 as the date of manufacture and 21/10/2031 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "A bottle of ketchup lists 10/3/2025 as the date of manufacture and 16/4/2025 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "London (published 5/2/2029) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. 
The company behind the technology plans to have it operational by 29/11/2031.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "Paris (published 1/8/2024) - A Paris-based travel company has announced the world's first commercial time travel tourism package, set to become available on 16/4/2026. The package includes a trip back to the 1960s to witness the rise of rock and roll, a visit to ancient Egypt during the reign of Cleopatra, and a journey to the future to witness life on a distant planet.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Given the list of dates:\n(1) 4/12/2029\n(2) 15/2/2028\n)(3) 16/9/2031\n\nIn which month does date #1 fall? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "A bottle of ketchup lists 1/10/2025 as the date of manufacture and 21/5/2029 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Rome (published 8/1/2025) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 27/7/2026. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "The launch was scheduled for 6/1/2025 but was postponed to 27/9/2027. In which month was the launch originally scheduled? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "John's appointment is on 6/10/2026 and his follow-up is on 30/12/2030. In what month is the first appointment? 
Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Amsterdam (published 3/8/2027) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 31/3/2029, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Paris (published 8/1/2028) - A Paris-based travel company has announced the world's first commercial time travel tourism package, set to become available on 20/3/2028. The package includes a trip back to the 1960s to witness the rise of rock and roll, a visit to ancient Egypt during the reign of Cleopatra, and a journey to the future to witness life on a distant planet.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "The launch was scheduled for 4/7/2030 but was postponed to 25/6/2031. In which month was the launch originally scheduled? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "Rome (published 8/2/2027) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 13/5/2031. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "John's appointment is on 2/10/2024 and his follow-up is on 29/11/2027. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Paris (published 5/10/2028) - A Paris-based travel company has announced the world's first commercial time travel tourism package, set to become available on 29/8/2029. 
The package includes a trip back to the 1960s to witness the rise of rock and roll, a visit to ancient Egypt during the reign of Cleopatra, and a journey to the future to witness life on a distant planet.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Rome (published 2/7/2025) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 16/2/2029. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "Given the list of dates:\n(1) 7/3/2029\n(2) 26/2/2029\n)(3) 24/1/2031\n\nIn which month does date #1 fall? Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "Amsterdam (published 6/7/2027) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 26/7/2028, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "A bottle of ketchup lists 5/10/2025 as the date of manufacture and 24/6/2027 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "London (published 12/11/2024) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 20/12/2024.\n\nWhat month was this news blurb published in? 
Please reply just with the name of the month and nothing else.", "ideal": "November"}, {"prompt": "Jane's lecture is on 10/1/2024, and Anna's is on 25/8/2029. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "Jane's lecture is on 2/12/2024, and Anna's is on 28/9/2030. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "John's appointment is on 9/2/2028 and his follow-up is on 26/2/2029. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "Amsterdam (published 7/5/2027) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 19/7/2029, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "May"}, {"prompt": "Jane's lecture is on 10/4/2026, and Anna's is on 29/11/2027. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "April"}, {"prompt": "London (published 9/8/2026) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 29/11/2026.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Given the list of dates:\n(1) 2/8/2024\n(2) 16/8/2028\n)(3) 18/6/2028\n\nIn which month does date #1 fall? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Jane's lecture is on 12/8/2029, and Anna's is on 26/1/2030. In which month is Jane's lecture? 
Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Given the list of dates:\n(1) 21/2/2027\n(2) 6/12/2030\n)(3) 30/11/2031\n\nIn which month does date #2 fall? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "Amsterdam (published 1/11/2025) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 20/8/2028, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "November"}, {"prompt": "Given the list of dates:\n(1) 18/8/2029\n(2) 11/2/2028\n)(3) 19/8/2028\n\nIn which month does date #2 fall? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "A bottle of ketchup lists 7/10/2028 as the date of manufacture and 29/5/2029 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "London (published 4/10/2029) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 15/4/2030.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Rome (published 11/1/2029) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 24/2/2029. \n\nWhat month was this news blurb published in? 
Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "A bottle of ketchup lists 2/10/2028 as the date of manufacture and 20/3/2030 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "John's appointment is on 10/8/2024 and his follow-up is on 23/7/2031. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "Jane's lecture is on 9/1/2025, and Anna's is on 30/4/2029. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "Rome (published 2/5/2024) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 25/11/2028. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "May"}, {"prompt": "Rome (published 10/5/2025) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 15/9/2028. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "May"}, {"prompt": "John's appointment is on 2/9/2028 and his follow-up is on 26/2/2029. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "September"}, {"prompt": "Jane's lecture is on 5/11/2027, and Anna's is on 28/1/2028. In which month is Jane's lecture? 
Please reply just with the name of the month and nothing else.", "ideal": "November"}, {"prompt": "Amsterdam (published 11/2/2027) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 24/12/2027, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "Given the list of dates:\n(1) 24/12/2024\n(2) 10/6/2028\n)(3) 27/2/2030\n\nIn which month does date #2 fall? Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "Rome (published 3/5/2029) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 20/4/2030. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "May"}, {"prompt": "Rome (published 2/12/2030) - The world's largest man-made structure, the \"Colosseum Tower\", was completed today after 10 years of construction. Standing at 1,000 meters tall, it offers breathtaking views of the city, set to open to the public on 15/4/2031. \n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "A bottle of ketchup lists 5/2/2028 as the date of manufacture and 18/6/2030 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "February"}, {"prompt": "London (published 5/9/2028) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. 
The company behind the technology plans to have it operational by 26/7/2029.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "September"}, {"prompt": "London (published 2/9/2026) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 18/11/2027.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "September"}, {"prompt": "London (published 11/3/2028) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 21/8/2028.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "Amsterdam (published 1/10/2027) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 26/12/2028, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "John's appointment is on 5/10/2030 and his follow-up is on 23/3/2031. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Given the list of dates:\n(1) 10/7/2029\n(2) 25/2/2026\n)(3) 30/12/2030\n\nIn which month does date #1 fall? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "A bottle of ketchup lists 2/11/2027 as the date of manufacture and 29/12/2029 as the expiration date. In which month was the ketchup manufactured? 
Please reply just with the name of the month and nothing else.", "ideal": "November"}, {"prompt": "John's appointment is on 5/12/2029 and his follow-up is on 18/11/2030. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "Paris (published 6/10/2026) - A Paris-based travel company has announced the world's first commercial time travel tourism package, set to become available on 23/2/2031. The package includes a trip back to the 1960s to witness the rise of rock and roll, a visit to ancient Egypt during the reign of Cleopatra, and a journey to the future to witness life on a distant planet.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Amsterdam (published 4/6/2025) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 25/5/2026, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "London (published 11/7/2024) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 17/12/2024.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "Amsterdam (published 2/3/2030) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 17/7/2031, pending regulatory approval.\n\nWhat month was this news blurb published in? 
Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "London (published 5/8/2030) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. The company behind the technology plans to have it operational by 20/9/2030.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "August"}, {"prompt": "John's appointment is on 6/12/2030 and his follow-up is on 19/12/2030. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "December"}, {"prompt": "Jane's lecture is on 1/3/2030, and Anna's is on 22/12/2028. In which month is Jane's lecture? Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "A bottle of ketchup lists 10/1/2028 as the date of manufacture and 26/8/2029 as the expiration date. In which month was the ketchup manufactured? Please reply just with the name of the month and nothing else.", "ideal": "January"}, {"prompt": "John's appointment is on 12/10/2028 and his follow-up is on 24/8/2031. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "October"}, {"prompt": "Amsterdam (published 12/3/2025) - The world's first flying car, the \"SkyHawk\", was unveiled today by Dutch manufacturer, SkyDrive, and is set to go on sale to the public on 14/2/2027, pending regulatory approval.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "March"}, {"prompt": "London (published 8/6/2025) - A new high-speed transportation system, the \"Underloop\", was successfully tested yesterday, reaching speeds of over 600 kilometers per hour. 
The company behind the technology plans to have it operational by 16/11/2027.\n\nWhat month was this news blurb published in? Please reply just with the name of the month and nothing else.", "ideal": "June"}, {"prompt": "John's appointment is on 1/9/2024 and his follow-up is on 26/2/2030. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "September"}, {"prompt": "John's appointment is on 2/7/2025 and his follow-up is on 20/11/2025. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "The launch was scheduled for 8/7/2027 but was postponed to 19/5/2029. In which month was the launch originally scheduled? Please reply just with the name of the month and nothing else.", "ideal": "July"}, {"prompt": "John's appointment is on 3/2/2026 and his follow-up is on 26/2/2027. In what month is the first appointment? Please reply just with the name of the month and nothing else.", "ideal": "February"}], "columns": [{"key": "prompt", "header": "Prompt"}, {"key": "ideal", "header": "Ideal"}]}, "position": {"x": -16, "y": 160}, "selected": false, "positionAbsolute": {"x": -16, "y": 160}, "dragging": false}], "edges": [{"source": "prompt-european-date-format-challenge", "sourceHandle": "prompt", "target": "eval-european-date-format-challenge", "targetHandle": "responseBatch", "interactionWidth": 100, "markerEnd": {"type": "arrow", "width": "22px", "height": "22px"}, "id": "reactflow__edge-prompt-1686756357355prompt-eval-1686756357355responseBatch"}, {"source": "prompt-european-date-format-challenge", "sourceHandle": "prompt", "target": "inspect-european-date-format-challenge", "targetHandle": "input", "interactionWidth": 100, "markerEnd": {"type": "arrow", "width": "22px", "height": "22px"}, "id": "reactflow__edge-prompt-1686756357355prompt-inspect-1686756357355input"}, {"source": "eval-european-date-format-challenge", "sourceHandle": 
"output", "target": "vis-european-date-format-challenge", "targetHandle": "input", "interactionWidth": 100, "markerEnd": {"type": "arrow", "width": "22px", "height": "22px"}, "id": "reactflow__edge-eval-1686756357355output-vis-1686756357355input"}, {"source": "table-european-date-format-challenge", "sourceHandle": "Prompt", "target": "prompt-european-date-format-challenge", "targetHandle": "prompt", "interactionWidth": 100, "markerEnd": {"type": "arrow", "width": "22px", "height": "22px"}, "id": "reactflow__edge-table-1686756385002Prompt-prompt-1686756357355prompt"}], "viewport": {"x": 144, "y": 37, "zoom": 1}}, "cache": {"eval-1686756357355.json": {}, "inspect-1686756357355.json": {}, "prompt-1686756357355.json": {}, "table-1686756385002.json": {}, "vis-1686756357355.json": {}}}