ChainForge/chainforge/oaievals/medmcqa.cforge

1 line
77 KiB
Plaintext
Raw Permalink Normal View History

TypeScript backend, HuggingFace models, JavaScript evaluators, Comment Nodes, and more (#81) * Beginning to convert Python backend to Typescript * Change all fetch() calls to fetch_from_backend switcher * wip converting query.py to query.ts * wip started utils.js conversion. Tested that OpenAI API call works * more progress on converting utils.py to Typescript * jest tests for query, utils, template.ts. Confirmed PromptPipeline works. * wip converting queryLLM in flask_app to TS * Tested queryLLM and StorageCache compressed saving/loading * wip execute() in backend.ts * Added execute() and tested w concrete func. Need to test eval() * Added craco for optional webpack config. Config'd for TypeScript with Node.js packages browserify'd * Execute JS code on iframe sandbox * Tested and working JS Evaluator execution. * wip swapping backends * Tested TypeScript backendgit status! :) woot * Added fetchEnvironAPIKeys to Flask server to fetch os.environ keys when running locally * Route Anthropic calls through Flask when running locally * Added info button to Eval nodes. Rebuilt react * Edits to info modal on Eval node * Remove/error out on Python eval nodes when not running locally. * Check browser compat and display error if not supported * Changed all example flows to use JS. Bug fix in query.ts * Refactored to LLMProvider to streamline model additions * Added HuggingFace models API * Added back Dalai call support, routing through Flask * Remove flask app calls and socketio server that are no longer used * Added Comment Nodes. Rebuilt react. * Fix PaLM temp=0 build, update package vers and rebuild react
2023-06-30 15:11:20 -04:00
{"flow": {"nodes": [{"width": 312, "height": 311, "id": "prompt-medmcqa", "type": "prompt", "data": {"prompt": "{prompt}", "n": 1, "llms": [{"key": "aa3c0f03-22bd-416e-af4d-4bf5c4278c99", "settings": {"system_msg": "You are a highly intelligent doctor who answers the following multiple choice question correctly.\nOnly write the answer down.", "temperature": 1, "functions": [], "function_call": "", "top_p": 1, "stop": [], "presence_penalty": 0, "frequency_penalty": 0}, "name": "GPT3.5", "emoji": "\ud83d\ude42", "model": "gpt-3.5-turbo", "base_model": "gpt-3.5-turbo", "temp": 1, "formData": {"shortname": "GPT3.5", "model": "gpt-3.5-turbo", "system_msg": "You are a highly intelligent doctor who answers the following multiple choice question correctly.\nOnly write the answer down.", "temperature": 1, "functions": "", "function_call": "", "top_p": 1, "stop": "", "presence_penalty": 0, "frequency_penalty": 0}}]}, "position": {"x": 448, "y": 224}, "selected": false, "positionAbsolute": {"x": 448, "y": 224}, "dragging": false}, {"width": 333, "height": 182, "id": "eval-medmcqa", "type": "evaluator", "data": {"code": "function evaluate(response) {\n\tlet ideal = response.meta['Ideal'];\n\treturn response.text.startsWith(ideal);\n}", "language": "javascript"}, "position": {"x": 820, "y": 150}, "positionAbsolute": {"x": 820, "y": 150}}, {"width": 228, "height": 196, "id": "vis-medmcqa", "type": "vis", "data": {"input": "eval-medmcqa"}, "position": {"x": 1200, "y": 250}, "positionAbsolute": {"x": 1200, "y": 250}}, {"width": 302, "height": 260, "id": "inspect-medmcqa", "type": "inspect", "data": {"input": "prompt-medmcqa"}, "position": {"x": 820, "y": 400}, "positionAbsolute": {"x": 820, "y": 400}}, {"width": 423, "height": 417, "id": "table-medmcqa", "type": "table", "data": {"rows": [{"prompt": "Subject: Anatomy\n\nChronic urethral obstruction due to benign prismatic hyperplasia can lead to the following change in kidney parenchyma\n\na) Hyperplasia\nb) Hyperophy\nc) Atrophy\nd) Dyplasia", "ideal": "c) Atrophy"}, {"prompt": "Subject: Biochemistry\n\nWhich vitamin is supplied from only animal source:\n\na) Vitamin C\nb) Vitamin B7\nc) Vitamin B12\nd) Vitamin D", "ideal": "c) Vitamin B12"}, {"prompt": "Subject: Surgery\n\nAll of the following are surgical options for morbid obesity except -\n\na) Adjustable gastric banding\nb) Biliopancreatic diversion\nc) Duodenal Switch\nd) Roux en Y Duodenal By pass", "ideal": "d) Roux en Y Duodenal By pass"}, {"prompt": "Subject: Ophthalmology\n\nFollowing endaerectomy on the right common carotid, a patient is found to be blind in the right eye. It is appears that a small thrombus embolized during surgery and lodged in the aery supplying the optic nerve. Which aery would be blocked?\n\na) Central aery of the retina\nb) Infraorbital aery\nc) Lacrimal aery\nd) Nasociliary aretry", "ideal": "a) Central aery of the retina"}, {"prompt": "Subject: Physiology\n\nGrowth hormone has its effect on growth through?\n\na) Directly\nb) IG1-1\nc) Thyroxine\nd) Intranuclear receptors", "ideal": "b) IG1-1"}, {"prompt": "Subject: Social & Preventive Medicine\n\nScrub typhus is transmitted by: September 2004\n\na) Louse\nb) Tick\nc) Mite\nd) Milk", "ideal": "c) Mite"}, {"prompt": "Subject: Gynaecology & Obstetrics\n\nAbnormal vascular patterns seen with colposcopy in case of cervical intraepithelial neoplasia are all except\n\na) Punctation\nb) Mosaicism\nc) Satellite lesions\nd) Atypical vessels", "ideal": "c) Satellite lesions"}, {"prompt": "Subject: Surgery\n\nPer rectum examination is not a useful test for diagnosis of\n\na) Anal fissure\nb) Hemorrhoid\nc) Pilonidal sinus\nd) Rectal ulcer", "ideal": "c) Pilonidal sinus"}, {"prompt": "Subject: Anaesthesia\n\nCharacteristics of Remifentanyl \u2013 a) Metabolised by plasma esteraseb) Short half lifec) More potent than Alfentanyld) Dose reduced in hepatic and renal diseasee) Duration of action more than Alfentanyl\n\na) ab\nb) bc\nc) abc\nd) bcd", "ideal": "c) abc"}, {"prompt": "Subject: Psychiatry\n\nHypomimia is ?\n\na) Decreased ability to copy\nb) Decreased e