Files
agent-framework/python/packages/lab/lightning/samples/data/math/test.jsonl
T
Yuge Zhang b8df0cd03f Python: Add Agent Framework Lab Lightning package with RL training examples (#937)
* add math agent

* .

* update

* update debug mode

* add tau2 training

* .

* .

* .

* .

* add tests

* .

* revert observability

* update readme

* fix task serialization issue

* fix exception

* add inline docs

* update readme

* update pyproject toml

* minor fix

* update and use git lfs

* update

* update ignore file to use lab specific

* fix type

* update depedency

---------

Co-authored-by: Eric Zhu <ekzhu@users.noreply.github.com>
2025-09-30 01:18:49 +00:00

21 lines
8.9 KiB
JSON

{"id": "svamp__chal-551", "question": "Robin has some packages of gum. There are 7 pieces in each package. Robin has 6 extra pieces of gum. In all the number of pieces of gums robin has is 41. How many packages does Robin have?", "chain": "<gadget id=\"calculator\">41 - 6</gadget>\n<output>35</output>\n\n<gadget id=\"calculator\">35 / 7</gadget>\n<output>5</output>\n\n<result>5</result>", "result": "5", "source": "calc"}
{"id": "ape210k__00027150", "question": "2 meters of floral cloth, how much rice is left after 80% is used", "chain": "<gadget id=\"calculator\">80 / 100</gadget>\n<output>4/5 = around 0.8</output>\n\n<gadget id=\"calculator\">1 - (4/5)</gadget>\n<output>1/5 = around 0.2</output>\n\n<gadget id=\"calculator\">2 * (1/5)</gadget>\n<output>2/5 = around 0.4</output>\n\n<result>2/5 = around 0.4</result>", "result": "2/5", "source": "calc"}
{"id": "ape210k__00287396", "question": "There are two schools, A and B. School A has 525 students, and school B has 50 fewer students than school A. How many students are there in school B?", "chain": "<gadget id=\"calculator\">525 * 2</gadget>\n<output>1_050</output>\n\n<gadget id=\"calculator\">1_050 - 50</gadget>\n<output>1_000</output>\n\n<result>1_000</result>", "result": "1_000", "source": "calc"}
{"id": "gsm8k__xtQ5d23fzgEAhUdB", "question": "If Mark weighs 150 pounds and Susan weighs 20 pounds less than Mark. And their friend Bob weighs twice as much as Susan. What is the average weight of the 3 friends?", "chain": "Susan weighs 150 pounds - 20 pounds = \n<gadget id=\"calculator\">150-20</gadget>\n<output>130</output>\n130 pounds.\nBob weighs 2 * 130 pounds = \n<gadget id=\"calculator\">2*130</gadget>\n<output>260</output>\n260 pounds.\nThe friends total weight is 150 + 130 + 260 pounds = \n<gadget id=\"calculator\">150+130+260</gadget>\n<output>540</output>\n540 pounds.\nThe friends' average weight is 540 pounds / 3 = \n<gadget id=\"calculator\">540/3</gadget>\n<output>180</output>\n180 pounds.\n\n<result>180</result>", "result": "180", "source": "calc"}
{"id": "svamp__chal-741", "question": "He had a total of 40 saltwater animals in different aquariums. Each aquarium has 2 animals in it. How many aquariums did he have?", "chain": "<gadget id=\"calculator\">40 / 2</gadget>\n<output>20</output>\n\n<result>20</result>", "result": "20", "source": "calc"}
{"id": "asdiv_a__nluds-0023", "question": "You have collected 7 crickets. How many more crickets do you need to collect to have 11 crickets?", "chain": "<gadget id=\"calculator\">11 - 7</gadget>\n<output>4</output>\n\n<result>4</result>", "result": "4", "source": "calc"}
{"id": "gsm8k__0oOjz5Ub66DF4inZ", "question": "There are 6 trees in Chris's yard. Ferdinand has half the number of trees that Chris has. Harry has 5 more than twice the number of trees that Ferdinand has. How many more trees are in Harry's yard than Ferdinand's yard?", "chain": "Ferdinand:6/2=\n<gadget id=\"calculator\">6/2</gadget>\n<output>3</output>\n3 trees.\nHarry:5+2(3)=5+6=11 trees\n11-3=\n<gadget id=\"calculator\">11-3</gadget>\n<output>8</output>\n8 trees.\n\n<result>8</result>", "result": "8", "source": "calc"}
{"id": "ape210k__00565195", "question": "During the May 1st period, Xiaoqiang\u2019s family went on a trip to other places. The planned consumption was 2,000 yuan, but the actual consumption was 1,800 yuan. How much less was the actual consumption than the plan?", "chain": "<gadget id=\"calculator\">2_000 - 1_800</gadget>\n<output>200</output>\n\n<gadget id=\"calculator\">200 / 2_000</gadget>\n<output>1/10 = around 0.1</output>\n\n<result>1/10 = around 0.1</result>", "result": "1/10", "source": "calc"}
{"id": "mawps__E0wRRdRDwTmdqH2u", "question": "Milton had 238 peach. William clasped some peach. Now Milton has 51 peach. How many did William claspeds?", "chain": "<gadget id=\"calculator\">238 - 51</gadget>\n<output>187</output>\n\n<result>187</result>", "result": "187", "source": "calc"}
{"id": "asdiv_a__nluds-0318", "question": "The map led them through the forest and into a cave. To open the cave doors, they need to put weights on the switch. If the switch already has 234 lbs. of weights and the total needed is 712 lbs.,, how much more weight to they need to add?", "chain": "<gadget id=\"calculator\">712 - 234</gadget>\n<output>478</output>\n\n<result>478</result>", "result": "478", "source": "calc"}
{"id": "ape210k__00965281", "question": "The annual interest rate of the five-year national debt is 2.75%. If a person buys a national debt of 20,000 yuan, what is the total amount of principal and interest after maturity?", "chain": "<gadget id=\"calculator\">2.75 / 100</gadget>\n<output>0.0275</output>\n\n<gadget id=\"calculator\">20_000 * 0.0275 * 5</gadget>\n<output>2_750</output>\n\n<gadget id=\"calculator\">20_000 + 2_750</gadget>\n<output>22_750</output>\n\n<result>22_750</result>", "result": "22_750", "source": "calc"}
{"id": "svamp__chal-289", "question": "Jack received 6 emails in the morning and 8 emails in the afternoon. How many more emails did Jack receive in the afternoon than in the morning?", "chain": "<gadget id=\"calculator\">8 - 6</gadget>\n<output>2</output>\n\n<result>2</result>", "result": "2", "source": "calc"}
{"id": "ape210k__00829979", "question": "The fifth grade students participate in the big break exercise, and 12 people or 18 people can be divided into a row. If the number of students is less than 200, how many students can participate in the big break exercise this time?", "chain": "<gadget id=\"calculator\">36 * 5</gadget>\n<output>180</output>\n\n<result>180</result>", "result": "180", "source": "calc"}
{"id": "ape210k__00909867", "question": "A and B process 1200 parts at the same time, and the plan is to complete it in 6 hours. A processes 80 parts per hour. To complete the work on time, how many parts does B need to process per hour? (column equations to solve problems)", "chain": "<gadget id=\"calculator\">80 * 6</gadget>\n<output>480</output>\n\n<gadget id=\"calculator\">1_200 - 480</gadget>\n<output>720</output>\n\n<gadget id=\"calculator\">720 / 6</gadget>\n<output>120</output>\n\n<result>120</result>", "result": "120", "source": "calc"}
{"id": "svamp__chal-972", "question": "A mailman has to give 4 pieces of junk mail to each house in each of the 16 blocks. If there are 17 houses in each block, how many pieces of junk mail should he give in total?", "chain": "<gadget id=\"calculator\">4 * 17</gadget>\n<output>68</output>\n\n<gadget id=\"calculator\">68 * 16</gadget>\n<output>1_088</output>\n\n<result>1088</result>", "result": "1_088", "source": "calc"}
{"id": "aqua_rat__j7vMuYEEajqH6GTH", "question": "5 horses are in a race. Mr.Jain selects two of horses at random and bets on them. The probability that he selected the winning horse is Choose the correct choice: A) 1/5 B) 2/5 C) 3/5 D) 4/5 E) 6/5", "chain": "There are 5 horses. Probability of winning for each horse = 1/5. Probability of winning with 2 selected horses= (1/5)+(1/5)= 2/5. Answer is 2/5. ANSWER:2/5\n<result>B</result>", "result": "B", "source": "calc"}
{"id": "asdiv_a__nluds-0263", "question": "Feeling good about what he did, Mr. Anderson decided to continue giving to others. He went around the city and gave clothes to homeless people. If he gave 589 shirts and 345 trousers,, how many pieces of clothing did he gave out in total?", "chain": "<gadget id=\"calculator\">589 + 345</gadget>\n<output>934</output>\n\n<result>934</result>", "result": "934", "source": "calc"}
{"id": "svamp__chal-968", "question": "Mary is baking a cake. The recipe calls for 10 cups of flour 2 cups of sugar and 80 cups of salt. She already put in 7 cups of flour. How many more cups of flour than cups of sugar does she need to add now?", "chain": "<gadget id=\"calculator\">10 - 7</gadget>\n<output>3</output>\n\n<gadget id=\"calculator\">3 - 2</gadget>\n<output>1</output>\n\n<result>1</result>", "result": "1", "source": "calc"}
{"id": "gsm8k__aIzJoU5IRgriERup", "question": "A tub of ice cream costing $13 is now sold at $11. A packet of milk was sold at a discount of $0.5. How much will you save if you buy 2 tubs of ice cream and 4 packets of milk?", "chain": "The discount for each tub of ice cream is $13 - $11 = $\n<gadget id=\"calculator\">13-11</gadget>\n<output>2</output>\n2.\nSo the discount for 2 tubs of ice cream is $2 x 2 = $\n<gadget id=\"calculator\">2*2</gadget>\n<output>4</output>\n4.\nThe total discount for 4 packets of milk is $0.5 x 4 = $\n<gadget id=\"calculator\">0.5*4</gadget>\n<output>2</output>\n2.\nYou will save $4 + $2 = $6 for 2 tubs of ice cream and 4 packets of milk.\n\n<result>6</result>", "result": "6", "source": "calc"}
{"id": "ape210k__00623575", "question": "In the art group, boys are girls (4/5), how much less boys than girls.", "chain": "<gadget id=\"calculator\">4 / 5</gadget>\n<output>4/5 = around 0.8</output>\n\n<gadget id=\"calculator\">1 - (4/5)</gadget>\n<output>1/5 = around 0.2</output>\n\n<gadget id=\"calculator\">(1/5) / 1</gadget>\n<output>1/5 = around 0.2</output>\n\n<result>1/5 = around 0.2</result>", "result": "1/5", "source": "calc"}