• Communist@lemmy.frozeninferno.xyz
    1 day ago

    The calculator does not tell them if they’re getting closer? This isn’t how anything works. And no, I can’t say I’m very interested in whether or not the LLM has access to Python or a calculator. As long as it completes the task, that doesn’t matter.

    • zbyte64@awful.systems
      1 day ago

      If you are not interested in how it completes the task then you are not an authority on how it works.

      • Communist@lemmy.frozeninferno.xyz
        22 hours ago

        I’m academically interested. What I mean when I say I’m not interested is that I just don’t see the significance when we’re talking about whether it’s capable of the task.

        • zbyte64@awful.systems
          21 hours ago

          How are you able to understand its capability without understanding what tools it is capable of manipulating to effect?

          • Communist@lemmy.frozeninferno.xyz
            20 hours ago

            You aren’t, and that’s exactly what I’m saying: it’s capable of doing these things with tools, therefore it’s capable of doing these things.

            • zbyte64@awful.systems
              16 hours ago

              So why are you allergic to people talking about the quality of the tools with regard to capability?

                • zbyte64@awful.systems
                  16 hours ago

                  You are the one collapsing tool use into a binary when there are varying degrees of competency and hand-holding.

                  • Communist@lemmy.frozeninferno.xyz
                    14 hours ago

                    I am not. You inaccurately said that the math olympiad was not bested by LLMs because they had a tool that told them if they were close but incorrect and could just try an infinite number of times. This is incorrect; they had a number of tries with Python. This just isn’t a true statement. I think them besting it with the use of Python is equally significant and still counts as them besting it, and saying they can’t do math work is absurd.