We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers

Abstract: Numerous recent works aim to enhance the efficacy of Large Language Models (LLMs) through strategic prompting. In particular, the Optimization by PROmpting (OPRO) approach provides state-of-the-art performance by leveraging LLMs as optimizers where the optimization task is to find instructions that maximize the task accuracy. In this paper, we revisit OPRO for automated prompting with relatively small-scale LLMs, such as LLaMa-2 family and Mistral 7B. Our investigation reveals that OPRO shows limited effectiveness in small-scale LLMs, with limited inference capabilities constraining optimization ability. We suggest future automatic prompting engineering to consider both model capabilities and computational costs. Additionally, for small-scale LLMs, we recommend direct instructions that clearly outline objectives and methodologies as robust prompt baselines, ensuring efficient and effective prompt engineering in ongoing research.
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
Journal reference: ACL Findings 2024
Cite as: arXiv:2405.10276 [cs.CL]
  (or arXiv:2405.10276v1 [cs.CL] for this version)

Submission history

From: Tuo Zhang [view email]
[v1] Thu, 16 May 2024 17:33:50 GMT (474kb,D)

Link back to: arXiv, form interface, contact.