One-Shot Model Querying

Summary: The foundation model is accessed in a single instance to generate all necessary steps for the plan.

Context: When users interact with the agent for specific goals, the included foundation model is queried for plan generation.

Problem: How can the agent generate the steps for a plan efficiently?

Forces:

Efficiency. For certain pressing tasks, the agent should be able to conduct planning and respond in a short amount of time.
Overhead. Users need to pay for each interaction with commercial foundation models.

Solution: Fig. 1 illustrates interactions between user and agent within one-shot model query. After a user specifies goals and constraints in one prompt, the agent will query the incorporated foundation model to generate a corresponding response (e.g. plan for action). The foundation model does not require multiple interactions to comprehend the context and requirements. In this manner, the agent can devise a multi-step plan to achieve a broad goal, and provide a holistic explanation for this plan without delving into detailed reasoning steps.

Benefits:

Efficiency. The agent can generate a plan to achieve users’ goals by querying the underlying foundation model only once, which saves consumed time.
Cost-efficiency. Users’ expenses can be reduced since the foundation model is queried for one time.
Simplicity. One-shot model querying can satisfy the tasks that do not require complex action plans.

Drawbacks:

Oversimplification. For complex tasks, one-shot model querying may not be able to fully capture all requirements at one time, hence oversimplifying the tasks and cannot return a correct response.
Lack of explainability. One-shot model querying may suffer the lack of explainability as the incorporated foundation model is queried only once, which may not provide detailed reasoning steps for plan generation.
Size of the context window. The response quality may be constrained considering the foundation models’ current capability of handling long conversational contexts and the token limits.

Known uses: One-shot model querying can be considered configuration or use by default when a user is leveraging a foundation model, while CoT and Zero-shot-CoT both exemplify this pattern [1, 2].

Related patterns:

Incremental model querying. Incremental model querying can be regarded an alternative of one-shot model querying with iteration.
Single-path plan generator. One-shot model querying enables the generation of single-path plans by only querying the foundation model for one time.
Multimodal guardrails. Multimodal guardrails serve as an intermediate layer, managing the inputs and outputs of model querying.

References:

[1] L. Wang, C. Ma, X. Feng, Z. Zhang, H. Yang, J. Zhang, Z. Chen, J. Tang, X. Chen, Y. Lin et al., “A survey on large language model based autonomous agents,” Frontiers of Computer Science, vol. 18, no. 6, pp. 1–26, 2024.
[2] L. Wang, W. Xu, Y. Lan, Z. Hu, Y. Lan, R. K.-W. Lee, and E.-P. Lim, “Plan-and-solve prompting: Improving zero-shot chain-of-thought reasoning by large language models,” in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), A. Rogers, J. Boyd-Graber, and N. Okazaki, Eds. Toronto, Canada: Association for Computational Linguistics, Jul. 2023, pp. 2609–2634. [Online]. Available: https://aclanthology.org/2023.acl-long.147