
GPT-4 (2023) Review

by 아르카눔 2025. 4. 26.

 

Starting with GPT-4, OpenAI no longer publishes a paper at all; what it releases is called a Technical Report. (link)

 

I can't help thinking the company should be called ClosedAI rather than OpenAI, but here is a brief look at the contents anyway.

 

As everyone knows, the first version of the report came out in 2023; it has since been revised six times and is now at v6.

 

 

The report opens with a comparison against GPT-3.5, shown in Table 1 and Figure 4 below.

 

 

 

 

On academic and professional exams, GPT-4 performs far better than GPT-3.5.

 

When ChatGPT first launched, I tried the free GPT-3.5 version and remember being disappointed because it was worse than I had expected.

 

I remember feeling that it only gradually became usable after GPT-4.

 

 

 

 

As Table 3 shows, GPT-4 can take images as input and process them.

It is a full-fledged multimodal LLM.

 

As shown below, it answers questions about charts, papers, mathematical figures, and pictures alike.

 

 

 

 

 

 

 

What I personally found most interesting is the extremely long few-shot prompt.

 

The model supports an enormous input size.

 

Below is one of the prompts from Appendix A.8, Example few-shot prompts: the Example prompt for a multiple choice exam (AP Art History).

 


ANSWER KEY


Here are the answers for the problems in the exam.


Problem 1. Choose the most likely completion of the following sentence.
Honore Daumier’s Nadar Raising Photography to the Height of Art was done immediately after __.


Choose from the following options: [A] the first photographic exhibition in history [B] the first use of color photography [C] a court decision that determined that photographs could be considered works of art [D] the invention of the zoopraxiscope


Explanation for Problem 1: The painting depicts Nadar, a famous French photographer, raising a camera to the sky, as if to signify the recognition of photography as a legitimate form of art. This recognition would have likely come from a court decision, rather than the other options listed. The first photographic exhibition and the invention of the zoopraxiscope, which was a device used to create moving images, would not have had any bearing on the recognition of photography as art. Similarly, the first use of color photography would not have been a significant enough event to warrant such a celebratory depiction in the painting. Therefore, the most likely explanation for the painting is that it was done in response to a court decision that determined that photographs could be considered works of art.


The answer is therefore [C]


Problem 2. Artists working in New Spain were most influenced by contemporary art in Europe during which of the following periods?


Choose from the following options: [A] Romanticism [B] Renaissance [C] Mannerism [D] Baroque


Explanation for Problem 2: Artists working in New Spain, which is now modern-day Mexico, were most influenced by contemporary art in Europe during the Baroque period. This is because the Baroque period was characterized by ornate and elaborate styles, which were heavily influenced by the Catholic Church and its religious themes. This style was well suited to the art produced in New Spain, which was heavily influenced by the Catholic Church and its religious themes. In contrast, the other options listed were not as influential in New Spain during this time period. The Renaissance and Mannerism were primarily focused on classical themes and styles, which were not as prevalent in New Spain. Similarly, Romanticism was a later movement that was not as influential in New Spain during this time period.


The answer is therefore [D]


Problem 3. Choose the most likely completion of the following sentence. Works like the Sistine Chapel frescoes directly influenced the form and scale of works by __.


Choose from the following options: [A] Gianlorenzo Bernini [B] Giovanni Battista Gaulli [C] Peter Paul Rubens [D] Rachel Ruysch


Explanation for Problem 3: Giovanni Battista Gaulli, also known as Baciccio, was heavily influenced by the Sistine Chapel frescoes in his own works. This is evident in his use of large scale, dramatic compositions and his focus on religious themes, which were similar to those found in the Sistine Chapel frescoes. In contrast, the other options listed were not as directly influenced by the Sistine Chapel frescoes. Gianlorenzo Bernini was a contemporary of Baciccio, but his works were more focused on classical themes and styles, rather than the religious themes found in the Sistine Chapel frescoes. Peter Paul Rubens was a later artist who was more influenced by the Baroque style, rather than the Renaissance style of the Sistine Chapel frescoes. Rachel Ruysch was a Dutch artist who was not known for large-scale religious works, and therefore would not have been directly influenced by the Sistine Chapel frescoes.


The answer is therefore [B]


Problem 4. Choose the most likely completion of the following sentence. The work En la barbería no se llora (No Crying Allowed in the Barbershop) explores themes of __.


Choose from the following options: [A] sexual stereotyping [B] women’s liberation [C] conflict and peace [D] racial discrimination


Explanation for Problem 4: The title of the work, En la barbería no se llora (No Crying Allowed in the Barbershop), suggests that the work explores themes of sexual stereotyping. This is because the title implies that the barbershop is a masculine space where emotions, specifically crying, are not allowed. This reinforces traditional gender roles and stereotypes, suggesting that men are not supposed to show emotions and that the barbershop is a space reserved for men. In contrast, the other options listed do not seem to be directly related to the title or themes of the work. Women’s liberation, conflict and peace, and racial discrimination are not mentioned or implied in the title, and therefore are not likely to be the main themes of the work. Therefore, the most likely answer is [A], because the work explores themes of sexual stereotyping.


The answer is therefore [A]


Problem 5. Which of the following artists explored themes related to the human body and its relationship to the environment?


Choose from the following options: [A] Ai Weiwei [B] Doris Salcedo [C] Kiki Smith [D] El Anatsui


Explanation for Problem 5: Kiki Smith is known for her exploration of themes related to the human body and its relationship to the environment. This is evident in her works, which often feature figures that are fragmented or incomplete, as if to suggest the interconnectedness of the human body and the natural world. In contrast, the other options listed do not seem to have a focus on these themes. Ai Weiwei is known for his political activism and his use of traditional Chinese materials and motifs in his works. Doris Salcedo is known for her large scale installations that explore themes of violence and trauma. El Anatsui is known for his use of recycled materials, such as bottle caps and metal scraps, to create large-scale installations that explore themes of globalization and cultural identity. Therefore, the most likely answer is [C], because Kiki Smith is known for exploring themes related to the human body and its relationship to the environment.


The answer is therefore [C]


Problem 6. <PROBLEM TEXT AND ANSWER CHOICES GO HERE>


Explanation for Problem 4: <MODEL EXPLANATION (t=0.3, n=1, max_tokens=512, stop=’\nThe answer is therefore’) SAMPLED HERE>


The answer is therefore [<MODEL ANSWER CHOICE (t=0.0, n=1, stop=’]’) SAMPLED HERE>]
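
The last two placeholders in the prompt describe a two-stage sampling procedure: the explanation is sampled first (t=0.3, max_tokens=512, stopping at "\nThe answer is therefore"), and then the answer letter is decoded at t=0.0, stopping at "]". Here is a minimal sketch of how I read that procedure. The `complete` callable is a hypothetical stand-in for whatever completion API is used, and its signature, the function name `answer_multiple_choice`, and the choice to use the new problem's own number in the explanation header are all my assumptions, not something the report specifies.

```python
from typing import Callable, Optional, Tuple

# Hypothetical interface: (prompt, temperature, max_tokens, stop) -> generated text.
Complete = Callable[[str, float, Optional[int], str], str]

EXPLANATION_STOP = "\nThe answer is therefore"


def answer_multiple_choice(few_shot_prefix: str, problem_number: int,
                           problem_text: str, complete: Complete) -> Tuple[str, str]:
    """Sample an explanation first, then decode the answer letter greedily."""
    prompt = (
        f"{few_shot_prefix}\n\n"
        f"Problem {problem_number}. {problem_text}\n\n"
        f"Explanation for Problem {problem_number}: "
    )
    # Stage 1: explanation, sampled with the settings shown in the template.
    explanation = complete(prompt, 0.3, 512, EXPLANATION_STOP).strip()
    # Stage 2: append the fixed answer stem and decode the letter at t=0.0.
    prompt += f"{explanation}\n\nThe answer is therefore ["
    choice = complete(prompt, 0.0, None, "]").strip()
    return explanation, choice


def fake_complete(prompt: str, temperature: float,
                  max_tokens: Optional[int], stop: str) -> str:
    # Stand-in for a real model call, for illustration only.
    return "C" if stop == "]" else " A made-up explanation. "


explanation, choice = answer_multiple_choice(
    "ANSWER KEY", 6, "<PROBLEM TEXT AND ANSWER CHOICES GO HERE>", fake_complete)
print(choice)  # C
```

Separating the two calls means the explanation can be sampled with some randomness while the final letter stays deterministic given that explanation.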

 

A Medium blog post (link) raises an interesting point: according to it, GPT-4 uses an MoE (Mixture of Experts) architecture with 110B parameters per expert, for an enormous total of 1.76T parameters.
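
As a quick sanity check on those numbers, the two figures together imply a specific expert count. The snippet below is only arithmetic on the values reported in the blog post; the Technical Report itself does not confirm them, and the variable names are my own.

```python
# Figures as reported in the blog post, not confirmed by the Technical Report.
PARAMS_PER_EXPERT = 110 * 10**9   # 110B
TOTAL_PARAMS = 1760 * 10**9       # 1.76T

# Integer division keeps the arithmetic exact.
implied_experts = TOTAL_PARAMS // PARAMS_PER_EXPERT
print(implied_experts)  # 16
```

In other words, 110B per expert and 1.76T in total are consistent with 16 experts, assuming the total counts only expert parameters.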

 

I suppose there is a good reason MoE models like Mixtral have appeared.
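
For readers unfamiliar with MoE, here is a toy sketch of sparse top-k routing, the general idea behind such models: a gate scores every expert, only the top few are evaluated, and their outputs are mixed with renormalized gate weights. This is a generic illustration and not GPT-4's or Mixtral's actual implementation. The experts are plain scalar functions, the gate logits are passed in directly instead of being computed by a learned router, and all names are hypothetical.

```python
import math
from typing import Callable, List, Sequence

Expert = Callable[[float], float]


def softmax(xs: Sequence[float]) -> List[float]:
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(x: float, gate_logits: Sequence[float],
                experts: Sequence[Expert], top_k: int = 2) -> float:
    """Route x to the top_k experts by gate logit and mix their outputs."""
    # Indices of the top_k highest gate logits.
    top = sorted(range(len(experts)),
                 key=lambda i: gate_logits[i], reverse=True)[:top_k]
    # Renormalize the gate over the selected experts only.
    weights = softmax([gate_logits[i] for i in top])
    # Only the selected experts are evaluated; the rest are skipped.
    return sum(w * experts[i](x) for w, i in zip(weights, top))


# Four toy experts that just scale their input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
output = moe_forward(1.0, [0.0, 5.0, 5.0, -1.0], experts)
print(output)  # 2.5
```

The point of the design is that the parameter count grows with the number of experts while the compute per input grows only with `top_k`.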

 

 

 

 

References:

https://devocean.sk.com/blog/techBoardDetail.do?ID=164627&boardType=techBlog

https://moon-walker.medium.com/gpt-4%EC%9D%98-%EC%84%B8%EB%B6%80-%EC%A0%95%EB%B3%B4%EA%B0%80-%EC%9C%A0%EC%B6%9C%EB%90%98%EB%8B%A4-db00d6a13f6

https://moon-walker.medium.com/compute-budget-%EC%A0%9C%ED%95%9C%EC%9D%84-%EA%B3%A0%EB%A0%A4%ED%95%9C-google%EC%9D%98-%EC%B5%9C%EC%8B%A0-language-model-%EC%97%B0%EA%B5%AC-%EB%8F%99%ED%96%A5-79aaafe3d2c8

https://the-decoder.com/gpt-4-architecture-datasets-costs-and-more-leaked/

https://semianalysis.com/2023/07/10/gpt-4-architecture-infrastructure/

https://www.ikangai.com/the-secrets-of-gpt-4-leaked/

 

 

 

 

 

 
