Dataset: LLM Token Usage in Everyday Office Tasks

This dataset provides realistic token usage estimates for 64 common AI tasks across different categories common in enterprise office environments: Communication, Coding, Analysis, Planning, Document Processing, and Multimodal tasks (Vision, Audio, Mixed).

About This Dataset

The dataset compares token consumption between standard models (like GPT-4o) and reasoning models (like OpenAI o1). Reasoning models use additional hidden tokens to “think” through problems step-by-step before generating responses, leading to more accurate and reliable results.

This dataset has been compiled from real-world use cases in our enterprise AI implementation projects, then validated and extended with data from authoritative sources listed below. The token estimates reflect actual production workloads and can be used for modeling theoretical scenarios.

Key Insights:

  • Simple tasks (e.g., “Hello World”) use ~300-500 additional tokens for reasoning
  • Complex tasks (math, debugging, logic) benefit most: reasoning models use 10-20x more tokens but deliver significantly better accuracy
  • Multimodal tasks (images, audio) have high base costs before any reasoning
  • Audio is extremely token-dense: ~1,000-1,200 tokens per minute

Token Usage Dataset

The complete dataset is available for download in CSV file format.

CategoryTaskDescriptionInput TokensOutput Tokens (Normal)Output Tokens (Reasoning)ImagesAudio (min)
CommunicationDrafting a Short EmailWrite a sick leave email to boss5010045000
CommunicationPolite RejectionDecline a wedding invitation politely6012050000
CommunicationRewrite for ToneMake paragraph sound more professional15015080000
CommunicationCover Letter GenerationWrite cover letter for Sales role200400180000
CommunicationReplying to a TextGive 3 witty replies to text message406050000
CommunicationGrammar CheckFix grammar in 200-word memo20015090000
CodingHello World ScriptWrite a Python Hello World script302035000
CodingExcel Formula HelpFormula to VLOOKUP column A in Sheet 25050120000
CodingRegex GenerationRegex to validate email address8070150000
CodingSQL Query GenerationSelect top 5 users by spend from tables100100180000
CodingDebugging CodeFind error in 50-line Python function600200450000
CodingCode RefactoringRewrite code to be more efficient700300500000
CodingExplain Error LogWhat does this stack trace mean?350150250000
AnalysisSummarize ArticleSummarize 1000-word article1400200350000
AnalysisExtract DataList all dates and names from text1000200280000
AnalysisMath Word ProblemIf train leaves Chicago at 60mph…10050250000
AnalysisLogic Riddle SolvingSolve two doors two guards riddle12080200000
AnalysisFinancial AnalysisAnalyze CSV rows for trends600200350000
AnalysisSentiment AnalysisIs customer review positive?807060000
PlanningMeal PlanCreate healthy 3-day meal plan150350150000
PlanningTrip ItineraryPlan 3-day weekend in Tokyo200600250000
PlanningBrainstorm Titles10 catchy titles for AI blog10010080000
PlanningWrite a HaikuWrite haiku about the ocean303040000
PlanningGift IdeasGift ideas for dad who likes golf100200100000
PlanningRoleplay ScenarioPretend you are a career coach150450150000
Document ProcessingExtract Invoice DataExtract vendor total date from 2-page invoice900300280000
Document ProcessingSummarize ContractSummarize key terms from 10-page legal contract4000500900000
Document ProcessingResume ScreeningExtract relevant skills from 2-page resume1100400320000
Document ProcessingTranslate DocumentTranslate 5-page Spanish document to English2300700650000
Document ProcessingFormat MarkdownConvert 500-word Word doc to structured Markdown1200600400000
Document ProcessingParse JSON SchemaValidate and fix malformed JSON document500300220000
Document ProcessingCSV to SQLConvert 100-row CSV to INSERT statements1200800450000
Document ProcessingExtract Table DataExtract and restructure table from PDF (500 rows)2800700700000
Document ProcessingCompare VersionsIdentify changes between 2 versions of 5-page doc1700500550000
Document ProcessingReview Code PRReview 200-line code pull request for bugs1300500450000
Document ProcessingGenerate API DocsCreate documentation from 50-function source file1800700550000
Multimodal (Vision)Describe ImageDescribe content of single photograph800150220010
Multimodal (Vision)OCR DocumentExtract text from image of handwritten note850200240010
Multimodal (Vision)Analyze ChartInterpret data trends from bar chart image950350300010
Multimodal (Vision)Screenshot AnalysisDebug UI from application screenshot900350380010
Multimodal (Vision)Identify ObjectsList all objects in image of warehouse800300280010
Multimodal (Vision)Compare ImagesFind differences between 2 product photos1600600450020
Multimodal (Vision)Read WhiteboardTranscribe equation written on whiteboard photo800250260010
Multimodal (Audio)Transcribe AudioTranscribe 5-minute audio interview50008001100005
Multimodal (Audio)Extract Meeting NotesGenerate summary and action items from 30-min meeting30000100058000030
Multimodal (Audio)Identify SpeakerIdentify speaker and emotion in 2-min audio clip2000300480002
Multimodal (Audio)Translate AudioTranscribe and translate 10-min German audio to English10000100021000010
Multimodal (Mixed)Document + ImageMatch text document to related photos15001000550020
Multimodal (Mixed)Video DescriptionDescribe content from 2-min video (frames + audio)23002200950032
Multimodal (Mixed)Multi-Image ComparisonCompare changes across 5 product design mockups4200600950050
Document ProcessingSummarize 50-page Technical ReportSummarize key findings from 50-page technical PDF without images2000012002600000
Document ProcessingExtract KPIs from 50-page Annual ReportExtract revenue profit and growth KPIs from 50-page annual report2200015002800000
Document ProcessingSummarize 100-page Regulatory FilingCreate executive summary of 100-page regulatory filing (10-K/10-Q)4000020005200000
Document ProcessingCompare Two 50-page ContractsIdentify differences and risks between two 50-page legal contracts3800025006000000
Document ProcessingAudit 5k-line Codebase FileReview a 5000-line single code file for bugs and architecture issues3500030007000000
Multimodal (Vision)Process 20-page Scanned PDFOCR and structure 20-page scanned PDF (image-only)16000200030000200
Multimodal (Mixed)Analyze 50-page Report with ChartsSummarize 50-page PDF containing text plus 10 chart images23000200032000100
Multimodal (Audio)Transcribe 60-min PodcastFull transcription of a 60-minute podcast episode60000300075000060
Multimodal (Audio)Summarize 90-min University LectureGenerate structured notes and sections from a 90-minute lecture recording90000400090000090
Multimodal (Audio)Analyze 2-hour Support Call LogExtract issues sentiments and escalation points from 2-hour support call12000050001100000120
Multimodal (Mixed)Describe 10-min Product Demo VideoSummarize features and UX from 10-minute demo video (screen + narration)180003000220001010
Multimodal (Mixed)Summarize 45-min Webinar with SlidesGenerate structured summary from 45-min webinar audio plus 30 slide images750004000800003045
Multimodal (Mixed)Review 60-min Security Camera FootageIdentify key events in 60-min silent security recording48000250052000400

Data Sources & Methodology

This dataset was compiled from the following authoritative sources:

General Tokenizer

  • Tiktokenizer (OpenAI): Standard text tokenization rule: 1 word ≈ 1.3 tokens (1000 tokens ≈ 750 words).

Reasoning Models

Vision Tasks

  • OpenAI Vision Documentation: Images are processed in 512x512 tiles. High-detail mode costs ~85 tokens base + 170 tokens per tile. A standard 1080p image is often ~765-1105 tokens.
  • Cursor IDE Blog (GPT-4o Image Costs): Practical breakdown of image costs: Low detail is fixed at 85 tokens. High detail scales with resolution.

Audio Tasks

  • OpenAI Pricing (Audio): Audio inputs are billed separately from text. GPT-4o Audio input is ~$0.06/min (Realtime).
  • Microsoft Azure AI Blog (Audio Tokens): Audio tokenization is dense. Approximately 1 minute of audio ≈ 1,000 - 1,200 audio tokens for billing purposes.
  • OpenAI GPT-4o Audio Guide: Technical details on how audio is tokenized and processed, confirming the distinction between input audio tokens and output text tokens.

Document Processing

  • Arxiv: Chain of Draft: Discusses token efficiency in reasoning models for drafting and document tasks, highlighting the overhead of “thinking” steps.

General Tasks


Use Cases

  • Cost Estimation: Calculate expected API costs for your AI applications
  • Model Selection: Choose between standard and reasoning models based on task complexity
  • Budgeting: Plan AI infrastructure costs for production workloads
  • Research: Benchmark and compare token efficiency across different task types

Want to see how these tasks translate to real-world workloads? Check out our detailed analysis:

AI Costs by Office Role - We use this dataset to calculate typical daily token consumption for different business roles (Executive Assistant, Recruiter, Financial Analyst, Corporate Counsel, Software Engineer) and reveal what drives AI costs in your organization.


Citation

If you use this dataset in your research or applications, please cite:

onprem.ai Research (2025). Real-World LLM Token Usage Dataset.
Retrieved from https://onprem.ai/en/knowhow/llm-token-usage-dataset/

Last Updated: December 2025 Version: 1.0 License: Creative Commons BY 4.0