feat: lite deep researcher implementation

This commit is contained in:
He Tao
2025-04-07 16:25:55 +08:00
commit 03798ded08
58 changed files with 4242 additions and 0 deletions


@@ -0,0 +1,45 @@
## Bitcoin Price Fluctuations in the Recent 3 Months
### Executive Summary
This report analyzes Bitcoin price fluctuations over the past three months, based on available search results. The analysis considers market sentiment, regulatory influences, economic factors, and technical analysis indicators. Due to limitations in accessing and processing raw data, the report relies on summarized findings from various sources.
### Key Findings
* **Trump Administration Policies:** Tariffs imposed in April 2025 impacted Bitcoin, causing it to fall from $109K to $84K.
* **Economic Uncertainty:** General economic uncertainty contributed to Bitcoin falling below $90,000.
* **Market Sentiment:** The Crypto Fear and Greed Index reflects the overall market sentiment, which fluctuates based on news and events.
* **Technical Analysis:** Key support levels sit around $80,400 and $74,000, with resistance levels near $98,500 and $106,000.
### Detailed Analysis
**Influencing Factors:**
* **Regulatory Environment:** The Trump administration's approach to crypto regulation and SEC actions appear to have influenced Bitcoin's price.
* **Market Sentiment:** The Crypto Fear and Greed Index is a key indicator of market sentiment.
* **Trading Volume:** Historical data from Yahoo Finance and Investing.com shows Bitcoin trading volume over the past 3 months.
* **Social Media Sentiment:** Sentiment analysis from platforms like the r/cryptocurrency subreddit and Twitter (X) can provide insights into market perceptions.
* **GBTC Holdings:** Historical price and holdings data for the Grayscale Bitcoin Trust (GBTC) indicate institutional exposure to Bitcoin.
* **Bitcoin Futures:** Historical data for Bitcoin Futures (BTC=F) is available on Yahoo Finance.
* **Google Trends:** Google Trends data indicates the search popularity of "bitcoin" over time. Recent articles suggest a decline in interest in "bitcoin" and "bitcoin price" searches.
**Price Movements:**
* Bitcoin experienced a drop from $109K to $84K following Trump's tariffs on April 2, 2025.
* Bitcoin fell below $90,000 due to economic uncertainty.
* Key support levels to watch are around $80,400 and $74,000, with resistance levels near $98,500 and $106,000.
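As a rough illustration of how such support and resistance levels can be derived, the sketch below computes rolling extremes over a hypothetical series of daily closes. The price figures in the sample are invented for illustration and are not the actual levels cited above.

```python
# Sketch: estimating support/resistance from daily closes via rolling extremes.
# The price series below is illustrative, not real market data.
prices = [84000, 82500, 80400, 81200, 79800, 74000, 76500, 80100,
          83900, 86000, 88700, 90200, 89500, 87300, 85600, 84200]

def rolling_levels(closes, window=5):
    """Return (support, resistance) pairs: the min/max close in each window."""
    levels = []
    for i in range(len(closes) - window + 1):
        chunk = closes[i:i + window]
        levels.append((min(chunk), max(chunk)))
    return levels

levels = rolling_levels(prices)
support = min(lo for lo, _ in levels)       # lowest support seen in the sample
resistance = max(hi for _, hi in levels)    # highest resistance in the sample
```

Real technical analysis uses richer methods (pivot points, volume profiles), but the rolling-extreme idea is the simplest starting point.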
### Conclusions and Recommendations
Based on the available information, Bitcoin's price fluctuations in the last three months have been influenced by a combination of regulatory actions, economic conditions, and market sentiment.
**Recommendations:**
* Monitor regulatory developments and their potential impact on the cryptocurrency market.
* Track economic indicators and assess their influence on investor behavior.
* Analyze market sentiment using tools like the Crypto Fear and Greed Index and social media analysis.
* Consider technical analysis indicators to identify potential support and resistance levels.
**Limitations:**
This report is based on summarized search results and lacks access to raw data for comprehensive analysis. Further investigation with detailed data analysis is recommended for more accurate conclusions.

examples/what_is_llm.md Normal file

@@ -0,0 +1,106 @@
## Report on Large Language Models (LLMs)
This report provides a comprehensive overview of Large Language Models (LLMs), covering their definition, architecture, training, applications, limitations, biases, ethical considerations, and mitigation strategies, based on the provided search results.
### Executive Summary
LLMs are deep learning models that use transformer architecture and are trained on massive datasets. They excel at various Natural Language Processing (NLP) tasks, including text generation, translation, and question answering. However, they also present limitations, biases, and ethical challenges that need to be addressed for responsible development and deployment.
### Key Findings
* **Definition and Architecture**: LLMs are deep learning algorithms that perform NLP tasks using transformer models and are trained on massive datasets. They consist of encoders, decoders, and attention mechanisms, with key components like embedding layers and attention mechanisms.
* **Training Data and Methodologies**: LLMs are trained on datasets like Common Crawl (5.4 trillion tokens) and The Pile (800 GB). Training methodologies include unsupervised pre-training, supervised fine-tuning, and transfer learning.
* **Applications**: LLMs are used in text generation, machine translation, question answering, code generation, text summarization, and sentiment analysis.
* **Performance Benchmarks**: LLM performance is evaluated using metrics like accuracy, precision, recall, F1 score, BLEU, ROUGE, perplexity, and HumanEval (pass@k).
* **Limitations**: LLMs have computational constraints, struggle with complex linguistic elements, lack long-term memory, and can perpetuate biases.
* **Biases**: LLMs exhibit gender, racial, cultural, and socio-economic stereotypes due to biases in their training data.
* **Ethical Considerations**: LLMs raise ethical concerns about misuse, privacy, and accountability.
* **Mitigation Strategies**: Mitigation strategies include data curation, model adjustments, and post-processing techniques.
### Detailed Analysis
#### Definition and Architecture
LLMs are a specific type of generative AI designed for text-based content generation. They leverage deep learning algorithms and transformer models to perform various NLP tasks. A typical LLM architecture includes:
* **Embedding Layer**: Converts input text into numerical embeddings, capturing semantic and syntactic meaning.
* **Attention Mechanism**: Allows the model to focus on relevant parts of the input text.
* **Transformer Models**: A tokenizer converts text into numerical values (tokens), and encoders create meaningful embeddings.
LLMs typically have at least one billion parameters.
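The attention mechanism described above can be sketched in a few lines. The following is a minimal pure-Python implementation of scaled dot-product attention for a single query; the toy vectors are invented for illustration, and production models compute this over batched matrices with learned projections.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(query, keys, values):
    """Scaled dot-product attention for one query vector."""
    d = len(query)
    # Similarity of the query to each key, scaled by sqrt(dimension).
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    weights = softmax(scores)
    # Weighted average of the value vectors.
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

# Toy example: the query matches the first key more closely,
# so the output leans toward the first value vector.
q = [1.0, 0.0]
ks = [[1.0, 0.0], [0.0, 1.0]]
vs = [[10.0, 0.0], [0.0, 10.0]]
out = attention(q, ks, vs)
```

This is exactly the "focus on relevant parts of the input" behavior: keys similar to the query receive larger softmax weights, and their values dominate the output.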
#### Training Data and Methodologies
LLMs require vast amounts of data for effective training. Some key datasets include:
* **Common Crawl**: 5.4 trillion tokens
* **Cosmopedia**: 25 billion tokens
* **The Pile**: 800 GB
Training methodologies include:
* **Unsupervised Pre-training**: Learning general language representations.
* **Supervised Fine-tuning**: Adapting models to specific tasks.
* **Transfer Learning**: Leveraging knowledge gained from one task to improve performance on another.
#### Applications
LLMs have a wide array of applications across various domains:
* **Text Generation**: Creating coherent and contextually relevant text.
* **Machine Translation**: Converting text from one language to another.
* **Question Answering**: Providing answers to questions posed in natural language.
* **Code Generation**: Generating code snippets or complete programs.
* **Text Summarization**: Condensing large amounts of text into shorter summaries.
* **Sentiment Analysis**: Determining the emotional tone or attitude expressed in text.
#### Performance Benchmarks and Evaluation Metrics
Evaluating LLM performance involves using standardized benchmarks and metrics. Key metrics include:
* **Accuracy**: Measures the correctness of the model's outputs.
* **Precision and Recall**: Assess the relevance and completeness of the results.
* **F1 Score**: Provides a balanced measure of precision and recall.
* **BLEU and ROUGE**: Evaluate the quality of machine-translated or summarized text.
* **Perplexity**: Measures the uncertainty of the model in predicting the next word in a sequence.
* **HumanEval (pass@k)**: Assesses code generation performance.
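Two of these metrics are easy to make concrete. Perplexity is the exponential of the average negative log-probability per token, and pass@k is commonly computed with an unbiased combinatorial estimator over n generated samples of which c pass the tests. The sketch below shows both; the example numbers are illustrative.

```python
import math

def perplexity(log_probs):
    """Perplexity from per-token natural-log probabilities."""
    return math.exp(-sum(log_probs) / len(log_probs))

def pass_at_k(n, c, k):
    """Unbiased pass@k estimator: probability that at least one of k
    samples drawn from n generations (c of which are correct) passes."""
    if n - c < k:
        return 1.0  # every draw of k samples must include a correct one
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)

# A model assigning uniform probability 1/4 to each token has perplexity 4.
ppl = perplexity([math.log(0.25)] * 5)

# 3 of 10 generations pass, so pass@1 is 3/10.
p1 = pass_at_k(n=10, c=3, k=1)
```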
#### Limitations, Biases, and Ethical Considerations
LLMs face several limitations:
* **Computational Constraints**: Limited by fixed context-window token limits.
* **Complex Linguistic Elements**: Struggle with nuanced language.
* **Lack of Long-Term Memory**: Difficulty retaining information over extended contexts.
* **Perpetuation of Biases**: Reinforce stereotypes from training data.
Biases in LLMs can manifest as:
* **Gender Stereotypes**: Skewed outputs based on gender.
* **Racial Stereotypes**: Unfair representations of different racial groups.
* **Cultural Stereotypes**: Biased outputs related to specific cultures.
Ethical considerations include:
* **Potential Misuse**: Disinformation and manipulation.
* **Privacy Issues**: Data usage and potential exposure of personal information.
* **Accountability Challenges**: Difficulty in tracing the reasoning processes of LLMs.
#### Mitigation Strategies
Various strategies can be employed to mitigate limitations and biases:
* **Data Curation**: Refining training data to reduce biases.
* **Model Adjustments**: Implementing fairness constraints during training.
* **Post-processing Corrections**: Fine-tuning outputs to reduce biases.
* **Resampling and Augmentation**: Balancing and expanding the training dataset.
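The resampling strategy can be sketched concretely. The snippet below naively balances a toy dataset by oversampling under-represented groups; the field names and data are hypothetical, and real curation pipelines combine this with deduplication and quality filtering.

```python
import random
from collections import Counter

def oversample_balance(examples, key, seed=0):
    """Balance a dataset by oversampling minority groups until every
    group matches the size of the largest one (illustrative only)."""
    rng = random.Random(seed)
    groups = {}
    for ex in examples:
        groups.setdefault(ex[key], []).append(ex)
    target = max(len(members) for members in groups.values())
    balanced = []
    for members in groups.values():
        balanced.extend(members)
        # Sample with replacement to top the group up to the target size.
        balanced.extend(rng.choices(members, k=target - len(members)))
    return balanced

# Hypothetical skewed dataset: 6 examples of group A, 2 of group B.
data = ([{"text": f"a{i}", "group": "A"} for i in range(6)]
        + [{"text": f"b{i}", "group": "B"} for i in range(2)])
balanced = oversample_balance(data, "group")
counts = Counter(ex["group"] for ex in balanced)
```

Oversampling alone can overfit to the duplicated minority examples, which is why it is usually paired with augmentation.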
### Conclusions and Recommendations
LLMs are powerful tools with a wide range of applications, but they are not without limitations and risks. Addressing these challenges requires:
* **Ongoing Research**: Continued investigation into biases, limitations, and mitigation strategies.
* **Ethical Frameworks**: Development of updated ethical guidelines for responsible development and deployment.
* **Collaboration**: Interdisciplinary efforts involving researchers, developers, and policymakers.
* **Data Transparency**: Increased transparency about training data and model development processes.
* **Careful Implementation**: Strategic application of mitigation techniques to avoid unintended performance trade-offs.

examples/what_is_mcp.md Normal file

@@ -0,0 +1,81 @@
# Report: Understanding MCP (Multiple Contexts)
## Executive Summary
This report provides a comprehensive overview of the term "MCP" in various contexts, including Model Context Protocol, Monocalcium Phosphate, and Micro-channel Plate. The report is structured to cover the definitions, applications, and stakeholders involved with each interpretation of MCP. The information is sourced from reliable references such as authoritative websites, industry reports, and expert publications.
## Key Findings
1. **Model Context Protocol (MCP)**
- **Definition**: MCP is an open standard that allows AI models to connect to various applications and data sources using a common language.
- **Applications**: Used in AI and large language models (LLMs) to standardize interactions and enable seamless integration with different software tools.
- **Stakeholders**: Project managers, AI developers, and application providers.
2. **Monocalcium Phosphate (MCP)**
- **Definition**: MCP is a chemical compound used in various industries, including food, agriculture, and construction.
- **Applications**: Used as a leavening agent in baked goods, in animal feed, as a fertilizer, and in the production of emulsion polymers for everyday products.
- **Stakeholders**: Food manufacturers, agricultural companies, and construction material producers.
3. **Micro-channel Plate (MCP)**
- **Definition**: MCP is a high-gain electron multiplier used in scientific and military applications for enhanced detection and imaging.
- **Applications**: Used in night vision devices, electron microscopes, mass spectrometers, and radar systems.
- **Stakeholders**: Scientific researchers, medical imaging professionals, and defense contractors.
## Detailed Analysis
### 1. Model Context Protocol (MCP)
#### Definition
- **Model Context Protocol (MCP)** is an open standard that standardizes how applications provide context information to large language models (LLMs). It acts as a universal plug, enabling AI assistants to interact with different software tools using a common language, eliminating the need for custom integrations for each application.
#### Applications
- **AI and LLMs**: MCP is crucial in the AI and LLM ecosystem, allowing these models to integrate with various applications and data sources seamlessly.
- **Client-Server Connections**: MCP defines a lifecycle for client-server connections, ensuring proper capability negotiation and state management. This enables language models to automatically discover and invoke tools based on their contextual understanding and user prompts.
#### Stakeholders
- **Project Managers and AI Developers**: Responsible for implementing and managing MCP in AI projects.
- **Application Providers**: Integrate MCP into their software tools to ensure compatibility with AI models.
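MCP is built on JSON-RPC 2.0, and tool discovery happens through a `tools/list` exchange. The sketch below constructs a request in that shape and parses a hypothetical response; the `web_search` tool and its schema are invented for illustration, not part of any real server.

```python
import json

# A JSON-RPC 2.0 request asking an MCP server which tools it exposes.
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/list",
}

# A hypothetical server response: one tool with a JSON Schema for its input.
response = {
    "jsonrpc": "2.0",
    "id": 1,
    "result": {
        "tools": [
            {
                "name": "web_search",  # hypothetical tool name
                "description": "Search the web for a query.",
                "inputSchema": {
                    "type": "object",
                    "properties": {"query": {"type": "string"}},
                    "required": ["query"],
                },
            }
        ]
    },
}

# A client would serialize the request onto the transport and, from the
# response, learn which tools the model may invoke.
wire = json.dumps(request)
tool_names = [tool["name"] for tool in response["result"]["tools"]]
```

Because every server describes its tools in this uniform shape, a language model client can discover and invoke them without per-application integration code, which is the interoperability point made above.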
### 2. Monocalcium Phosphate (MCP)
#### Definition
- **Monocalcium Phosphate (MCP)** is a chemical compound with the formula Ca(H2PO4)2. It is used in various forms, including anhydrous (MCP-A) and hydrated (MCP-H).
#### Applications
- **Food Industry**: MCP is used as a leavening agent in baked goods, providing aeration and improving texture.
- **Agriculture**: MCP is used as a fertilizer, providing essential nutrients to plants.
- **Construction**: MCP-based emulsion polymers are used in the production of adhesives, coatings, and other construction materials.
#### Stakeholders
- **Food Manufacturers**: Use MCP in the production of baked goods.
- **Agricultural Companies**: Utilize MCP as a fertilizer.
- **Construction Material Producers**: Incorporate MCP-based emulsion polymers in their products.
### 3. Micro-channel Plate (MCP)
#### Definition
- **Micro-channel Plate (MCP)** is a high-gain electron multiplier used in scientific and military applications. It consists of a thin plate with a honeycomb structure, where each channel acts as an electron multiplier.
#### Applications
- **Scientific Research**: MCPs are used in electron microscopes and mass spectrometers for high-sensitivity detection.
- **Medical Imaging**: MCPs are used in medical imaging systems, providing high sensitivity and rapid response times.
- **Military and Aerospace**: MCPs are critical in radar systems, missile detection, and imaging systems, where precision and reliability are essential.
#### Stakeholders
- **Scientific Researchers**: Use MCPs in advanced research instruments.
- **Medical Imaging Professionals**: Utilize MCPs in medical imaging systems.
- **Defense Contractors**: Integrate MCPs into military and aerospace applications.
## Conclusions and Recommendations
### Conclusions
- **Model Context Protocol (MCP)** is an open standard that facilitates the integration of AI models with various applications, enhancing interoperability and efficiency.
- **Monocalcium Phosphate (MCP)** is a versatile chemical compound with applications in the food, agriculture, and construction industries.
- **Micro-channel Plate (MCP)** is a high-gain electron multiplier used in scientific, medical, and military applications, providing high sensitivity and precision.
### Recommendations
- **For AI and LLM Projects**: Implement MCP to standardize interactions between AI models and applications, reducing the need for custom integrations.
- **For Food and Agriculture Industries**: Consider the use of MCP in the production of baked goods and as a fertilizer to improve product quality and crop yields.
- **For Scientific and Military Applications**: Utilize MCPs in advanced research and imaging systems to achieve high sensitivity and precision.
By understanding the different contexts and applications of MCP, stakeholders can make informed decisions and leverage the benefits of this versatile technology.