Data Exploration & Cleaning
Prepare and understand your data with AI-assisted exploration and quality assurance.
Analyze this dataset for quality issues: identify missing values, outliers, data type inconsistencies, and duplicate records. Provide a summary of data quality issues and prioritized recommendations for cleaning. [Insert data sample or describe dataset structure]
Perform exploratory data analysis on this dataset. Identify key statistics, distributions, correlations between variables, and potential patterns. Suggest which variables are most important for further analysis. [Insert data or dataset description]
I have raw data that needs transformation before analysis. Help me plan the transformation steps: normalization, feature engineering, aggregation, and encoding. Explain how each transformation will improve the data for analysis.
SQL & Database Analysis
Write efficient SQL queries and optimize database performance with AI assistance.
Write a SQL query that: [describe your analysis requirement]. The query should handle [specify edge cases]. Include comments explaining the logic and return [specify desired output format].
Help me optimize this SQL query for performance: [paste your query]. It currently takes [timeframe] to execute. What indexing strategies, JOIN optimizations, or query restructuring would improve speed? Provide the optimized query with explanations.
I need to extract insights from this database schema: [describe tables, columns, relationships]. Help me identify the best way to query this for [specify analysis goal]. Suggest joins, aggregations, and filtering strategies.
Python Data Analysis
Generate and optimize Python code for data manipulation, analysis, and visualization.
Write a Python script using pandas to: [describe your data manipulation task]. The input is [describe data format]. I need the output to be [specify desired output]. Include error handling and comments.
Help me perform [specify statistical analysis: regression, correlation, hypothesis testing, ANOVA, etc.] on my dataset. [Describe your data and variables]. Provide the Python code using scipy and statsmodels, including interpretation of results.
Review this Python data analysis code for efficiency, readability, and correctness: [paste your code]. Identify potential bugs, suggest optimizations, and recommend better approaches or libraries. Provide refactored code with explanations.
Visualization & Dashboarding
Design effective visualizations and dashboards that communicate insights clearly.
I have [describe your data: sales by region and time, customer segments, etc.]. I need to show [describe your analytical goal]. What visualization types would be most effective? Suggest 2-3 options with pros/cons of each.
Help me design a dashboard for [describe audience and purpose: executive dashboard, operational KPIs, customer analytics, etc.]. Key metrics include [list metrics]. Suggest layout, visualization types, drill-down capabilities, and refresh frequency.
I have these key findings from my analysis: [describe findings]. Help me craft a compelling data story with: talking points, recommended visuals, key statistics, and recommendations. Make it suitable for [describe audience: executive, team, investors].
Modeling & Forecasting
Build predictive models and forecasts to drive data-driven decisions.
Help me build a predictive model to [describe your prediction goal: forecast revenue, predict churn, identify high-value customers]. I have [describe data available]. Recommend an algorithm, suggest features, and outline model validation approach.
I need to forecast [describe what: monthly revenue, website traffic, demand, etc.] for the next [time period]. I have [describe historical data]. Suggest appropriate forecasting methods, explain seasonality/trends, and provide confidence intervals.
Help me engineer features for my predictive model. I have raw data including [describe available fields]. For predicting [describe target], what new features should I create? How should I handle categorical variables, missing values, and scaling?