All of lore.kernel.org
 help / color / mirror / Atom feed
From: Mike Dawson <mike.dawson@cloudapt.com>
To: Stefan Priebe <s.priebe@profihost.ag>,
	Samuel Just <sam.just@inktank.com>,
	"josh.durgin@inktank.com" <josh.durgin@inktank.com>,
	Oliver Francke <Oliver.Francke@filoo.de>
Cc: "ceph-devel@vger.kernel.org" <ceph-devel@vger.kernel.org>,
	Stefan Hajnoczi <stefanha@redhat.com>
Subject: Re: still recovery issues with cuttlefish
Date: Thu, 08 Aug 2013 10:05:11 -0400	[thread overview]
Message-ID: <5203A597.4060701@cloudapt.com> (raw)
In-Reply-To: <51FBFE85.5040700@profihost.ag>

Stefan,

I see the same behavior and I theorize it is linked to an issue detailed 
in another thread [0]. Do your VM guests ever hang while your cluster is 
HEALTH_OK like described in that other thread?

[0] http://comments.gmane.org/gmane.comp.file-systems.ceph.user/2982

A few observations:

- The VMs that hang do lots of writes (video surveillance).
- I use rbd and qemu. The problem exists in both qemu 1.4.x and 1.5.2.
- The problem exists with or without joshd's qemu async flush patch.
- Windows VMs seem to be more vulnerable than Linux VMs.
- If I restart the qemu-system-x86_64 process, the guest will come back 
to life.
- A partial workaround seems to be console input (NoVNC or 'virsh 
screenshot'), but restarting qemu-system-x86_64 works better.
- The issue of VMs hanging seems worse with RBD writeback cache enabled
- I typically run virtio, but I believe I've seen it with e1000, too.
- VM guests hang at different times, not all at once on a host (or 
across all hosts).
- I co-mingle VM guests on servers that host ceph OSDs.



Oliver,

If your cluster has to recover/backfill, do your guest VMs hang with 
more frequency than under normal HEALTH_OK conditions, even if you 
prioritize client IO as Sam wrote below?


Sam,

Turning down all the settings you mentioned certainly does slow the 
recover/backfill process, but it doesn't prevent the VM guests backed by 
RBD volumes from hanging. In fact, I often try to prioritize 
recovery/backfill because my guests tend to hang until I get back to 
HEALTH_OK. Given this apparent bug, completing recovery/backfill quicker 
leads to less total outage, it seems.


Josh,

How can I help you investigate if RBD is the common source of both of 
these issues?


Thanks,
Mike Dawson


On 8/2/2013 2:46 PM, Stefan Priebe wrote:
> Hi,
>
>          osd recovery max active = 1
>          osd max backfills = 1
>          osd recovery op priority = 5
>
> still no difference...
>
> Stefan
>
> Am 02.08.2013 20:21, schrieb Samuel Just:
>> Also, you have osd_recovery_op_priority at 50.  That is close to the
>> priority of client IO.  You want it below 10 (defaults to 10), perhaps
>> at 1.  You can also adjust down osd_recovery_max_active.
>> -Sam
>>
>> On Fri, Aug 2, 2013 at 11:16 AM, Stefan Priebe <s.priebe@profihost.ag>
>> wrote:
>>> I already tried both values this makes no difference. The drives are
>>> not the
>>> bottleneck.
>>>
>>> Am 02.08.2013 19:35, schrieb Samuel Just:
>>>
>>>> You might try turning osd_max_backfills to 2 or 1.
>>>> -Sam
>>>>
>>>> On Fri, Aug 2, 2013 at 12:44 AM, Stefan Priebe <s.priebe@profihost.ag>
>>>> wrote:
>>>>>
>>>>> Am 01.08.2013 23:23, schrieb Samuel Just:> Can you dump your osd
>>>>> settings?
>>>>>
>>>>>> sudo ceph --admin-daemon ceph-osd.<osdid>.asok config show
>>>>>
>>>>>
>>>>> Sure.
>>>>>
>>>>>
>>>>>
>>>>> { "name": "osd.0",
>>>>>     "cluster": "ceph",
>>>>>     "none": "0\/5",
>>>>>     "lockdep": "0\/0",
>>>>>     "context": "0\/0",
>>>>>     "crush": "0\/0",
>>>>>     "mds": "0\/0",
>>>>>     "mds_balancer": "0\/0",
>>>>>     "mds_locker": "0\/0",
>>>>>     "mds_log": "0\/0",
>>>>>     "mds_log_expire": "0\/0",
>>>>>     "mds_migrator": "0\/0",
>>>>>     "buffer": "0\/0",
>>>>>     "timer": "0\/0",
>>>>>     "filer": "0\/0",
>>>>>     "striper": "0\/1",
>>>>>     "objecter": "0\/0",
>>>>>     "rados": "0\/0",
>>>>>     "rbd": "0\/0",
>>>>>     "journaler": "0\/0",
>>>>>     "objectcacher": "0\/0",
>>>>>     "client": "0\/0",
>>>>>     "osd": "0\/0",
>>>>>     "optracker": "0\/0",
>>>>>     "objclass": "0\/0",
>>>>>     "filestore": "0\/0",
>>>>>     "journal": "0\/0",
>>>>>     "ms": "0\/0",
>>>>>     "mon": "0\/0",
>>>>>     "monc": "0\/0",
>>>>>     "paxos": "0\/0",
>>>>>     "tp": "0\/0",
>>>>>     "auth": "0\/0",
>>>>>     "crypto": "1\/5",
>>>>>     "finisher": "0\/0",
>>>>>     "heartbeatmap": "0\/0",
>>>>>     "perfcounter": "0\/0",
>>>>>     "rgw": "0\/0",
>>>>>     "hadoop": "0\/0",
>>>>>     "javaclient": "1\/5",
>>>>>     "asok": "0\/0",
>>>>>     "throttle": "0\/0",
>>>>>     "host": "cloud1-1268",
>>>>>     "fsid": "00000000-0000-0000-0000-000000000000",
>>>>>     "public_addr": "10.255.0.90:0\/0",
>>>>>     "cluster_addr": "10.255.0.90:0\/0",
>>>>>     "public_network": "10.255.0.1\/24",
>>>>>     "cluster_network": "10.255.0.1\/24",
>>>>>     "num_client": "1",
>>>>>     "monmap": "",
>>>>>     "mon_host": "",
>>>>>     "lockdep": "false",
>>>>>     "run_dir": "\/var\/run\/ceph",
>>>>>     "admin_socket": "\/var\/run\/ceph\/ceph-osd.0.asok",
>>>>>     "daemonize": "true",
>>>>>     "pid_file": "\/var\/run\/ceph\/osd.0.pid",
>>>>>     "chdir": "\/",
>>>>>     "max_open_files": "0",
>>>>>     "fatal_signal_handlers": "true",
>>>>>     "log_file": "\/var\/log\/ceph\/ceph-osd.0.log",
>>>>>     "log_max_new": "1000",
>>>>>     "log_max_recent": "10000",
>>>>>     "log_to_stderr": "false",
>>>>>     "err_to_stderr": "true",
>>>>>     "log_to_syslog": "false",
>>>>>     "err_to_syslog": "false",
>>>>>     "log_flush_on_exit": "true",
>>>>>     "log_stop_at_utilization": "0.97",
>>>>>     "clog_to_monitors": "true",
>>>>>     "clog_to_syslog": "false",
>>>>>     "clog_to_syslog_level": "info",
>>>>>     "clog_to_syslog_facility": "daemon",
>>>>>     "mon_cluster_log_to_syslog": "false",
>>>>>     "mon_cluster_log_to_syslog_level": "info",
>>>>>     "mon_cluster_log_to_syslog_facility": "daemon",
>>>>>     "mon_cluster_log_file": "\/var\/log\/ceph\/ceph.log",
>>>>>     "key": "",
>>>>>     "keyfile": "",
>>>>>     "keyring": "\/etc\/ceph\/osd.0.keyring",
>>>>>     "heartbeat_interval": "5",
>>>>>     "heartbeat_file": "",
>>>>>     "heartbeat_inject_failure": "0",
>>>>>     "perf": "true",
>>>>>     "ms_tcp_nodelay": "true",
>>>>>     "ms_tcp_rcvbuf": "0",
>>>>>     "ms_initial_backoff": "0.2",
>>>>>     "ms_max_backoff": "15",
>>>>>     "ms_nocrc": "false",
>>>>>     "ms_die_on_bad_msg": "false",
>>>>>     "ms_die_on_unhandled_msg": "false",
>>>>>     "ms_dispatch_throttle_bytes": "104857600",
>>>>>     "ms_bind_ipv6": "false",
>>>>>     "ms_bind_port_min": "6800",
>>>>>     "ms_bind_port_max": "7100",
>>>>>     "ms_rwthread_stack_bytes": "1048576",
>>>>>     "ms_tcp_read_timeout": "900",
>>>>>     "ms_pq_max_tokens_per_priority": "4194304",
>>>>>     "ms_pq_min_cost": "65536",
>>>>>     "ms_inject_socket_failures": "0",
>>>>>     "ms_inject_delay_type": "",
>>>>>     "ms_inject_delay_max": "1",
>>>>>     "ms_inject_delay_probability": "0",
>>>>>     "ms_inject_internal_delays": "0",
>>>>>     "mon_data": "\/var\/lib\/ceph\/mon\/ceph-0",
>>>>>     "mon_initial_members": "",
>>>>>     "mon_sync_fs_threshold": "5",
>>>>>     "mon_compact_on_start": "false",
>>>>>     "mon_compact_on_bootstrap": "false",
>>>>>     "mon_compact_on_trim": "true",
>>>>>     "mon_tick_interval": "5",
>>>>>     "mon_subscribe_interval": "300",
>>>>>     "mon_osd_laggy_halflife": "3600",
>>>>>     "mon_osd_laggy_weight": "0.3",
>>>>>     "mon_osd_adjust_heartbeat_grace": "true",
>>>>>     "mon_osd_adjust_down_out_interval": "true",
>>>>>     "mon_osd_auto_mark_in": "false",
>>>>>     "mon_osd_auto_mark_auto_out_in": "true",
>>>>>     "mon_osd_auto_mark_new_in": "true",
>>>>>     "mon_osd_down_out_interval": "300",
>>>>>     "mon_osd_down_out_subtree_limit": "rack",
>>>>>     "mon_osd_min_up_ratio": "0.3",
>>>>>     "mon_osd_min_in_ratio": "0.3",
>>>>>     "mon_stat_smooth_intervals": "2",
>>>>>     "mon_lease": "5",
>>>>>     "mon_lease_renew_interval": "3",
>>>>>     "mon_lease_ack_timeout": "10",
>>>>>     "mon_clock_drift_allowed": "0.05",
>>>>>     "mon_clock_drift_warn_backoff": "5",
>>>>>     "mon_timecheck_interval": "300",
>>>>>     "mon_accept_timeout": "10",
>>>>>     "mon_pg_create_interval": "30",
>>>>>     "mon_pg_stuck_threshold": "300",
>>>>>     "mon_osd_full_ratio": "0.95",
>>>>>     "mon_osd_nearfull_ratio": "0.85",
>>>>>     "mon_globalid_prealloc": "100",
>>>>>     "mon_osd_report_timeout": "900",
>>>>>     "mon_force_standby_active": "true",
>>>>>     "mon_min_osdmap_epochs": "500",
>>>>>     "mon_max_pgmap_epochs": "500",
>>>>>     "mon_max_log_epochs": "500",
>>>>>     "mon_max_osd": "10000",
>>>>>     "mon_probe_timeout": "2",
>>>>>     "mon_slurp_timeout": "10",
>>>>>     "mon_slurp_bytes": "262144",
>>>>>     "mon_client_bytes": "104857600",
>>>>>     "mon_daemon_bytes": "419430400",
>>>>>     "mon_max_log_entries_per_event": "4096",
>>>>>     "mon_health_data_update_interval": "60",
>>>>>     "mon_data_avail_crit": "5",
>>>>>     "mon_data_avail_warn": "30",
>>>>>     "mon_config_key_max_entry_size": "4096",
>>>>>     "mon_sync_trim_timeout": "30",
>>>>>     "mon_sync_heartbeat_timeout": "30",
>>>>>     "mon_sync_heartbeat_interval": "5",
>>>>>     "mon_sync_backoff_timeout": "30",
>>>>>     "mon_sync_timeout": "30",
>>>>>     "mon_sync_max_retries": "5",
>>>>>     "mon_sync_max_payload_size": "1048576",
>>>>>     "mon_sync_debug": "false",
>>>>>     "mon_sync_debug_leader": "-1",
>>>>>     "mon_sync_debug_provider": "-1",
>>>>>     "mon_sync_debug_provider_fallback": "-1",
>>>>>     "mon_debug_dump_transactions": "false",
>>>>>     "mon_debug_dump_location": "\/var\/log\/ceph\/ceph-osd.0.tdump",
>>>>>     "mon_sync_leader_kill_at": "0",
>>>>>     "mon_sync_provider_kill_at": "0",
>>>>>     "mon_sync_requester_kill_at": "0",
>>>>>     "mon_leveldb_write_buffer_size": "33554432",
>>>>>     "mon_leveldb_cache_size": "268435456",
>>>>>     "mon_leveldb_block_size": "65536",
>>>>>     "mon_leveldb_bloom_size": "0",
>>>>>     "mon_leveldb_max_open_files": "0",
>>>>>     "mon_leveldb_compression": "false",
>>>>>     "mon_leveldb_paranoid": "false",
>>>>>     "mon_leveldb_log": "",
>>>>>     "paxos_stash_full_interval": "25",
>>>>>     "paxos_max_join_drift": "100",
>>>>>     "paxos_propose_interval": "1",
>>>>>     "paxos_min_wait": "0.05",
>>>>>     "paxos_min": "500",
>>>>>     "paxos_trim_min": "500",
>>>>>     "paxos_trim_max": "1000",
>>>>>     "paxos_trim_disabled_max_versions": "108000",
>>>>>     "paxos_service_trim_min": "500",
>>>>>     "paxos_service_trim_max": "1000",
>>>>>     "clock_offset": "0",
>>>>>     "auth_cluster_required": "none",
>>>>>     "auth_service_required": "none",
>>>>>     "auth_client_required": "none",
>>>>>     "auth_supported": "none",
>>>>>     "cephx_require_signatures": "false",
>>>>>     "cephx_cluster_require_signatures": "false",
>>>>>     "cephx_service_require_signatures": "false",
>>>>>     "cephx_sign_messages": "true",
>>>>>     "auth_mon_ticket_ttl": "43200",
>>>>>     "auth_service_ticket_ttl": "3600",
>>>>>     "auth_debug": "false",
>>>>>     "mon_client_hunt_interval": "3",
>>>>>     "mon_client_ping_interval": "10",
>>>>>     "mon_client_max_log_entries_per_message": "1000",
>>>>>     "mon_max_pool_pg_num": "65536",
>>>>>     "mon_pool_quota_warn_threshold": "0",
>>>>>     "mon_pool_quota_crit_threshold": "0",
>>>>>     "client_cache_size": "16384",
>>>>>     "client_cache_mid": "0.75",
>>>>>     "client_use_random_mds": "false",
>>>>>     "client_mount_timeout": "300",
>>>>>     "client_tick_interval": "1",
>>>>>     "client_trace": "",
>>>>>     "client_readahead_min": "131072",
>>>>>     "client_readahead_max_bytes": "0",
>>>>>     "client_readahead_max_periods": "4",
>>>>>     "client_snapdir": ".snap",
>>>>>     "client_mountpoint": "\/",
>>>>>     "client_notify_timeout": "10",
>>>>>     "client_caps_release_delay": "5",
>>>>>     "client_oc": "true",
>>>>>     "client_oc_size": "209715200",
>>>>>     "client_oc_max_dirty": "104857600",
>>>>>     "client_oc_target_dirty": "8388608",
>>>>>     "client_oc_max_dirty_age": "5",
>>>>>     "client_oc_max_objects": "1000",
>>>>>     "client_debug_force_sync_read": "false",
>>>>>     "client_debug_inject_tick_delay": "0",
>>>>>     "fuse_use_invalidate_cb": "false",
>>>>>     "fuse_allow_other": "true",
>>>>>     "fuse_default_permissions": "true",
>>>>>     "fuse_big_writes": "true",
>>>>>     "fuse_atomic_o_trunc": "true",
>>>>>     "fuse_debug": "false",
>>>>>     "objecter_tick_interval": "5",
>>>>>     "objecter_timeout": "10",
>>>>>     "objecter_inflight_op_bytes": "104857600",
>>>>>     "objecter_inflight_ops": "1024",
>>>>>     "journaler_allow_split_entries": "true",
>>>>>     "journaler_write_head_interval": "15",
>>>>>     "journaler_prefetch_periods": "10",
>>>>>     "journaler_prezero_periods": "5",
>>>>>     "journaler_batch_interval": "0.001",
>>>>>     "journaler_batch_max": "0",
>>>>>     "mds_data": "\/var\/lib\/ceph\/mds\/ceph-0",
>>>>>     "mds_max_file_size": "1099511627776",
>>>>>     "mds_cache_size": "100000",
>>>>>     "mds_cache_mid": "0.7",
>>>>>     "mds_mem_max": "1048576",
>>>>>     "mds_dir_commit_ratio": "0.5",
>>>>>     "mds_dir_max_commit_size": "90",
>>>>>     "mds_decay_halflife": "5",
>>>>>     "mds_beacon_interval": "4",
>>>>>     "mds_beacon_grace": "15",
>>>>>     "mds_enforce_unique_name": "true",
>>>>>     "mds_blacklist_interval": "1440",
>>>>>     "mds_session_timeout": "60",
>>>>>     "mds_session_autoclose": "300",
>>>>>     "mds_reconnect_timeout": "45",
>>>>>     "mds_tick_interval": "5",
>>>>>     "mds_dirstat_min_interval": "1",
>>>>>     "mds_scatter_nudge_interval": "5",
>>>>>     "mds_client_prealloc_inos": "1000",
>>>>>     "mds_early_reply": "true",
>>>>>     "mds_use_tmap": "true",
>>>>>     "mds_default_dir_hash": "2",
>>>>>     "mds_log": "true",
>>>>>     "mds_log_skip_corrupt_events": "false",
>>>>>     "mds_log_max_events": "-1",
>>>>>     "mds_log_segment_size": "0",
>>>>>     "mds_log_max_segments": "30",
>>>>>     "mds_log_max_expiring": "20",
>>>>>     "mds_bal_sample_interval": "3",
>>>>>     "mds_bal_replicate_threshold": "8000",
>>>>>     "mds_bal_unreplicate_threshold": "0",
>>>>>     "mds_bal_frag": "false",
>>>>>     "mds_bal_split_size": "10000",
>>>>>     "mds_bal_split_rd": "25000",
>>>>>     "mds_bal_split_wr": "10000",
>>>>>     "mds_bal_split_bits": "3",
>>>>>     "mds_bal_merge_size": "50",
>>>>>     "mds_bal_merge_rd": "1000",
>>>>>     "mds_bal_merge_wr": "1000",
>>>>>     "mds_bal_interval": "10",
>>>>>     "mds_bal_fragment_interval": "5",
>>>>>     "mds_bal_idle_threshold": "0",
>>>>>     "mds_bal_max": "-1",
>>>>>     "mds_bal_max_until": "-1",
>>>>>     "mds_bal_mode": "0",
>>>>>     "mds_bal_min_rebalance": "0.1",
>>>>>     "mds_bal_min_start": "0.2",
>>>>>     "mds_bal_need_min": "0.8",
>>>>>     "mds_bal_need_max": "1.2",
>>>>>     "mds_bal_midchunk": "0.3",
>>>>>     "mds_bal_minchunk": "0.001",
>>>>>     "mds_bal_target_removal_min": "5",
>>>>>     "mds_bal_target_removal_max": "10",
>>>>>     "mds_replay_interval": "1",
>>>>>     "mds_shutdown_check": "0",
>>>>>     "mds_thrash_exports": "0",
>>>>>     "mds_thrash_fragments": "0",
>>>>>     "mds_dump_cache_on_map": "false",
>>>>>     "mds_dump_cache_after_rejoin": "false",
>>>>>     "mds_verify_scatter": "false",
>>>>>     "mds_debug_scatterstat": "false",
>>>>>     "mds_debug_frag": "false",
>>>>>     "mds_debug_auth_pins": "false",
>>>>>     "mds_debug_subtrees": "false",
>>>>>     "mds_kill_mdstable_at": "0",
>>>>>     "mds_kill_export_at": "0",
>>>>>     "mds_kill_import_at": "0",
>>>>>     "mds_kill_link_at": "0",
>>>>>     "mds_kill_rename_at": "0",
>>>>>     "mds_kill_openc_at": "0",
>>>>>     "mds_kill_journal_at": "0",
>>>>>     "mds_kill_journal_expire_at": "0",
>>>>>     "mds_kill_journal_replay_at": "0",
>>>>>     "mds_inject_traceless_reply_probability": "0",
>>>>>     "mds_wipe_sessions": "false",
>>>>>     "mds_wipe_ino_prealloc": "false",
>>>>>     "mds_skip_ino": "0",
>>>>>     "max_mds": "1",
>>>>>     "mds_standby_for_name": "",
>>>>>     "mds_standby_for_rank": "-1",
>>>>>     "mds_standby_replay": "false",
>>>>>     "osd_auto_upgrade_tmap": "true",
>>>>>     "osd_tmapput_sets_uses_tmap": "false",
>>>>>     "osd_max_backfills": "5",
>>>>>     "osd_backfill_full_ratio": "0.85",
>>>>>     "osd_backfill_retry_interval": "10",
>>>>>     "osd_uuid": "00000000-0000-0000-0000-000000000000",
>>>>>     "osd_data": "\/ceph\/osd.0\/",
>>>>>     "osd_journal": "\/dev\/disk\/by-partlabel\/journalosd0",
>>>>>     "osd_journal_size": "5120",
>>>>>     "osd_max_write_size": "90",
>>>>>     "osd_max_pgls": "1024",
>>>>>     "osd_client_message_size_cap": "524288000",
>>>>>     "osd_client_message_cap": "100",
>>>>>     "osd_pg_bits": "6",
>>>>>     "osd_pgp_bits": "6",
>>>>>     "osd_crush_chooseleaf_type": "1",
>>>>>     "osd_min_rep": "1",
>>>>>     "osd_max_rep": "10",
>>>>>     "osd_pool_default_crush_rule": "0",
>>>>>     "osd_pool_default_size": "2",
>>>>>     "osd_pool_default_min_size": "0",
>>>>>     "osd_pool_default_pg_num": "8",
>>>>>     "osd_pool_default_pgp_num": "8",
>>>>>     "osd_pool_default_flags": "0",
>>>>>     "osd_map_dedup": "true",
>>>>>     "osd_map_cache_size": "500",
>>>>>     "osd_map_message_max": "100",
>>>>>     "osd_map_share_max_epochs": "100",
>>>>>     "osd_op_threads": "2",
>>>>>     "osd_peering_wq_batch_size": "20",
>>>>>     "osd_op_pq_max_tokens_per_priority": "4194304",
>>>>>     "osd_op_pq_min_cost": "65536",
>>>>>     "osd_disk_threads": "1",
>>>>>     "osd_recovery_threads": "2",
>>>>>     "osd_recover_clone_overlap": "true",
>>>>>     "osd_backfill_scan_min": "64",
>>>>>     "osd_backfill_scan_max": "512",
>>>>>     "osd_op_thread_timeout": "15",
>>>>>     "osd_recovery_thread_timeout": "30",
>>>>>     "osd_snap_trim_thread_timeout": "3600",
>>>>>     "osd_scrub_thread_timeout": "60",
>>>>>     "osd_scrub_finalize_thread_timeout": "600",
>>>>>     "osd_remove_thread_timeout": "3600",
>>>>>     "osd_command_thread_timeout": "600",
>>>>>     "osd_age": "0.8",
>>>>>     "osd_age_time": "0",
>>>>>     "osd_heartbeat_addr": ":\/0",
>>>>>     "osd_heartbeat_interval": "6",
>>>>>     "osd_heartbeat_grace": "20",
>>>>>     "osd_mon_heartbeat_interval": "30",
>>>>>     "osd_mon_report_interval_max": "120",
>>>>>     "osd_mon_report_interval_min": "5",
>>>>>     "osd_pg_stat_report_interval_max": "500",
>>>>>     "osd_mon_ack_timeout": "30",
>>>>>     "osd_min_down_reporters": "1",
>>>>>     "osd_min_down_reports": "3",
>>>>>     "osd_default_data_pool_replay_window": "45",
>>>>>     "osd_preserve_trimmed_log": "false",
>>>>>     "osd_auto_mark_unfound_lost": "false",
>>>>>     "osd_recovery_delay_start": "0",
>>>>>     "osd_recovery_max_active": "5",
>>>>>     "osd_recovery_max_chunk": "8388608",
>>>>>     "osd_recovery_forget_lost_objects": "false",
>>>>>     "osd_max_scrubs": "1",
>>>>>     "osd_scrub_load_threshold": "0.5",
>>>>>     "osd_scrub_min_interval": "86400",
>>>>>     "osd_scrub_max_interval": "604800",
>>>>>     "osd_deep_scrub_interval": "604800",
>>>>>     "osd_deep_scrub_stride": "524288",
>>>>>     "osd_scan_list_ping_tp_interval": "100",
>>>>>     "osd_auto_weight": "false",
>>>>>     "osd_class_dir": "\/usr\/lib\/rados-classes",
>>>>>     "osd_check_for_log_corruption": "false",
>>>>>     "osd_use_stale_snap": "false",
>>>>>     "osd_rollback_to_cluster_snap": "",
>>>>>     "osd_default_notify_timeout": "30",
>>>>>     "osd_kill_backfill_at": "0",
>>>>>     "osd_pg_epoch_persisted_max_stale": "200",
>>>>>     "osd_min_pg_log_entries": "500",
>>>>>     "osd_max_pg_log_entries": "1500",
>>>>>     "osd_op_complaint_time": "30",
>>>>>     "osd_command_max_records": "256",
>>>>>     "osd_op_log_threshold": "5",
>>>>>     "osd_verify_sparse_read_holes": "false",
>>>>>     "osd_debug_drop_ping_probability": "0",
>>>>>     "osd_debug_drop_ping_duration": "0",
>>>>>     "osd_debug_drop_pg_create_probability": "0",
>>>>>     "osd_debug_drop_pg_create_duration": "1",
>>>>>     "osd_debug_drop_op_probability": "0",
>>>>>     "osd_debug_op_order": "false",
>>>>>     "osd_debug_verify_snaps_on_info": "false",
>>>>>     "osd_debug_verify_stray_on_activate": "false",
>>>>>     "osd_debug_skip_full_check_in_backfill_reservation": "false",
>>>>>     "osd_op_history_size": "20",
>>>>>     "osd_op_history_duration": "600",
>>>>>     "osd_target_transaction_size": "30",
>>>>>     "osd_failsafe_full_ratio": "0.97",
>>>>>     "osd_failsafe_nearfull_ratio": "0.9",
>>>>>     "osd_leveldb_write_buffer_size": "0",
>>>>>     "osd_leveldb_cache_size": "0",
>>>>>     "osd_leveldb_block_size": "0",
>>>>>     "osd_leveldb_bloom_size": "0",
>>>>>     "osd_leveldb_max_open_files": "0",
>>>>>     "osd_leveldb_compression": "true",
>>>>>     "osd_leveldb_paranoid": "false",
>>>>>     "osd_leveldb_log": "",
>>>>>     "osd_client_op_priority": "63",
>>>>>     "osd_recovery_op_priority": "50",
>>>>>     "osd_mon_shutdown_timeout": "5",
>>>>>     "filestore": "false",
>>>>>     "filestore_index_retry_probability": "0",
>>>>>     "filestore_debug_inject_read_err": "false",
>>>>>     "filestore_debug_omap_check": "false",
>>>>>     "filestore_xattr_use_omap": "false",
>>>>>     "filestore_max_inline_xattr_size": "512",
>>>>>     "filestore_max_inline_xattrs": "2",
>>>>>     "filestore_max_sync_interval": "5",
>>>>>     "filestore_min_sync_interval": "0.01",
>>>>>     "filestore_btrfs_snap": "true",
>>>>>     "filestore_btrfs_clone_range": "true",
>>>>>     "filestore_fsync_flushes_journal_data": "false",
>>>>>     "filestore_fiemap": "false",
>>>>>     "filestore_flusher": "true",
>>>>>     "filestore_flusher_max_fds": "512",
>>>>>     "filestore_flush_min": "65536",
>>>>>     "filestore_sync_flush": "false",
>>>>>     "filestore_journal_parallel": "false",
>>>>>     "filestore_journal_writeahead": "false",
>>>>>     "filestore_journal_trailing": "false",
>>>>>     "filestore_queue_max_ops": "500",
>>>>>     "filestore_queue_max_bytes": "104857600",
>>>>>     "filestore_queue_committing_max_ops": "5000",
>>>>>     "filestore_queue_committing_max_bytes": "104857600",
>>>>>     "filestore_op_threads": "2",
>>>>>     "filestore_op_thread_timeout": "60",
>>>>>     "filestore_op_thread_suicide_timeout": "180",
>>>>>     "filestore_commit_timeout": "600",
>>>>>     "filestore_fiemap_threshold": "4096",
>>>>>     "filestore_merge_threshold": "10",
>>>>>     "filestore_split_multiple": "2",
>>>>>     "filestore_update_to": "1000",
>>>>>     "filestore_blackhole": "false",
>>>>>     "filestore_dump_file": "",
>>>>>     "filestore_kill_at": "0",
>>>>>     "filestore_inject_stall": "0",
>>>>>     "filestore_fail_eio": "true",
>>>>>     "filestore_replica_fadvise": "true",
>>>>>     "filestore_debug_verify_split": "false",
>>>>>     "journal_dio": "true",
>>>>>     "journal_aio": "true",
>>>>>     "journal_force_aio": "false",
>>>>>     "journal_max_corrupt_search": "10485760",
>>>>>     "journal_block_align": "true",
>>>>>     "journal_write_header_frequency": "0",
>>>>>     "journal_max_write_bytes": "10485760",
>>>>>     "journal_max_write_entries": "100",
>>>>>     "journal_queue_max_ops": "300",
>>>>>     "journal_queue_max_bytes": "33554432",
>>>>>     "journal_align_min_size": "65536",
>>>>>     "journal_replay_from": "0",
>>>>>     "journal_zero_on_create": "false",
>>>>>     "journal_ignore_corruption": "false",
>>>>>     "rbd_cache": "false",
>>>>>     "rbd_cache_writethrough_until_flush": "false",
>>>>>     "rbd_cache_size": "33554432",
>>>>>     "rbd_cache_max_dirty": "25165824",
>>>>>     "rbd_cache_target_dirty": "16777216",
>>>>>     "rbd_cache_max_dirty_age": "1",
>>>>>     "rbd_cache_block_writes_upfront": "false",
>>>>>     "rbd_concurrent_management_ops": "10",
>>>>>     "rbd_default_format": "1",
>>>>>     "rbd_default_order": "22",
>>>>>     "rbd_default_stripe_count": "1",
>>>>>     "rbd_default_stripe_unit": "4194304",
>>>>>     "rbd_default_features": "3",
>>>>>     "nss_db_path": "",
>>>>>     "rgw_data": "\/var\/lib\/ceph\/radosgw\/ceph-0",
>>>>>     "rgw_enable_apis": "s3, swift, swift_auth, admin",
>>>>>     "rgw_cache_enabled": "true",
>>>>>     "rgw_cache_lru_size": "10000",
>>>>>     "rgw_socket_path": "",
>>>>>     "rgw_host": "",
>>>>>     "rgw_port": "",
>>>>>     "rgw_dns_name": "",
>>>>>     "rgw_script_uri": "",
>>>>>     "rgw_request_uri": "",
>>>>>     "rgw_swift_url": "",
>>>>>     "rgw_swift_url_prefix": "swift",
>>>>>     "rgw_swift_auth_url": "",
>>>>>     "rgw_swift_auth_entry": "auth",
>>>>>     "rgw_keystone_url": "",
>>>>>     "rgw_keystone_admin_token": "",
>>>>>     "rgw_keystone_accepted_roles": "Member, admin",
>>>>>     "rgw_keystone_token_cache_size": "10000",
>>>>>     "rgw_keystone_revocation_interval": "900",
>>>>>     "rgw_admin_entry": "admin",
>>>>>     "rgw_enforce_swift_acls": "true",
>>>>>     "rgw_swift_token_expiration": "86400",
>>>>>     "rgw_print_continue": "true",
>>>>>     "rgw_remote_addr_param": "REMOTE_ADDR",
>>>>>     "rgw_op_thread_timeout": "600",
>>>>>     "rgw_op_thread_suicide_timeout": "0",
>>>>>     "rgw_thread_pool_size": "100",
>>>>>     "rgw_num_control_oids": "8",
>>>>>     "rgw_zone_root_pool": ".rgw.root",
>>>>>     "rgw_log_nonexistent_bucket": "false",
>>>>>     "rgw_log_object_name": "%Y-%m-%d-%H-%i-%n",
>>>>>     "rgw_log_object_name_utc": "false",
>>>>>     "rgw_usage_max_shards": "32",
>>>>>     "rgw_usage_max_user_shards": "1",
>>>>>     "rgw_enable_ops_log": "false",
>>>>>     "rgw_enable_usage_log": "false",
>>>>>     "rgw_ops_log_rados": "true",
>>>>>     "rgw_ops_log_socket_path": "",
>>>>>     "rgw_ops_log_data_backlog": "5242880",
>>>>>     "rgw_usage_log_flush_threshold": "1024",
>>>>>     "rgw_usage_log_tick_interval": "30",
>>>>>     "rgw_intent_log_object_name": "%Y-%m-%d-%i-%n",
>>>>>     "rgw_intent_log_object_name_utc": "false",
>>>>>     "rgw_init_timeout": "300",
>>>>>     "rgw_mime_types_file": "\/etc\/mime.types",
>>>>>     "rgw_gc_max_objs": "32",
>>>>>     "rgw_gc_obj_min_wait": "7200",
>>>>>     "rgw_gc_processor_max_time": "3600",
>>>>>     "rgw_gc_processor_period": "3600",
>>>>>     "rgw_s3_success_create_obj_status": "0",
>>>>>     "rgw_resolve_cname": "false",
>>>>>     "rgw_obj_stripe_size": "4194304",
>>>>>     "rgw_extended_http_attrs": "",
>>>>>     "rgw_exit_timeout_secs": "120",
>>>>>     "rgw_get_obj_window_size": "16777216",
>>>>>     "rgw_get_obj_max_req_size": "4194304",
>>>>>     "rgw_relaxed_s3_bucket_names": "false",
>>>>>     "rgw_list_buckets_max_chunk": "1000",
>>>>>     "mutex_perf_counter": "false",
>>>>>     "internal_safe_to_start_threads": "true"}
>>>>>
>>>>>
>>>>>
>>>>> Stefan
>>>>>
>>>>>
>>>>>> -Sam
>>>>>>
>>>>>> On Thu, Aug 1, 2013 at 12:07 PM, Stefan Priebe
>>>>>> <s.priebe@profihost.ag>
>>>>>> wrote:
>>>>>>>
>>>>>>>
>>>>>>> Mike we already have the async patch running. Yes it helps but only
>>>>>>> helps
>>>>>>> it
>>>>>>> does not solve. It just hides the issue ...
>>>>>>> Am 01.08.2013 20:54, schrieb Mike Dawson:
>>>>>>>
>>>>>>>> I am also seeing recovery issues with 0.61.7. Here's the process:
>>>>>>>>
>>>>>>>> - ceph osd set noout
>>>>>>>>
>>>>>>>> - Reboot one of the nodes hosting OSDs
>>>>>>>>         - VMs mounted from RBD volumes work properly
>>>>>>>>
>>>>>>>> - I see the OSD's boot messages as they re-join the cluster
>>>>>>>>
>>>>>>>> - Start seeing active+recovery_wait, peering, and active+recovering
>>>>>>>>         - VMs mounted from RBD volumes become unresponsive.
>>>>>>>>
>>>>>>>> - Recovery completes
>>>>>>>>         - VMs mounted from RBD volumes regain responsiveness
>>>>>>>>
>>>>>>>> - ceph osd unset noout
>>>>>>>>
>>>>>>>> Would joshd's async patch for qemu help here, or is there something
>>>>>>>> else
>>>>>>>> going on?
>>>>>>>>
>>>>>>>> Output of ceph -w at: http://pastebin.com/raw.php?i=JLcZYFzY
>>>>>>>>
>>>>>>>> Thanks,
>>>>>>>>
>>>>>>>> Mike Dawson
>>>>>>>> Co-Founder & Director of Cloud Architecture
>>>>>>>> Cloudapt LLC
>>>>>>>> 6330 East 75th Street, Suite 170
>>>>>>>> Indianapolis, IN 46250
>>>>>>>>
>>>>>>>> On 8/1/2013 2:34 PM, Samuel Just wrote:
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Can you reproduce and attach the ceph.log from before you stop the
>>>>>>>>> osd
>>>>>>>>> until after you have started the osd and it has recovered?
>>>>>>>>> -Sam
>>>>>>>>>
>>>>>>>>> On Thu, Aug 1, 2013 at 1:22 AM, Stefan Priebe - Profihost AG
>>>>>>>>> <s.priebe@profihost.ag> wrote:
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Hi,
>>>>>>>>>>
>>>>>>>>>> i still have recovery issues with cuttlefish. After the OSD comes
>>>>>>>>>> back
>>>>>>>>>> it seem to hang for around 2-4 minutes and then recovery seems to
>>>>>>>>>> start
>>>>>>>>>> (pgs in recovery_wait start to decrement). This is with ceph
>>>>>>>>>> 0.61.7.
>>>>>>>>>> I
>>>>>>>>>> get a lot of slow request messages an hanging VMs.
>>>>>>>>>>
>>>>>>>>>> What i noticed today is that if i leave the OSD off as long as
>>>>>>>>>> ceph
>>>>>>>>>> starts to backfill - the recovery and "re" backfilling wents
>>>>>>>>>> absolutely
>>>>>>>>>> smooth without any issues and no slow request messages at all.
>>>>>>>>>>
>>>>>>>>>> Does anybody have an idea why?
>>>>>>>>>>
>>>>>>>>>> Greets,
>>>>>>>>>> Stefan
>>>>>>>>>> --
>>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>>>> ceph-devel"
>>>>>>>>>> in
>>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>>> More majordomo info at
>>>>>>>>>> http://vger.kernel.org/majordomo-info.html
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> --
>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>>> ceph-devel"
>>>>>>>>> in
>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>>>>
>>>>>>>
>>>>>> --
>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>> ceph-devel" in
>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>
>>>>>
>>>

  reply	other threads:[~2013-08-08 14:05 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-08-01  8:22 still recovery issues with cuttlefish Stefan Priebe - Profihost AG
2013-08-01 14:50 ` Andrey Korolyov
2013-08-01 18:38   ` Samuel Just
2013-08-02 17:56     ` Andrey Korolyov
2013-08-01 18:34 ` Samuel Just
2013-08-01 18:34   ` Stefan Priebe
2013-08-01 18:36     ` Samuel Just
2013-08-01 18:36       ` Samuel Just
2013-08-01 18:46       ` Stefan Priebe
2013-08-01 18:54   ` Mike Dawson
2013-08-01 19:07     ` Stefan Priebe
2013-08-01 21:23       ` Samuel Just
2013-08-02  7:44         ` Stefan Priebe
2013-08-02 17:35           ` Samuel Just
2013-08-02 18:16             ` Stefan Priebe
2013-08-02 18:21               ` Samuel Just
2013-08-02 18:46                 ` Stefan Priebe
2013-08-08 14:05                   ` Mike Dawson [this message]
2013-08-08 15:43                     ` Oliver Francke
2013-08-08 18:13                     ` Stefan Priebe
2013-08-09 21:44                       ` Samuel Just
2013-08-10 19:08                         ` Stefan Priebe
2013-08-11  3:50                           ` Samuel Just
2013-08-13  4:39                             ` Stefan Priebe - Profihost AG
2013-08-13  5:34                               ` Samuel Just
2013-08-13 20:43                                 ` Samuel Just
2013-08-13 21:03                                   ` Stefan Priebe - Profihost AG
2013-08-13 23:11                                     ` Samuel Just
2013-08-14  7:04                                       ` Stefan Priebe - Profihost AG
2013-08-21 15:28                                         ` Mike Dawson
2013-08-21 15:32                                           ` Samuel Just
2013-08-21 16:25                                             ` Yann ROBIN
2013-08-21 17:12                                               ` Samuel Just
2013-08-21 17:55                                                 ` Mike Dawson
2013-08-21 18:05                                                   ` Samuel Just
2013-08-21 18:21                                             ` Stefan Priebe
2013-08-21 19:13                                               ` Samuel Just
2013-08-21 19:37                                                 ` Stefan Priebe
2013-08-22  3:34                                                   ` Samuel Just
2013-08-22  7:41                                                     ` Stefan Priebe - Profihost AG

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5203A597.4060701@cloudapt.com \
    --to=mike.dawson@cloudapt.com \
    --cc=Oliver.Francke@filoo.de \
    --cc=ceph-devel@vger.kernel.org \
    --cc=josh.durgin@inktank.com \
    --cc=s.priebe@profihost.ag \
    --cc=sam.just@inktank.com \
    --cc=stefanha@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.